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INTENDED AUDIENCE 


This is an introductory textbook for a first course in applied statistics and probability for undergraduate 
students in engineering and the physical or chemical sciences. These individuals play a significant role in 
designing and developing new products and manufacturing systems and processes, and they also improve 
existing systems. Statistical methods are an important tool in these activities because they provide the en- 
gineer with both descriptive and analytical methods for dealing with the variability in observed data. 
Although many of the methods we present are fundamental to statistical analysis in other disciplines, such 
as business and management, the life sciences, and the social sciences, we have elected to focus on an 
engineering-oriented audience. We believe that this approach will best serve students in engineering and 
the chemical/physical sciences and will allow them to concentrate on the many applications of statistics 
in these disciplines. We have worked hard to ensure that our examples and exercises are engineering- and 
science-based, and in almost all cases we have used examples of real data—either taken from a published 
source or based on our consulting experiences. 

We believe that engineers in all disciplines should take at least one course in statistics. 
Unfortunately, because of other requirements, most engineers will only take one statistics course. This 
book can be used for a single course, although we have provided enough material for two courses in the 
hope that more students will see the important applications of statistics in their everyday work and elect 
a second course. We believe that this book will also serve as a useful reference. 

We have retained the relatively modest mathematical level of the first four editions. We have found 
that engineering students who have completed one or two semesters of calculus should have no difficulty 
reading almost all of the text. It is our intent to give the reader an understanding of the methodology and 
how to apply it, not the mathematical theory. We have made many enhancements in this edition, including 
reorganizing and rewriting major portions of the book and adding a number of new exercises. 


ORGANIZATION OF THE BOOK 


Perhaps the most common criticism of engineering statistics texts is that they are too long. Both instructors 
and students complain that it is impossible to cover all of the topics in the book in one or even two terms. For 
authors, this is a serious issue because there is great variety in both the content and level of these courses, and 
the decisions about what material to delete without limiting the value of the text are not easy. Decisions about 
which topics to include in this edition were made based on a survey of instructors. 

Chapter 1 is an introduction to the field of statistics and how engineers use statistical methodology as 
part of the engineering problem-solving process. This chapter also introduces the reader to some engineer- 
ing applications of statistics, including building empirical models, designing engineering experiments, and 
monitoring manufacturing processes. These topics are discussed in more depth in subsequent chapters. 

Chapters 2, 3, 4, and 5 cover the basic concepts of probability, discrete and continuous random vari- 
ables, probability distributions, expected values, joint probability distributions, and independence. We 
have given a reasonably complete treatment of these topics but have avoided many of the mathematical 
or more theoretical details. 

Chapter 6 begins the treatment of statistical methods with random sampling; data summary and de- 
scription techniques, including stem-and-leaf plots, histograms, box plots, and probability plotting; and 
several types of time series plots. Chapter 7 discusses sampling distributions, the central limit theorem, 
and point estimation of parameters. This chapter also introduces some of the important properties of esti- 
mators, the method of maximum likelihood, the method of moments, and Bayesian estimation. 

Chapter 8 discusses interval estimation for a single sample. Topics included are confidence intervals for 
means, variances or standard deviations, proportions, prediction intervals, and tolerance intervals. Chapter 9 
discusses hypothesis tests for a single sample. Chapter 10 presents tests and confidence intervals for two 
samples. This material has been extensively rewritten and reorganized. There is detailed information and 
examples of methods for determining appropriate sample sizes. We want the student to become familiar with 
how these techniques are used to solve real-world engineering problems and to get some understanding of 
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the concepts behind them. We give a logical, heuristic development of the procedures rather than a formal, 
mathematical one. We have also included some material on nonparametric methods in these chapters. 

Chapters 11 and 12 present simple and multiple linear regression including model adequacy checking and 
regression model diagnostics and an introduction to logistic regression. We use matrix algebra throughout the 
multiple regression material (Chapter 12) because it is the only easy way to understand the concepts presented. 
Scalar arithmetic presentations of multiple regression are awkward at best, and we have found that undergrad- 
uate engineers are exposed to enough matrix algebra to understand the presentation of this material. 

Chapters 13 and 14 deal with single- and multifactor experiments, respectively. The notions of ran- 
domization, blocking, factorial designs, interactions, graphical data analysis, and fractional factorials are 
emphasized. Chapter 15 introduces statistical quality control, emphasizing the control chart and the fun- 
damentals of statistical process control. 


WHAT’S NEW IN THIS EDITION? 


We received much feedback from users of the fourth edition of the book, and in response we have made 
substantial changes in this new edition. 
¢ The most obvious change is that the chapter on nonparametric methods is gone. We have inte- 


grated most of this material into Chapter 9 and 10 on statistical hypothesis testing, where we 
think it is a much better fit if instructors elect to cover these techniques. 


e Another substantial change is the increased emphasis on the use of P-value in hypothesis test- 
ing. Many sections of several chapters were rewritten to reflect this. 


¢ We have also rewritten and modified many portions of the book to improve the explanations and 
try to make the concepts easier to understand. 


¢ We have added brief comments at the end of examples to emphasize the practical interpretations 
of the results. 


¢ We have also added approximately 200 new homework exercises. 


FEATURED IN THIS BOOK 


Definitions, Key Concepts, and Equations 

Throughout the text, definitions and key con- 
cepts and equations are highlighted by a box 
to emphasize their importance. 


Learning Objectives 

Learning Objectives at the start 
of each chapter guide the F , 
students in what they are 

expected to take away from this 
chapter and serve as a study reference. 


MEAN AND VARIANCE OF A CONTINUOUS 
RANDOM VARIABLE 


The mean and variance can also be defined for a continuous random variable. Integration 
replaces summation in the discrete definitions. If a probability density function is viewed as a 
loading on a beam as in Fig. 4-1, the mean is the balance point. 


Mean 
_ and | Suppose X is a continuous random variable with probability density function f(x). 
Variance The mean or expected value of X, denoted as 2 or E(X), is 


p= E(X) = | Axflx) de (4-4) 


The variance of X, denoted as V(X) or 0°, is 


w= nay = | (= wine) de =f xMayae— Ww 


The standard deviation of Xiso = Vo". 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 

1. Determine probabilities from probability density functions 

2. Determine probabilities from cumulative distribution functions and cumulative distribution fune- 
tions from probability density functions, and the reverse 

3. Calculate means and variances for continuous random variables » 

4. Understand the assumptions for some common continuous probability distributions 


The equivalence of the two formulas for variance can be derived from the same approach used 
for discrete random variables. 


tric Current 


asurement in Example 4-1, the mean The variance of X is 


20 
(x — 10)7f(x) dv = 0.05(x — 10/3] = 33.33 
0 


20 S 
}) dx = 0.05x°/2] = 10 WX) = 


5, Select an appropriate continuous probability distribution to calculate probabilities in specific applications 

6. Calculate probabilities, determine means and variances for some common continuous probability 
distributions 

7. Standardize normal random variables 

8. Use the table for the cumulative distribution function of a standard normal distribution to calcu- 
late probabilities 

9. Approximate probabilities for some binomial and Poisson distributions 


0 0 


[he expected value of a function A(X) of a continuous random variable is also defined in a 
traightforward manner. 


Function of a 
Continuous 
Random 
Variable 


If is a continuous random variable with probability density function f(x), 


aria] = | mere ae 


(4-5) 
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Seven-Step Procedure for Hypothesis Testing 
The text introduces a sequence of seven steps in 
applying hypothesis-testing methodology and 
explicitly exhibits this procedure in examples. 


Figures 


AU YN 


9-1.6 General Procedure for Hypothesis Tests 


This chapter develops hypothesis-testing procedures for many practical problems. Use of the 
following sequence of steps in applying hypothesis-testing methodology is recommended. 


Parameter of interest: From the problem context, identify the parameter of interest. 
Null hypothesis, Hg: State the null hypothesis, Ho. 


Alternative hypothesis, H,: Specify an appropriate alternative hypothesis, Hy. 


Test statistic: Determine an appropriate test statistic. 


Reject H, if: State the rejection criteria for the null hypothesis. 


Computations: Compute any necessary sample quantities, substitute these into the 
equation for the test statistic, and compute that value. 


Draw conclusions: Decide whether or not Hp should be rejected and report that in 
the problem context. 


Steps 1-4 should be completed prior to examination of the sample data. This sequence of 


steps will be illustrated in subsequent sections. 


1.15 1.25 


1.35 


Hydrocarbon level (x) 


1.45 


1.55 


Scatter diagram of oxygen purity versus hydrocarbon. 
level from Table 11-1. 


Table 11-1 Oxygen and Hydrocarbon Levels 
Numerous figures throughout the text Observation Hydrocarbon Level Purity 
ill istical 3 Itipl Number x(%) y(%) 
illustrate statistical concepts in multiple i as Salo 
formats. 2 1.02 89.05 
3 1.15 91.43 
4 1.29 93.74 
5 1.46 96.73 
6 1.36 94.45 ‘ 
100 
i 0.87 87.59 
8 1.23 91.77 98 
9 1.55 99.42 a 
10 1.40 93.65 
1 1.19 93.54 S&S 94 
12 1.15 92.52 £ 
7 sae 9056 = 
xy 
lowe 89.54 90 
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90.39 ° 
Bo + By 1.25) Trlie regression line 93.25 86 
Ly|x =Bo+ Bix 0.85 
vx =Bot fix 93.41 
94.98 
Bo + By (1.00) 87.33 Figure 11-1 
x= 1.00 x=1.25 x (Hydrocarbon level) 


Figure 11-2 The distribution of Y for a given value of x for the 
oxygen purity—hydrocarbon data. 


Minitab Output 


Throughout the book, we have 
presented output from Minitab as 
typical examples of what can be done 
with modern statistical software. 


Figure 6-6 A stem- 
and-leaf diagram from 
Minitab. 
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Example Problems 
EXAMPLE 10-1 Paint Drying Time 


A set of example problems provides the stu- A product developer is interested in reducing the drying time 4. Test statistic: The test statistic is 
dent with detailed solutions and co ents for of a primer paint. Two formulations of the paint are tested; for- 


mulation | is the standard chemistry, and formulation 2 has a 


interesting, real-world situations. Brief practi- new drying ingredient that should reduce the drying time. 


. . . . From experience, it is known that the standard deviation of 
cal interpretations have been added in this drying time is 8 minutes, and this inherent variability should 
edition. 


where oj = 03 = (8)? = 64 and m, = 1, = 10. 


ainted with formulation 1, and another 10 speci- 5. Reject Hp if: Reject Ho: ju; = p2 if the P-value is less 
with formulation 2; the 20 specimens are than 0.05. 
painted in random oftex, The two sample average drying times 6, Computations: Since ¥, = 121 minutes and ¥ = 112 


are X,; = 121 minutes antsy, = 112 minutes, respectively. minutes, the test statistic is 

What conclusions can the pro leveloper draw about the 1-112 

effectiveness of the new ingredient, usimga = 0.05? 4 = SS = 
We apply the seven-step procedure to™thj (8) ES (3° 

follows: 10 


1. Parameter of interest: The quantity of interest is the dr 
ference in mean drying times, 4, — 4), and Ay = 0. 


Conclusion: Since z) = 2.52, the P-value is P = 
1— (2.52) = 0.0059, so we reject Hy at the a = 0.05 level 


2. Non hypothesis: Ho: 4) — 2 = 0, or Ho: Bi = H2- Practical Interpretation) We conclude that adding the 
3. Alternative hypothesis: H,: 2; > 2. We want to reject new ingredient to the paint significantly reduces the drying 
Ai, if the new ingredient reduces mean drying time. time. This is a strong conclusion. 


Exercis 
ian EXERCISES FOR SECTION 5-5 
Each chapter has an extensive collection 5-67. Suppose that _Y is a random variable with probability 5-73. Suppose that X has the probability distribution 
of exercises, including end-of-section cistibution fra) =1, 1 2 
: * ‘ ‘i fy(x) = 1, Ses 
exercises that emphasize the material in fee) = 1/4, x= 1,2,3,4 
that section, supplemental exercises at | find they —_ aa ee ee 
the end of the chapter that cover the 66a. 1 (a) ene Ne Event distribution of the random variable Supplemental Exercises 
scope of chapter topics and require the orate (b) Find the expected value of Y. 5-75. Show that the following function satisfies the proper- 
sos 5-70. Suppose that X has a uniform probability distribution ties of a joint probability mass function: 
student to make a decision about the 5-69. § ee 
: probabilit} =1 0=<x=1 
approach they will use to solve the fxs) = 1, x i Fi Ties 
pre oblem, and mind-expanding exer- Show that the probability distribution of the random variable 0 0 1/4 
cises that often require the student to Y = —2 In Yis chi-squared with two degrees of freedom. 0 1 1/8 
“ 5-71. A random variable X has the following probability 1 0 1/8 
extend the text material somewhat or to distribution: 1 i 1/4 
apply it in a novel situation. —_ 7 2 2 1/4 
Answers are provided to most 
odd-numbered exercises in Appendix C MIND-EXPANDING EXERCISES 
in the text, and the WileyPLUS online ; ; 5) 
7 :. * 5-96. Show that if X,, .%5,..., X, are independent, and h(y), is a function only of y. Show that Y and Y 
learning environment includes for stu- continuous random variables, P(X, © A,X; € Ay, ..., are independent. eae Sante 
dents complete detailed solutions to X, €A,) = P(X, € A\)P(%Q € Aa)... P(X, € A,) forany 5-100. This exercise extends the hypergeometric dis- 
. regions 4,, 4,..., A, in the range of X,, X;,..., X, tribution to multiple variables. Consider a population that ¥ = 1 
selected exercises. respectively. with N items of k different types. Assume there are N, 
5-97. Show that if X;, %5,..., X, are independent _ items of type 1, N) items of type 2, ..., N; items of type k 
random variables and Y = c,X, + c)X) + --- + ¢,X,, so that N, + N, +... +... NM, = N. Suppose that a ran- 
dom sample of size n is selected, without replacement, eumatoid 
WY) = ciV(X1) + BV(%) +o + opV(Xp) from the population. Let X), X,, ..., X;, denote the number ide effects 
You may assume that the random variables are continuous. of items of each type in the sample so that Xj, %2, +... + ople react 
GlORMSunpecet ha thence mobabiliyttmetonte sme X, = n. Show that for feasible values of the parame- 
the continuous random variables Xand Yis constant on ‘°° "2 *B-->%e MON re ESATA 
the rectangle 0 <x <a,0 <y <b, Show that Yand Y eC a) 
are independent. opih@ erie =e 1] \x2 Sn 
5-99. Suppose that the range of the continuous (*) 
variables X and Y is 0 <x <a and 0<y <b. Also n 
suppose that the joint probability density function 
Sey(% ¥) = g(x)h(y), where g(x) is a function only of 
Important Terms and Concepts IMPORTANT TERMS AND CONCEPTS 
At the end of each chapter is a list of Bivariate distribution Conditional variance Joint probability density | Multinomial distribution 
important terms and concepts for an Bivariate normal Contour plots function . Reproductive property of 
distribution Correlation Joint probability mass the normal 
easy self-check and study tool. Conditional mean Covariance function distribution 


STUDENT RESOURCES 


¢ DataSets Data sets for all examples and exercises in the text. Visit the student section of the 
book Web site at www.wiley.com/college/montgomery to access these materials. 


¢ Student Solutions Manual Detailed solutions for selected problems in the book. The Student 
Solutions Manual may be purchased from the Web site at www.wiley.com/college/montgomery. 
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INSTRUCTOR RESOURCES 


MINITAB 


WileyPLUS 


For Students 


The following resources are available only to instructors who adopt the text: 


¢ Solutions Manual All solutions to the exercises in the text. 

¢ DataSets Data sets for all examples and exercises in the text. 
¢ Image Gallery of Text Figures 

¢ PowerPoint Lecture Slides 

¢ Section on Logistic Regression 


These instructor-only resources are password-protected. Visit the instructor section of the book Web site 
at www.wiley.com/college/montgomery to register for a password to access these materials. 


A student version of Minitab is available as an option to purchase in a set with this text. Student versions 
of software often do not have all the functionality that full versions do. Consequently, student versions 
may not support all the concepts presented in this text. If you would like to adopt for your course the set 
of this text with the student version of Minitab, please contact your local Wiley representative at 
www.wiley.com/college/rep. 

Alternatively, students may find information about how to purchase the professional version of the 
software for academic use at www.minitab.com. 


This online teaching and learning environment integrates the entire digital textbook with the most 
effective instructor and student resources to fit every learning style. 
With WileyPLUS: 


¢ Students achieve concept mastery in a rich, structured environment that’s available 24/7. 


¢ Instructors personalize and manage their course more effectively with assessment, assignments, 
grade tracking, and more. 


WileyPLUS can complement your current textbook or replace the printed text altogether. 


Personalize the learning experience 
Different learning styles, different levels of proficiency, different levels of preparation—each of your stu- 
dents is unique. WileyPLUS empowers them to take advantage of their individual strengths: 


¢ Students receive timely access to resources that address their demonstrated needs, and get im- 
mediate feedback and remediation when needed. 


¢ Integrated, multi-media resources—including audio and visual exhibits, demonstration prob- 
lems, and much more—provide multiple study-paths to fit each student’s learning preferences 
and encourage more active learning. 

¢ WileyPLUS includes many opportunities for self-assessment linked to the relevant portions 


of the text. Students can take control of their own learning and practice until they master the 
material. 
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For Instructors 


Personalize the teaching experience 
WileyPLUS empowers you with the tools and resources you need to make your teaching even more 
effective: 


¢ You can customize your classroom presentation with a wealth of resources and functionality 
from PowerPoint slides to a database of rich visuals. You can even add your own materials to 
your WileyPLUS course. 


¢ With WileyPLUS you can identify those students who are falling behind and intervene accord- 
ingly, without having to wait for them to come to office hours. 


¢ WileyPLUS simplifies and automates such tasks as student performance assessment, making as- 
signments, scoring student work, keeping grades, and more. 


COURSE SYLLABUS SUGGESTIONS 


This is a very flexible textbook because instructors’ ideas about what should be in a first course on sta- 
tistics for engineers vary widely, as do the abilities of different groups of students. Therefore, we hesitate 
to give too much advice, but will explain how we use the book. 

We believe that a first course in statistics for engineers should be primarily an applied statistics 
course, not a probability course. In our one-semester course we cover all of Chapter 1 (in one or two 
lectures); overview the material on probability, putting most of the emphasis on the normal distribution 
(six to eight lectures); discuss most of Chapters 6 through 10 on confidence intervals and tests (twelve to 
fourteen lectures); introduce regression models in Chapter 11 (four lectures); give an introduction to the 
design of experiments from Chapters 13 and 14 (six lectures); and present the basic concepts of statisti- 
cal process control, including the Shewhart control chart from Chapter 15 (four lectures). This leaves 
about three to four periods for exams and review. Let us emphasize that the purpose of this course is to 
introduce engineers to how statistics can be used to solve real-world engineering problems, not to weed 
out the less mathematically gifted students. This course is not the “baby math-stat” course that is all too 
often given to engineers. 

If a second semester is available, it is possible to cover the entire book, including much of the 
supplemental material, if appropriate for the audience. It would also be possible to assign and work 
many of the homework problems in class to reinforce the understanding of the concepts. Obviously, 
multiple regression and more design of experiments would be major topics in a second course. 


USING THE COMPUTER 


In practice, engineers use computers to apply statistical methods to solve problems. Therefore, we strongly 

recommend that the computer be integrated into the class. Throughout the book we have presented output 

from Minitab as typical examples of what can be done with modern statistical software. In teaching, we 

have used other software packages, including Statgraphics, JMP, and Statistica. We did not clutter up the 

book with examples from many different packages because how the instructor integrates the software into 

the class is ultimately more important than which package is used. All text data are available in electronic 
G form on the textbook Web site. In some chapters, there are problems that we feel should be worked using 
> computer software. We have marked these problems with a special icon in the margin. 

In our own classrooms, we use the computer in almost every lecture and demonstrate how the tech- 
nique is implemented in software as soon as it is discussed in the lecture. Student versions of many sta- 
tistical software packages are available at low cost, and students can either purchase their own copy or 
use the products available on the PC local area networks. We have found that this greatly improves the 
pace of the course and student understanding of the material. 
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Users should be aware that final answers may differ slightly due to different numerical precision 
and rounding protocols among softwares. 
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The Role of Statistics 


in Engineering 


Statistics is a science that helps us make decisions and draw conclusions in the pres- 
ence of variability. For example, civil engineers working in the transportation field are 
concerned about the capacity of regional highway systems. A typical problem would 
involve data on the number of nonwork, home-based trips, the number of persons per 
household, and the number of vehicles per household, and the objective would be to 
produce a trip-generation model relating trips to the number of persons per household 
and the number of vehicles per household. A statistical technique called regression 
analysis can be used to construct this model. The trip-generation model is an important 
tool for transportation systems planning. Regression methods are among the most 
widely used statistical techniques in engineering. They are presented in Chapters 11 
and 12. 

Hospital emergency departments (EDs) are an important part of the health-care de- 
livery system. The process by which patients arrive at the ED is highly variable and can 
depend on the hour of the day and the day of the week, as well as on longer-term cycli- 
cal variations. The service process is also highly variable, depending on the types of 
services that the patients require, the number of patients in the ED, and how the ED is 
staffed and organized. The capacity of an ED is also limited, so consequently some pa- 
tients experience long waiting times. How long do patients wait, on average? This is an 
important question for health-care providers. If waiting times become excessive, some 
patients will leave without receiving treatment (LWOT). Patients who LWOT are a seri- 
ous problem, as they do not have their medical concerns addressed and are at risk for 
further problems and complications. Therefore, another important question is: What 
proportion of patients LWOT from the ED? These questions can be solved by employ- 
ing probability models to describe the ED, and from these models very precise esti- 
mates of waiting times and the number of patients who LWOT can be obtained. 
Probability models that can be used to solve these types of problems are discussed in 
Chapters 2 through 5. 
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The concepts of probability and statistics are powerful ones and contribute extensively 
to the solutions of many types of engineering problems. You will encounter many exam- 
ples of these applications in this book. 


CHAPTER OUTLINE 
1-1 THE ENGINEERING METHOD AND 1-2.4 Designed Experiments 
STATISTICAL THINKING 


1-2. COLLECTING ENGINEERING DATA 


1-2.5 Observing Processes Over Time 
1-3 MECHANISTIC AND EMPIRICAL 


1-2.1 Basic Principles MODELS 
1-2.2 Retrospective Study 1-4 PROBABILITY AND PROBABILITY 


1-2.3 Observational Study 


MODELS 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 


aA bB WN 


. Identify the role that statistics can play in the engineering problem-solving process 

. Discuss how variability affects the data collected and used for making engineering decisions 
. Explain the difference between enumerative and analytical studies 

. Discuss the different methods that engineers use to collect data 


. Identify the advantages that designed experiments have in comparison to other methods of 


collecting engineering data 


6. Explain the differences between mechanistic models and empirical models 


7. Discuss how probability and probability models are used in engineering and science 


1-1 THE ENGINEERING METHOD AND STATISTICAL THINKING 


An engineer is someone who solves problems of interest to society by the efficient application 
of scientific principles. Engineers accomplish this by either refining an existing product or 
process or by designing a new product or process that meets customers’ needs. The engineering, 
or scientific, method is the approach to formulating and solving these problems. The steps in 
the engineering method are as follows: 


1. 
2. 


Develop a clear and concise description of the problem. 


Identify, at least tentatively, the important factors that affect this problem or that may 
play a role in its solution. 


Propose a model for the problem, using scientific or engineering knowledge of the 
phenomenon being studied. State any limitations or assumptions of the model. 


Conduct appropriate experiments and collect data to test or validate the tentative 
model or conclusions made in steps 2 and 3. 


Refine the model on the basis of the observed data. 
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Figure 1-1 The 
engineering method. 


6. Manipulate the model to assist in developing a solution to the problem. 


7. Conduct an appropriate experiment to confirm that the proposed solution to the 
problem is both effective and efficient. 


8. Draw conclusions or make recommendations based on the problem solution. 


The steps in the engineering method are shown in Fig. 1-1. Many of the engineering sciences 
are employed in the engineering method: the mechanical sciences (statics, dynamics), fluid 
science, thermal science, electrical science, and the science of materials. Notice that the engi- 
neering method features a strong interplay between the problem, the factors that may influence 
its solution, a model of the phenomenon, and experimentation to verify the adequacy of the 
model and the proposed solution to the problem. Steps 2—4 in Fig. 1-1 are enclosed in a box, 
indicating that several cycles or iterations of these steps may be required to obtain the final 
solution. Consequently, engineers must know how to efficiently plan experiments, collect data, 
analyze and interpret the data, and understand how the observed data are related to the model 
they have proposed for the problem under study. 

The field of statistics deals with the collection, presentation, analysis, and use of data to 
make decisions, solve problems, and design products and processes. In simple terms, statistics 
is the science of data. Because many aspects of engineering practice involve working with 
data, obviously knowledge of statistics is just as important to an engineer as the other engineering 
sciences. Specifically, statistical techniques can be a powerful aid in designing new products 
and systems, improving existing designs, and designing, developing, and improving production 
processes. 

Statistical methods are used to help us describe and understand variability. By variability, 
we mean that successive observations of a system or phenomenon do not produce exactly the 
same result. We all encounter variability in our everyday lives, and statistical thinking can 
give us a useful way to incorporate this variability into our decision-making processes. For 
example, consider the gasoline mileage performance of your car. Do you always get exactly the 
same mileage performance on every tank of fuel? Of course not—in fact, sometimes the mileage 
performance varies considerably. This observed variability in gasoline mileage depends on 
many factors, such as the type of driving that has occurred most recently (city versus high- 
way), the changes in condition of the vehicle over time (which could include factors such as 
tire inflation, engine compression, or valve wear), the brand and/or octane number of the 
gasoline used, or possibly even the weather conditions that have been recently experienced. 
These factors represent potential sources of variability in the system. Statistics provides a 
framework for describing this variability and for learning about which potential sources of 
variability are the most important or which have the greatest impact on the gasoline mileage 
performance. 

We also encounter variability in dealing with engineering problems. For example, sup- 
pose that an engineer is designing a nylon connector to be used in an automotive engine 
application. The engineer is considering establishing the design specification on wall thickness 
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at 3/32 inch but is somewhat uncertain about the effect of this decision on the connector pull- 
off force. If the pull-off force is too low, the connector may fail when it is installed in an en- 
gine. Eight prototype units are produced and their pull-off forces measured, resulting in the 
following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6, 13.5, 12.6, 13.1. As we anticipated, 
not all of the prototypes have the same pull-off force. We say that there is variability in the 
pull-off force measurements. Because the pull-off force measurements exhibit variability, we 
consider the pull-off force to be a random variable. A convenient way to think of a random 
variable, say X, that represents a measurement is by using the model 


X=pte (1-1) 


where w is a constant and € is a random disturbance. The constant remains the same with every 
measurement, but small changes in the environment, variance in test equipment, differences in 
the individual parts themselves, and so forth change the value of e. If there were no distur- 
bances, € would always equal zero and X would always be equal to the constant jz. However, 
this never happens in the real world, so the actual measurements X exhibit variability. We often 
need to describe, quantify, and ultimately reduce variability. 

Figure 1-2 presents a dot diagram of these data. The dot diagram is a very useful plot for 
displaying a small body of data—say, up to about 20 observations. This plot allows us to see 
easily two features of the data: the location, or the middle, and the scatter or variability. When 
the number of observations is small, it is usually difficult to identify any specific patterns in the 
variability, although the dot diagram is a convenient way to see any unusual data features. 

The need for statistical thinking arises often in the solution of engineering problems. 
Consider the engineer designing the connector. From testing the prototypes, he knows that the 
average pull-off force is 13.0 pounds. However, he thinks that this may be too low for the 
intended application, so he decides to consider an alternative design with a greater wall 
thickness, 1/8 inch. Eight prototypes of this design are built, and the observed pull-off force 
measurements are 12.9, 13.7, 12.8, 13.9, 14.2, 13.2, 13.5, and 13.1. The average is 13.4. 
Results for both samples are plotted as dot diagrams in Fig. 1-3. This display gives the im- 
pression that increasing the wall thickness has led to an increase in pull-off force. However, 
there are some obvious questions to ask. For instance, how do we know that another sample 
of prototypes will not give different results? Is a sample of eight prototypes adequate to give 
reliable results? If we use the test results obtained so far to conclude that increasing the wall 
thickness increases the strength, what risks are associated with this decision? For example, 
is it possible that the apparent increase in pull-off force observed in the thicker prototypes 
is only due to the inherent variability in the system and that increasing the thickness of the 
part (and its cost) really has no effect on the pull-off force? 

Often, physical laws (such as Ohm’s law and the ideal gas law) are applied to help design 
products and processes. We are familiar with this reasoning from general laws to specific 
cases. But it is also important to reason from a specific set of measurements to more general 
cases to answer the previous questions. This reasoning is from a sample (such as the eight 
connectors) to a population (such as the connectors that will be sold to customers). The 
reasoning is referred to as statistical inference. See Fig. 1-4. Historically, measurements were 
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Figure 1-2 Dot diagram of the pull-off force Figure 1-3 Dot diagram of pull-off force for two wall 
data when wall thickness is 3/32 inch. thicknesses. 


Figure 1-4 Statistical 
inference is one type 
of reasoning. 
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obtained from a sample of people and generalized to a population, and the terminology has 
remained. Clearly, reasoning based on measurements from some objects to measurements on 
all objects can result in errors (called sampling errors). However, if the sample is selected 
properly, these risks can be quantified and an appropriate sample size can be determined. 


1-2 COLLECTING ENGINEERING DATA 


1-2.1. Basic Principles 


In the previous section, we illustrated some simple methods for summarizing data. Sometimes 
the data are all of the observations in the populations. This results in a census. However, in the 
engineering environment, the data are almost always a sample that has been selected from the 
population. Three basic methods of collecting data are 


e A retrospective study using historical data 
e An observational study 
e A designed experiment 
An effective data-collection procedure can greatly simplify the analysis and lead to improved 


understanding of the population or process that is being studied. We now consider some 
examples of these data-collection methods. 


1-2.2 Retrospective Study 


Montgomery, Peck, and Vining (2006) describe an acetone-butyl] alcohol distillation column 
for which concentration of acetone in the distillate or output product stream is an important 
variable. Factors that may affect the distillate are the reboil temperature, the condensate tem- 
perature, and the reflux rate. Production personnel obtain and archive the following records: 


e The concentration of acetone in an hourly test sample of output product 

e The reboil temperature log, which is a plot of the reboil temperature over time 

¢ The condenser temperature controller log 

e The nominal reflux rate each hour 
The reflux rate should be held constant for this process. Consequently, production personnel 
change this very infrequently. 


A retrospective study would use either all or a sample of the historical process data 
archived over some period of time. The study objective might be to discover the relationships 
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among the two temperatures and the reflux rate on the acetone concentration in the output 
product stream. However, this type of study presents some problems: 


1. We may not be able to see the relationship between the reflux rate and acetone con- 
centration, because the reflux rate didn’t change much over the historical period. 


2. The archived data on the two temperatures (which are recorded almost continuously) 
do not correspond perfectly to the acetone concentration measurements (which are 
made hourly). It may not be obvious how to construct an approximate correspondence. 


3. Production maintains the two temperatures as closely as possible to desired targets or 
set points. Because the temperatures change so little, it may be difficult to assess their 
real impact on acetone concentration. 


4. In the narrow ranges within which they do vary, the condensate temperature tends to 
increase with the reboil temperature. Consequently, the effects of these two process 
variables on acetone concentration may be difficult to separate. 


As you can see, a retrospective study may involve a lot of data, but those data may contain 
relatively little useful information about the problem. Furthermore, some of the relevant 
data may be missing, there may be transcription or recording errors resulting in outliers 
(or unusual values), or data on other important factors may not have been collected and 
archived. In the distillation column, for example, the specific concentrations of butyl alcohol 
and acetone in the input feed stream are a very important factor, but they are not archived 
because the concentrations are too hard to obtain on a routine basis. As a result of these types 
of issues, statistical analysis of historical data sometimes identifies interesting phenomena, 
but solid and reliable explanations of these phenomena are often difficult to obtain. 


1-2.3 Observational Study 


In an observational study, the engineer observes the process or population, disturbing it as 
little as possible, and records the quantities of interest. Because these studies are usually con- 
ducted for a relatively short time period, sometimes variables that are not routinely measured 
can be included. In the distillation column, the engineer would design a form to record the two 
temperatures and the reflux rate when acetone concentration measurements are made. It may 
even be possible to measure the input feed stream concentrations so that the impact of this 
factor could be studied. Generally, an observational study tends to solve problems | and 2 
above and goes a long way toward obtaining accurate and reliable data. However, observa- 
tional studies may not help resolve problems 3 and 4. 


1-2.4 Designed Experiments 


In a designed experiment the engineer makes deliberate or purposeful changes in the control- 
lable variables of the system or process, observes the resulting system output data, and then 
makes an inference or decision about which variables are responsible for the observed changes 
in output performance. The nylon connector example in Section 1-1 illustrates a designed 
experiment; that is, a deliberate change was made in the wall thickness of the connector with 
the objective of discovering whether or not a greater pull-off force could be obtained. 
Experiments designed with basic principles such as randomization are needed to establish 
cause-and-effect relationships. 

Much of what we know in the engineering and physical-chemical sciences is developed 
through testing or experimentation. Often engineers work in problem areas in which no 
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scientific or engineering theory is directly or completely applicable, so experimentation and 
observation of the resulting data constitute the only way that the problem can be solved. Even 
when there is a good underlying scientific theory that we may rely on to explain the phenom- 
ena of interest, it is almost always necessary to conduct tests or experiments to confirm that the 
theory is indeed operative in the situation or environment in which it is being applied. 
Statistical thinking and statistical methods play an important role in planning, conducting, and 
analyzing the data from engineering experiments. Designed experiments play a very important 
role in engineering design and development and in the improvement of manufacturing processes. 

For example, consider the problem involving the choice of wall thickness for the 
nylon connector. This is a simple illustration of a designed experiment. The engineer chose two 
wall thicknesses for the connector and performed a series of tests to obtain pull-off force 
measurements at each wall thickness. In this simple comparative experiment, the engineer is 
interested in determining if there is any difference between the 3/32- and 1/8-inch designs. An 
approach that could be used in analyzing the data from this experiment is to compare the mean 
pull-off force for the 3/32-inch design to the mean pull-off force for the 1/8-inch design using 
statistical hypothesis testing, which is discussed in detail in Chapters 9 and 10. Generally, a 
hypothesis is a statement about some aspect of the system in which we are interested. For 
example, the engineer might want to know if the mean pull-off force of a 3/32-inch design 
exceeds the typical maximum load expected to be encountered in this application, say, 12.75 
pounds. Thus, we would be interested in testing the hypothesis that the mean strength exceeds 
12.75 pounds. This is called a single-sample hypothesis-testing problem. Chapter 9 presents 
techniques for this type of problem. Alternatively, the engineer might be interested in testing 
the hypothesis that increasing the wall thickness from 3/32 to 1/8 inch results in an increase 
in mean pull-off force. It is an example of a two-sample hypothesis-testing problem. Two- 
sample hypothesis-testing problems are discussed in Chapter 10. 

Designed experiments are a very powerful approach to studying complex systems, such 
as the distillation column. This process has three factors—the two temperatures and the reflux 
rate—and we want to investigate the effect of these three factors on output acetone concentra- 
tion. A good experimental design for this problem must ensure that we can separate the effects 
of all three factors on the acetone concentration. The specified values of the three factors used 
in the experiment are called factor levels. Typically, we use a small number of levels for each 
factor, such as two or three. For the distillation column problem, suppose we use two levels, 
“high” and “low” (denoted +1 and —1, respectively), for each of the three factors. A very 
reasonable experiment design strategy uses every possible combination of the factor levels to 
form a basic experiment with eight different settings for the process. This type of experiment 
is called a factorial experiment. Table 1-1 presents this experimental design. 


Table 1-1 The Designed Experiment (Factorial Design) for the 
Distillation Column 


Reboil Temp. Condensate Temp. Reflux Rate 
=Il = =I 
+1 —1 —1 
=I +1 = 
+1 +1 —1 
=Il = +1 
+1 —1 +1 
=I +1 +1 
+1 +1 +1 
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Figure 1-5 The 
factorial design for 
the distillation column. 


Figure 1-6 A four- 
factorial experiment 
for the distillation 
column. 
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Figure 1-5 illustrates that this design forms a cube in terms of these high and low levels. 
With each setting of the process conditions, we allow the column to reach equilibrium, take 
a sample of the product stream, and determine the acetone concentration. We then can draw 
specific inferences about the effect of these factors. Such an approach allows us to proactively 
study a population or process. 

An important advantage of factorial experiments is that they allow one to detect an 
interaction between factors. Consider only the two temperature factors in the distillation 
experiment. Suppose that the response concentration is poor when the reboil temperature is 
low, regardless of the condensate temperature. That is, the condensate temperature has no 
effect when the reboil temperature is Jow. However, when the reboil temperature is high, a 
high condensate temperature generates a good response, while a Jow condensate tempera- 
ture generates a poor response. That is, the condensate temperature changes the response 
when the reboil temperature is high. The effect of condensate temperature depends on the 
setting of the reboil temperature, and these two factors would interact in this case. If the four 
combinations of high and low reboil and condensate temperatures were not tested, such an 
interaction would not be detected. 

We can easily extend the factorial strategy to more factors. Suppose that the engineer wants 
to consider a fourth factor, type of distillation column. There are two types: the standard one 
and a newer design. Figure 1-6 illustrates how all four factors—reboil temperature, condensate 
temperature, reflux rate, and column design—could be investigated in a factorial design. Since 
all four factors are still at two levels, the experimental design can still be represented geometri- 
cally as a cube (actually, it’s a hypercube). Notice that as in any factorial design, all possible 
combinations of the four factors are tested. The experiment requires 16 trials. 

Generally, if there are k factors and they each have two levels, a factorial experimental 
design will require 2" runs. For example, with k = 4, the 2* design in Fig. 1-6 requires 16 tests. 
Clearly, as the number of factors increases, the number of trials required in a factorial experi- 
ment increases rapidly; for instance, eight factors each at two levels would require 256 trials. 
This quickly becomes unfeasible from the viewpoint of time and other resources. Fortunately, 
when there are four to five or more factors, it is usually unnecessary to test all possible 
combinations of factor levels. A fractional factorial experiment is a variation of the basic 
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Figure 1-7 A frac- 
tional factorial experi- 
ment for the connector 
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factorial arrangement in which only a subset of the factor combinations are actually tested. 
Figure 1-7 shows a fractional factorial experimental design for the four-factor version of the 
distillation experiment. The circled test combinations in this figure are the only test combina- 
tions that need to be run. This experimental design requires only 8 runs instead of the original 
16; consequently it would be called a one-half fraction. This is an excellent experimental 
design in which to study all four factors. It will provide good information about the individual 
effects of the four factors and some information about how these factors interact. 

Factorial and fractional factorial experiments are used extensively by engineers and scien- 
tists in industrial research and development, where new technology, products, and processes are 
designed and developed and where existing products and processes are improved. Since so 
much engineering work involves testing and experimentation, it is essential that all engineers 
understand the basic principles of planning efficient and effective experiments. We discuss 
these principles in Chapter 13. Chapter 14 concentrates on the factorial and fractional factorials 
that we have introduced here. 


1-2.5 Observing Processes Over Time 


Figure 1-8 The dot 
diagram illustrates 
variation but does not 
identify the problem. 


Often data are collected over time. In this case, it is usually very helpful to plot the data versus 
time in a time series plot. Phenomena that might affect the system or process often become 
more visible in a time-oriented plot and the concept of stability can be better judged. 

Figure 1-8 is a dot diagram of acetone concentration readings taken hourly from the 
distillation column described in Section 1-2.2. The large variation displayed on the dot 
diagram indicates a lot of variability in the concentration, but the chart does not help explain 
the reason for the variation. The time series plot is shown in Figure 1-9. A shift in the process 
mean level is visible in the plot and an estimate of the time of the shift can be obtained. 

W. Edwards Deming, a very influential industrial statistician, stressed that it is important 
to understand the nature of variability in processes and systems over time. He conducted an 
experiment in which he attempted to drop marbles as close as possible to a target on a table. 
He used a funnel mounted on a ring stand and the marbles were dropped into the funnel. See 
Fig. 1-10. The funnel was aligned as closely as possible with the center of the target. He then 
used two different strategies to operate the process. (1) He never moved the funnel. He just 
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Figure 1-9 A time series plot of concentration provides Figure 1-10 Deming’s funnel experiment. 


more information than the dot diagram. 


Figure 1-11 Adjust- 
ments applied to 
random disturbances 
overcontrol the process 
and increase the devia- 
tions from the target. 


dropped one marble after another and recorded the distance from the target. (2) He dropped 
the first marble and recorded its location relative to the target. He then moved the funnel an 
equal and opposite distance in an attempt to compensate for the error. He continued to make 
this type of adjustment after each marble was dropped. 

After both strategies were completed, he noticed that the variability of the distance 
from the target for strategy 2 was approximately 2 times larger than for strategy 1. The adjust- 
ments to the funnel increased the deviations from the target. The explanation is that the error 
(the deviation of the marble’s position from the target) for one marble provides no information 
about the error that will occur for the next marble. Consequently, adjustments to the funnel do 
not decrease future errors. Instead, they tend to move the funnel farther from the target. 

This interesting experiment points out that adjustments to a process based on random dis- 
turbances can actually increase the variation of the process. This is referred to as overcontrol 
or tampering. Adjustments should be applied only to compensate for a nonrandom shift in the 
process—then they can help. A computer simulation can be used to demonstrate the lessons of 
the funnel experiment. Figure 1-11 displays a time plot of 100 measurements (denoted as y) 
from a process in which only random disturbances are present. The target value for the process 


4 
—e— Without adjustment 
2 ° With adjustment 
0) 
1 11 21 31 4l 51 61 71 81 91 


Observation number 


Figure 1-12 Process 
mean shift is detected at 
observation number 57, 
and one adjustment 

(a decrease of two units) 
reduces the deviations 
from target. 


Figure 1-13. Acon- 
trol chart for the 
chemical process 
concentration data. 
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is 10 units. The figure displays the data with and without adjustments that are applied to the 
process mean in an attempt to produce data closer to target. Each adjustment is equal and 
opposite to the deviation of the previous measurement from target. For example, when the 
measurement is 11 (one unit above target), the mean is reduced by one unit before the next 
measurement is generated. The overcontrol has increased the deviations from the target. 

Figure 1-12 displays the data without adjustment from Fig. 1-11, except that the measure- 
ments after observation number 50 are increased by two units to simulate the effect of a shift 
in the mean of the process. When there is a true shift in the mean of a process, an adjustment 
can be useful. Figure 1-12 also displays the data obtained when one adjustment (a decrease of 
two units) is applied to the mean after the shift is detected (at observation number 57). Note 
that this adjustment decreases the deviations from target. 

The question of when to apply adjustments (and by what amounts) begins with an under- 
standing of the types of variation that affect a process. A control chart is an invaluable way 
to examine the variability in time-oriented data. Figure 1-13 presents a control chart for 
the concentration data from Fig. 1-9. The center line on the control chart is just the average of 
the concentration measurements for the first 20 samples (x = 91.5 g/l) when the process is 
stable. The upper control limit and the lower control limit are a pair of statistically derived 
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limits that reflect the inherent or natural variability in the process. These limits are located 
three standard deviations of the concentration values above and below the center line. If the 
process is operating as it should, without any external sources of variability present in the 
system, the concentration measurements should fluctuate randomly around the center line, and 
almost all of them should fall between the control limits. 

In the control chart of Fig. 1-13, the visual frame of reference provided by the center line 
and the control limits indicates that some upset or disturbance has affected the process around 
sample 20 because all of the following observations are below the center line, and two of them 
actually fall below the lower control limit. This is a very strong signal that corrective action is 
required in this process. If we can find and eliminate the underlying cause of this upset, we can 
improve process performance considerably. 

Furthermore, Deming pointed out that data from a process are used for different types of 
conclusions. Sometimes we collect data from a process to evaluate current production. For 
example, we might sample and measure resistivity on three semiconductor wafers selected 
from a lot and use this information to evaluate the lot. This is called an enumerative study. 
However, in many cases we use data from current production to evaluate future production. We 
apply conclusions to a conceptual, future population. Deming called this an analytic study. 
Clearly this requires an assumption of a stable process, and Deming emphasized that control 
charts were needed to justify this assumption. See Fig. 1-14 as an illustration. 

Control charts are a very important application of statistics for monitoring, controlling, 
and improving a process. The branch of statistics that makes use of control charts is called 
statistical process control, or SPC. We will discuss SPC and control charts in Chapter 15. 


1-3 MECHANISTIC AND EMPIRICAL MODELS 


Models play an important role in the analysis of nearly all engineering problems. Much of the 
formal education of engineers involves learning about the models relevant to specific fields 
and the techniques for applying these models in problem formulation and solution. As a sim- 
ple example, suppose we are measuring the flow of current in a thin copper wire. Our model 
for this phenomenon might be Ohm’s law: 


Current = voltage/resistance 
or 


I= E/R (1-2) 
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We call this type of model a mechanistic model because it is built from our underlying 
knowledge of the basic physical mechanism that relates these variables. However, if we 
performed this measurement process more than once, perhaps at different times, or even on 
different days, the observed current could differ slightly because of small changes or varia- 
tions in factors that are not completely controlled, such as changes in ambient temperature, 
fluctuations in performance of the gauge, small impurities present at different locations in the 
wire, and drifts in the voltage source. Consequently, a more realistic model of the observed 
current might be 


I=E/R+e (1-3) 


where € is a term added to the model to account for the fact that the observed values of 
current flow do not perfectly conform to the mechanistic model. We can think of € as a 
term that includes the effects of all of the unmodeled sources of variability that affect this 
system. 

Sometimes engineers work with problems for which there is no simple or well- 
understood mechanistic model that explains the phenomenon. For instance, suppose we are 
interested in the number average molecular weight (M,,) of a polymer. Now we know that M,, 
is related to the viscosity of the material (V), and it also depends on the amount of catalyst (C) 
and the temperature (7) in the polymerization reactor when the material is manufactured. 
The relationship between M,, and these variables is 


M,, = f(V, C, T) (1-4) 


say, where the form of the function f is unknown. Perhaps a working model could be de- 
veloped from a first-order Taylor series expansion, which would produce a model of the 
form 


M, = Bo + BiV + BoC + B3T (1-5) 


where the B’s are unknown parameters. Now just as in Ohm’s law, this model will not exactly 
describe the phenomenon, so we should account for the other sources of variability that may 
affect the molecular weight by adding another term to the model; therefore, 


M, = Bo + BiV + BoC + BaT + € (1-6) 


is the model that we will use to relate molecular weight to the other three variables. This type 
of model is called an empirical model; that is, it uses our engineering and scientific knowl- 
edge of the phenomenon, but it is not directly developed from our theoretical or first-principles 
understanding of the underlying mechanism. 

To illustrate these ideas with a specific example, consider the data in Table 1-2. This table 
contains data on three variables that were collected in an observational study in a semicon- 
ductor manufacturing plant. In this plant, the finished semiconductor is wire-bonded to a 
frame. The variables reported are pull strength (a measure of the amount of force required to 
break the bond), the wire length, and the height of the die. We would like to find a model 
relating pull strength to wire length and die height. Unfortunately, there is no physical mech- 
anism that we can easily apply here, so it doesn’t seem likely that a mechanistic modeling 
approach will be successful. 

Figure 1-15 presents a three-dimensional plot of all 25 observations on pull strength, wire 
length, and die height. From examination of this plot, we see that pull strength increases as 
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Figure 1-15 Three- 
dimensional plot of 
the wire bond pull 
strength data. 


Table 1-2 Wire Bond Pull Strength Data 


Observation Pull Strength Wire Length Die Height 


Number y xy Xp 
1 9.95 D 50 
2 24.45 8 110 
3 31.75 11 120 
4 35.00 10 550 
5 25.02 8 295 
6 16.86 4 200 
7 14.38 D 375 
8 9.60 2 52 
9 24.35 9 100 

10 27.50 8 300 
11 17.08 4 412 
12 37.00 11 400 
13 41.95 12 500 
14 11.66 2 360 
15 21.65 4 205 
16 17.89 4 400 
7 69.00 20 600 
18 10.30 1 585 
19 34.93 10 540 
20 46.59 15 250 
21 44.88 15 290 
22 54.12 16 510 
23 56.63 17 590 
24 22.13 6 100 
25 21.15 5 400 


both wire length and die height increase. Furthermore, it seems reasonable to think that a 
model such as 


Pull strength = By + B,(wire length) + B,(die height) + e 


would be appropriate as an empirical model for this relationship. In general, this type of 
empirical model is called a regression model. In Chapters 11 and 12 we show how to build 
these models and test their adequacy as approximating functions. We will use a method for 
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Figure 1-16 Plot of 
predicted values of 
pull strength from the 
empirical model. 
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Pull strength 


Wire length 


estimating the parameters in regression models, called the method of least squares, that 
traces its origins to work by Karl Gauss. Essentially, this method chooses the parameters in 
the empirical model (the B’s) to minimize the sum of the squared distances between each 
data point and the plane represented by the model equation. Applying this technique to the 
data in Table 1-2 results in 


a 
Pull strength = 2.26 + 2.74(wire length) + 0.0125(die height) (1-7) 


where the “hat,” or circumflex, over pull strength indicates that this is an estimated or pre- 
dicted quantity. 

Figure 1-16 is a plot of the predicted values of pull strength versus wire length and die 
height obtained from Equation 1-7. Notice that the predicted values lie on a plane above the 
wire length—die height space. From the plot of the data in Fig. 1-15, this model does not ap- 
pear unreasonable. The empirical model in Equation 1-7 could be used to predict values of 
pull strength for various combinations of wire length and die height that are of interest. 
Essentially, the empirical model could be used by an engineer in exactly the same way that 
a mechanistic model can be used. 


1-4 PROBABILITY AND PROBABILITY MODELS 


In Section 1-1, it was mentioned that decisions often need to be based on measurements from 
only a subset of objects selected in a sample. This process of reasoning from a sample of 
objects to conclusions for a population of objects was referred to as statistical inference. A 
sample of three wafers selected from a larger production lot of wafers in semiconductor man- 
ufacturing was an example mentioned. To make good decisions, an analysis of how well a 
sample represents a population is clearly necessary. If the lot contains defective wafers, how 
well will the sample detect this? How can we quantify the criterion to “detect well’? Basically, 
how can we quantify the risks of decisions based on samples? Furthermore, how should 
samples be selected to provide good decisions—ones with acceptable risks? Probability 
models help quantify the risks involved in statistical inference, that is, the risks involved in 
decisions made every day. 

More details are useful to describe the role of probability models. Suppose a production 
lot contains 25 wafers. If all the wafers are defective or all are good, clearly any sample will 
generate all defective or all good wafers, respectively. However, suppose only one wafer in 
the lot is defective. Then a sample might or might not detect (include) the wafer. A probabil- 
ity model, along with a method to select the sample, can be used to quantify the risks that the 
defective wafer is or is not detected. Based on this analysis, the size of the sample might be 
increased (or decreased). The risk here can be interpreted as follows. Suppose a series of lots, 
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each with exactly one defective wafer, are sampled. The details of the method used to select 
the sample are postponed until randomness is discussed in the next chapter. Nevertheless, 
assume that the same size sample (such as three wafers) is selected in the same manner from 
each lot. The proportion of the lots in which the defective wafer is included in the sample or, 
more specifically, the limit of this proportion as the number of lots in the series tends to infin- 
ity, is interpreted as the probability that the defective wafer is detected. 

A probability model is used to calculate this proportion under reasonable assumptions for 
the manner in which the sample is selected. This is fortunate because we do not want to at- 
tempt to sample from an infinite series of lots. Problems of this type are worked in Chapters 2 
and 3. More importantly, this probability provides valuable, quantitative information regard- 
ing any decision about lot quality based on the sample. 

Recall from Section 1-1 that a population might be conceptual, as in an analytic study that 
applies statistical inference to future production based on the data from current production. 
When populations are extended in this manner, the role of statistical inference and the associ- 
ated probability models becomes even more important. 

In the previous example, each wafer in the sample was only classified as defective or not. 
Instead, a continuous measurement might be obtained from each wafer. In Section 1-2.5, con- 
centration measurements were taken at periodic intervals from a production process. Figure 1-8 
shows that variability is present in the measurements, and there might be concern that the 
process has moved from the target setting for concentration. Similar to the defective wafer, 
one might want to quantify our ability to detect a process change based on the sample data. 
Control limits were mentioned in Section 1-2.5 as decision rules for whether or not to adjust 
a process. The probability that a particular process change is detected can be calculated with 
a probability model for concentration measurements. Models for continuous measurements 
are developed based on plausible assumptions for the data and a result known as the central 
limit theorem, and the associated normal distribution is a particularly valuable probability 
model for statistical inference. Of course, a check of assumptions is important. These types of 
probability models are discussed in Chapter 4. The objective is still to quantify the risks in- 
herent in the inference made from the sample data. 

Throughout Chapters 6 through 15, decisions are based on statistical inference from sam- 
ple data. Continuous probability models, specifically the normal distribution, are used exten- 
sively to quantify the risks in these decisions and to evaluate ways to collect the data and how 
large a sample should be selected. 


IMPORTANT TERMS AND CONCEPTS 
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Probability 


An athletic woman in her twenties arrives at the emergency department complaining of 
dizziness after running in hot weather. An electrocardiogram is used to check for a heart at- 
tack and the patient generates an abnormal result. The test has a false positive rate 0.1 (the 
probability of an abnormal result when the patient is normal) and a false negative rate of 0.1 
(the probability of a normal result when the patient is abnormal). Furthermore, it might be 
assumed that the prior probability of a heart attack for this patient is 0.001. Although the ab- 
normal test is a concern, you might be surprised to learn that the probability of a heart at- 
tack given the electrocardiogram result is still less than 0.01. See “Why Clinicians are 
Natural Bayesians” (2005, bmj.com) for details of this example and others. 

The key is to properly combine the given probabilities. Furthermore, the exact same 
analysis used for this medical example can be applied to tests of engineered products. 
Consequently knowledge of how to manipulate probabilities in order to assess risks and 
make better decisions is important throughout scientific and engineering disciplines. In this 
chapter the laws of probability are presented and used to assess risks in cases such as this 
one and numerous others. 
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After careful study of this chapter you should be able to do the following: 
1. Understand and describe sample spaces and events for random experiments with graphs, tables, 
lists, or tree diagrams 
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2. Interpret probabilities and use probabilities of outcomes to calculate probabilities of events in 
discrete sample spaces 


3. Use permutation and combinations to count the number of outcomes in both an event and the 
sample space 


4. Calculate the probabilities of joint events such as unions and intersections from the probabilities 
of individual events 


. Interpret and calculate conditional probabilities of events 


5 
6. Determine the independence of events and use independence to calculate probabilities 
7. Use Bayes’ theorem to calculate conditional probabilities 
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. Understand random variables 


2-1 SAMPLE SPACES AND EVENTS 
2-1.1 Random Experiments 


If we measure the current in a thin copper wire, we are conducting an experiment. However, day-to- 
day repetitions of the measurement can differ slightly because of small variations in variables that 
are not controlled in our experiment, including changes in ambient temperatures, slight variations in 
gauge and small impurities in the chemical composition of the wire (if different locations are se- 
lected), and current source drifts. Consequently, this experiment (as well as many we conduct) is said 
to have a random component. In some cases, the random variations are small enough, relative to our 
experimental goals, that they can be ignored. However, no matter how carefully our experiment is 
designed and conducted, the variation is almost always present, and its magnitude can be large 
enough that the important conclusions from our experiment are not obvious. In these cases, the 
methods presented in this book for modeling and analyzing experimental results are quite valuable. 

Our goal is to understand, quantify, and model the type of variations that we often 
encounter. When we incorporate the variation into our thinking and analyses, we can make 
informed judgments from our results that are not invalidated by the variation. 

Models and analyses that include variation are not different from models used in other areas 
of engineering and science. Figure 2-1 displays the important components. A mathematical 
model (or abstraction) of the physical system is developed. It need not be a perfect abstraction. 
For example, Newton’s laws are not perfect descriptions of our physical universe. Still, they are 
useful models that can be studied and analyzed to approximately quantify the performance of a 
wide range of engineered products. Given a mathematical abstraction that is validated with 
measurements from our system, we can use the model to understand, describe, and quantify 
important aspects of the physical system and predict the response of the system to inputs. 

Throughout this text, we discuss models that allow for variations in the outputs of a system, 
even though the variables that we control are not purposely changed during our study. 
Figure 2-2 graphically displays a model that incorporates uncontrollable inputs (noise) that 
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identifies deviations from the model. 


Random 
Experiment 


combine with the controllable inputs to produce the output of our system. Because of the 
uncontrollable inputs, the same settings for the controllable inputs do not result in identical 
outputs every time the system is measured. 


An experiment that can result in different outcomes, even though it is repeated in the 
same manner every time, is called a random experiment. 


For the example of measuring current in a copper wire, our model for the system might 
simply be Ohm’s law. Because of uncontrollable inputs, variations in measurements of current 
are expected. Ohm’s law might be a suitable approximation. However, if the variations are 
large relative to the intended use of the device under study, we might need to extend our model 
to include the variation. See Fig. 2-3. 

As another example, in the design of a communication system, such as a computer or voice 
communication network, the information capacity available to serve individuals using the net- 
work is an important design consideration. For voice communication, sufficient external lines 
need to be available to meet the requirements of a business. Assuming each line can carry only 
a single conversation, how many lines should be purchased? If too few lines are purchased, calls 
can be delayed or lost. The purchase of too many lines increases costs. Increasingly, design and 
product development is required to meet customer requirements at a competitive cost. 

In the design of the voice communication system, a model is needed for the number of calls 
and the duration of calls. Even knowing that, on average, calls occur every five minutes and that 
they last five minutes is not sufficient. If calls arrived precisely at five-minute intervals and lasted 
for precisely five minutes, one phone line would be sufficient. However, the slightest variation in 
call number or duration would result in some calls being blocked by others. See Fig. 2-4. A 
system designed without considering variation will be woefully inadequate for practical use. Our 
model for the number and duration of calls needs to include variation as an integral component. 


2-1.2 Sample Spaces 


Sample Space 


To model and analyze a random experiment, we must understand the set of possible outcomes 
from the experiment. In this introduction to probability, we make use of the basic concepts of 
sets and operations on sets. It is assumed that the reader is familiar with these topics. 


The set of all possible outcomes of a random experiment is called the sample space 
of the experiment. The sample space is denoted as S. 
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A sample space is often defined based on the objectives of the analysis. The following exam- 


ple illustrates several alternatives. 


EXAMPLE 2-1 Molded Plastic Part 

Consider an experiment in which you select a molded plastic 
part, such as a connector, and measure its thickness. The 
possible values for thickness depend on the resolution of 
the measuring instrument, and they also depend on upper and 
lower bounds for thickness. However, it might be convenient 
to define the sample space as simply the positive real line 


S=R* ={x|x > 0} 
because a negative value for thickness cannot occur. 


If it is known that all connectors will be between 10 and 
11 millimeters thick, the sample space could be 


S = {x|10<x< 11} 


If the objective of the analysis is to consider only whether 
a particular part is low, medium, or high for thickness, the 
sample space might be taken to be the set of three outcomes: 


S = {low, medium, high} 
If the objective of the analysis is to consider only whether 
or not a particular part conforms to the manufacturing specifi- 


cations, the sample space might be simplified to the set of two 
outcomes 


S = {yes, no} 


that indicate whether or not the part conforms. 


It is useful to distinguish between two types of sample spaces. 


Discrete and 
Continuous 
Sample Spaces 


real numbers. 


A sample space is discrete if it consists of a finite or countable infinite set of outcomes. 
A sample space is continuous if it contains an interval (either finite or infinite) of 


In Example 2-1, the choice S = R* is an example of a continuous sample space, whereas 
S ={yes, no} is a discrete sample space. As mentioned, the best choice of a sample space 
depends on the objectives of the study. As specific questions occur later in the book, appro- 


priate sample spaces are discussed. 


EXAMPLE 2-2 Manufacturing Specifications 

If two connectors are selected and measured, the extension of 
the positive real line R is to take the sample space to be the 
positive quadrant of the plane: 


S = R* X RT 


If the objective of the analysis is to consider only whether 
or not the parts conform to the manufacturing specifications, 
either part may or may not conform. We abbreviate yes and no 
as y and n. If the ordered pair yn indicates that the first con- 
nector conforms and the second does not, the sample space 
can be represented by the four outcomes: 


S = {yy, yn, ny, nn} 


If we are only interested in the number of conforming 
parts in the sample, we might summarize the sample space as 


S = {0, 1,2} 
As another example, consider an experiment in which 


the thickness is measured until a connector fails to meet the 
specifications. The sample space can be represented as 


S = {n, yn, yyn, yyyn, yyyyn, and so forth} 


and this is an example of a discrete sample space that is count- 
ably infinite. 
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Sample spaces can also be described graphically with tree diagrams. When a sample 
space can be constructed in several steps or stages, we can represent each of the n, ways of 
completing the first step as a branch of a tree. Each of the ways of completing the second step 
can be represented as , branches starting from the ends of the original branches, and so forth. 


EXAMPLE 2-3 

Each message in a digital communication system is classi- 
fied as to whether it is received within the time specified by 
the system design. If three messages are classified, use a 
tree diagram to represent the sample space of possible out- 
comes. 


EXAMPLE 2-4 
An automobile manufacturer provides vehicles equipped with 
selected options. Each vehicle is ordered 

With or without an automatic transmission 

With or without air conditioning 

With one of three choices of a stereo system 


With one of four exterior colors 


Transmission 


Each message can be received either on time or late. The 
possible results for three messages can be displayed by eight 
branches in the tree diagram shown in Fig. 2-5. 

Practical Interpretation: A tree diagram can affectively 
represent a sample space. Even if a tree becomes too large to 
construct it can still conceptually clarify the sample space. 


If the sample space consists of the set of all possible 
vehicle types, what is the number of outcomes in the sam- 
ple space? The sample space contains 48 outcomes. The 
tree diagram for the different types of vehicles is displayed 
in Fig. 2-6. 
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Figure 2-6 Tree diagram for different types of vehicles with 48 outcomes in the sample space. 


22 CHAPTER 2 PROBABILITY 


Exterior color Red White Blue Brown 
j Interi | 
Figure 2-7 Tree nterior color Black Red A\ /\ 
diagram for different 12x2=24 12x4=48 12x3=36 12x1l=12 


types of vehicles with 
interior colors. 


EXAMPLE 2-5 Automobile Configurations 
Consider an extension of the automobile manufacturer ill- 
ustration in the previous example in which another vehicle 
option is the interior color. There are four choices of interior 
color: red, black, blue, or brown. However, 


With a red exterior, only a black or red interior can be 
chosen. 


With a white exterior, any interior color can be chosen. 


2-1.3 Events 


24 + 48 + 36 + 12 = 120 vehicle types 


With a blue exterior, only a black, red, or blue interior can 
be chosen. 


With a brown exterior, only a brown interior can be chosen. 


In Fig. 2-6, there are 12 vehicle types with each exterior 
color, but the number of interior color choices depends on the 
exterior color. As shown in Fig. 2-7, the tree diagram can be 
extended to show that there are 120 different vehicle types 
in the sample space. 


Often we are interested in a collection of related outcomes from a random experiment. 
Related outcomes can be described by subsets of the sample space and set operations can also 


be applied. 


Event 


An event is a subset of the sample space of a random experiment. 


We can also be interested in describing new events from combinations of existing events. 
Because events are subsets, we can use basic set operations such as unions, intersections, and 
complements to form other events of interest. Some of the basic set operations are summa- 


rized below in terms of events: 


e The union of two events is the event that consists of all outcomes that are contained 
in either of the two events. We denote the union as E, U EF). 


e The intersection of two events is the event that consists of all outcomes that are 
contained in both of the two events. We denote the intersection as E, M E>. 


e The complement of an event in a sample space is the set of outcomes in the sample 
space that are not in the event. We denote the complement of the event E as E’. The 
notation E° is also used in other literature to denote the complement. 


EXAMPLE 2-6 

Consider the sample space S = { yy, yn, ny, nn} in Example 2-2. 
Suppose that the subset of outcomes for which at least one part 
conforms is denoted as E;. Then, 


E, = {yy, yn, ny} 


The event in which both parts do not conform, denoted as E,, 
contains only the single outcome, E, = {nn}. Other examples 


of events are FE; = ©, the null set, and E, = S, the sample 
space. If E; = { yn, ny, nn}, 

E, VE; =S E, N Es = {yn, ny} Ej = {nn} 
Practical Interpretation: Events are used to define outcomes of 
interest from a random experiment. One is often interested in 
the probabilities of specified events. 


EXAMPLE 2-7 

Measurements of the thickness of a plastic connector might be 
modeled with the sample space S = R*, the set of positive real 
numbers. Let 


E, ={x|10<Sx<12} and E, = {x|11<x< 15} 


Then, 


E, UE, = {x| 10 =x < 15} 


EXAMPLE 2-8 Hospital Emergency Visits 

The following table summarizes visits to emergency departments 
at four hospitals in Arizona. People may leave without being seen 
by a physician, and those visits are denoted as LWBS. The re- 
maining visits are serviced at the emergency department, and the 
visitor may or may not be admitted for a stay in the hospital. 


Hospital 
1 2 3 4 Total 
Total 5292 6991 5640 4329 22,252 
LWBS 195 270 246 242 953 
Admitted 1277 1558 666 984 4485 
Not admitted 3820 5163 4728 3103 16,814 


Let A denote the event that a visit is to Hospital 1 and let B 
denote the event that the result of the visit is LWBS. Calculate 
the number of outcomes in A | B, A’, and A U B. 
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and 

E, NE, = {x| 11 <x < 12} 
Also, 

Ei = {x|x<10o0r12 = x} 
and 


Ei NE) = {x|12 sx < 15} 


The event A ™ B consists of the 195 visits to Hospital 1 
that result in LWBS. The event A’ consists of the visits to 
Hospitals 2, 3, and 4 and contains 6991 + 5640 + 4329 = 
16,690 visits. The event A U B consists of the visits to Hospi- 
tal 1 or the visits that result in LWBS, or both, and contains 
5292 + 270 + 246 + 242 = 6050 visits. Notice that the last 
result can also be calculated as the number of visits in A plus 
the number of visits in B minus the number of visits AM B 
(that would otherwise be counted twice) = 5292 + 953 — 
195 = 6050. 

Practical Interpretation: Hospitals track visits that result in 
LWBS to understand resource needs and to improve patient 
services. 


Diagrams are often used to portray relationships between sets, and these diagrams are also 
used to describe relationships between events. We can use Venn diagrams to represent a 
sample space and events in a sample space. For example, in Fig. 2-8(a) the sample space of 
the random experiment is represented as the points in the rectangle S. The events A and B are 
the subsets of points in the indicated regions. Figs. 2-8(b) to 2-8(d) illustrate additional joint 
events. Figure 2-9 illustrates two events with no common outcomes. 


Mutually 
Exclusive 
Events 


Two events, denoted as E, and E,, such that 


are said to be mutually exclusive. 


E, NE, = 


Additional results involving events are summarized below. The definition of the comple- 


ment of an event implies that 


(E) =E 
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Figure 2-8 Venn diagrams. Figure 2-9 Mutually exclusive events. 


The distributive law for set operations implies that 
(AUB)NC=(ANC)U(BNC) and (AN B)UC=(AUC)N(BUC) 
DeMorgan’s laws imply that 
(AU B)' =A'MB' and (AM B)' =A'UB' 
Also, remember that 


AMB=BNfA and AUB=BUA 


2-1.4 Counting Techniques 


In many of the examples in Chapter 2, it is easy to determine the number of outcomes in 
each event. In more complicated examples, determining the outcomes that comprise the 
sample space (or an event) becomes more difficult. Instead, counts of the numbers of 
outcomes in the sample space and various events are used to analyze the random experi- 
ments. These methods are referred to as counting techniques. Some simple rules can be 
used to simplify the calculations. 

In Example 2-4, an automobile manufacturer provides vehicles equipped with selected 
options. Each vehicle is ordered 


With or without an automatic transmission 
With or without air conditioning 
With one of three choices of a stereo system 


With one of four exterior colors 


Multiplication 
Rule (for 
counting 

techniques) 


EXAMPLE 2-9 
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The tree diagram in Fig. 2-6 describes the sample space of all possible vehicle types. The size 
of the sample space equals the number of branches in the last level of the tree, and this quantity 
equals 2 X 2 X 3 X 4= 48. This leads to the following useful result. 


Assume an operation can be described as a sequence of k steps, and 


the number of ways of completing step 1 is 7,, and 


the number of ways of completing step 2 is n, for each way of completing 
step 1, and 


the number of ways of completing step 3 is n; for each way of completing 
step 2, and 


so forth. 


The total number of ways of completing the operation is 


ny XN X+* X Ny 


In the design of a casing for a gear housing, we can use four Practical Interpretation: The multipication rule and other 

different types of fasteners, three different bolt lengths, and counting techniques enables one to easily determine the num- 

three different bolt locations. From the multiplication rule, ber of outcomes in a sample space or events and this, in turn, 

4 xX 3 X 3 = 36 different designs are possible. allows probabilities of events to be determined. 
Permutations 


Another useful calculation is the number of ordered sequences of the elements of a set. 
Consider a set of elements, such as S = {a, b, c}. A permutation of the elements is an ordered 
sequence of the elements. For example, abc, acb, bac, bca, cab, and cba are all of the permu- 
tations of the elements of S. 


The number of permutations of n different elements is n! where 


n!=nX(n—1)X (n—-2)X-+X2X1 (2-1) 


This result follows from the multiplication rule. A permutation can be constructed by select- 
ing the element to be placed in the first position of the sequence from the n elements, then 
selecting the element for the second position from the n — 1 remaining elements, then select- 
ing the element for the third position from the remaining n — 2 elements, and so forth. 
Permutations such as these are sometimes referred to as linear permutations. 

In some situations, we are interested in the number of arrangements of only some of the 
elements of a set. The following result also follows from the multiplication rule. 


26 CHAPTER 2 PROBABILITY 


Permutations 
of Subsets 


elements is 


The number of permutations of subsets of r elements selected from a set of n different 


Ph=nx(@—1)x@—2) x xX rs I= 


(n — r)! ee 


EXAMPLE 2-10 Printed Circuit Board 

A printed circuit board has eight different locations in which a 

component can be placed. If four different components are to be 

placed on the board, how many different designs are possible? 
Each design consists of selecting a location from the 

eight locations for the first component, a location from the re- 

maining seven for the second component, a location from the 


remaining six for the third component, and a location from the 
remaining five for the fourth component. Therefore, 


! 


8 
P 8X7X6X5=7, 


Il 


1680 different designs are possible. 


Sometimes we are interested in counting the number of ordered sequences for objects that 
are not all different. The following result is a useful, general calculation. 


Permutations 
of Similar The number of permutations of n = n, + n) + ++: + n, objects of which n, are of 
Objects one type, 1 are of a second type, ... , and n, are of an rth type is 
n! 
TE eee (2-3) 
ny! no! n3!...n,! 


EXAMPLE 2-11 Machine Shop Schedule 

Consider a machining operation in which a piece of sheet 
metal needs two identical-diameter holes drilled and two 
identical-size notches cut. We denote a drilling operation as d 
and a notching operation as n. In determining a schedule for a 
machine shop, we might be interested in the number of different 
possible sequences of the four operations. The number of 


EXAMPLE 2-12 Bar Codes 

A part is labeled by printing with four thick lines, three 
medium lines, and two thin lines. If each ordering of the nine 
lines represents a different label, how many different labels 
can be generated by using this scheme? 


Combinations 


possible sequences for two drilling operations and two notch- 
ing operations is 
4! 
2! 2! 


The six sequences are easily summarized: ddnn, dndn, dnnd, 
nddn, ndnd, nndd. 


From Equation 2-3, the number of possible part labels is 


9! 
4! 3! 2! 


= 1260 


Another counting problem of interest is the number of subsets of r elements that can be selected 
from a set of n elements. Here, order is not important. These are called combinations. Every 
subset of r elements can be indicated by listing the elements in the set and marking each 
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element with a ‘“*” if it is to be included in the subset. Therefore, each permutation of r *’s and 
n — r blanks indicates a different subset and the numbers of these are obtained from Equation 2-3. 


For example, if the set is S = {a, b, c, d}, the subset {a, c} can be indicated as 


abed 


Ey ES 


Combinations 
The number of combinations, subsets of size r that can be selected from a set of n 
elements, is denoted as (/) or C? and 
c" (@) = (2-4) 
Z r ri(n — r)! 
EXAMPLE 2-13 


A printed circuit board has eight different locations in which 
a component can be placed. If five identical components are 
to be placed on the board, how many different designs are 
possible? 


Each design is a subset of size five from the eight loca- 
tions that are to contain the components. From Equation 2-4, 
the number of possible designs is 


8! 


51a 


The following example uses the multiplication rule in combination with Equation 2-4 to 
answer a more difficult, but common, question. In random experiments in which items are 
selected from a batch, an item may or may not be replaced before the next one is selected. This 
is referred to as sampling with or without replacement, respectively. 


EXAMPLE 2-14 Sampling without Replacement 

A bin of 50 manufactured parts contains three defective parts 
and 47 nondefective parts. A sample of six parts is selected 
from the 50 parts without replacement. That is, each part 
can only be selected once and the sample is a subset of the 
50 parts. How many different samples are there of size six that 
contain exactly two defective parts? 

A subset containing exactly two defective parts can be 
formed by first choosing the two defective parts from the 
three defective parts. Using Equation 2-4, this step can be 
completed in 


7. = 3 differ 
a) Se ifferent ways 


Then, the second step is to select the remaining four parts 
from the 47 acceptable parts in the bin. The second step can be 
completed in 


sp ee = 178,365 different 
4) Mal ; ifferent ways 


Therefore, from the multiplication rule, the number of subsets 
of size six that contain exactly two defective items is 


3 X 178,365 = 535,095 


As an additional computation, the total number of different 
subsets of size six is found to be 


50 50! 
= =] 7 
@ 61 44! 5,890,700 


When probability is discussed in this chapter, the proba- 
bility of an event is determined as the ratio of the number of 
outcomes in the event to the number of outcomes in the 
sample space (for equally likely outcomes). Therefore, the 
probability that a sample contains exactly two defective parts is 


535,095 


——__ = 0,034 
15,890,700” 


Note that this example illustrates a common distribution 
studied in Chapter 3 (hypergeometric distribution). 
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EXERCISES FOR SECTION 2-1 


Provide a reasonable description of the sample space for each 
of the random experiments in Exercises 2-1 to 2-17. There can 
be more than one acceptable interpretation of each experiment. 
Describe any assumptions you make. 

2-1. Each of three machined parts is classified as either 
above or below the target specification for the part. 

2-2. Each of four transmitted bits is classified as either in 
error or not in error. 

2-3. In the final inspection of electronic power supplies, 
either units pass or three types of nonconformities might occur: 
functional, minor, or cosmetic. Three units are inspected. 

2-4. The number of hits (views) is recorded at a high-volume 
Web site in a day. 

2-5. Each of 24 Web sites is classified as containing or not 
containing banner ads. 

2-6. An ammeter that displays three digits is used to mea- 
sure current in milliamperes. 

2-7. A scale that displays two decimal places is used to 
measure material feeds in a chemical plant in tons. 

2-8. The following two questions appear on an employee 
survey questionnaire. Each answer is chosen from the five- 
point scale 1 (never), 2, 3, 4, 5 (always). 


Is the corporation willing to listen to and fairly evaluate 
new ideas? 


How often are my coworkers important in my overall job 
performance? 


2-9. The concentration of ozone to the nearest part per billion. 


2-10. The time until a service transaction is requested of a 
computer to the nearest millisecond. 


2-11. The pH reading of a water sample to the nearest tenth 
of a unit. 
2-12. The voids in a ferrite slab are classified as small, 


medium, or large. The number of voids in each category is 
measured by an optical inspection of a sample. 


2-13. The time of a chemical reaction is recorded to the 
nearest millisecond. 


2-14. An order for an automobile can specify either an 
automatic or a standard transmission, either with or without 
air-conditioning, and any one of the four colors red, blue, black, 
or white. Describe the set of possible orders for this experiment. 
2-15. A sampled injection-molded part could have been 
produced in either one of two presses and in any one of the 
eight cavities in each press. 

2-16. An order for a computer system can specify memory 
of 4, 8, or 12 gigabytes, and disk storage of 200, 300, or 400 
gigabytes. Describe the set of possible orders. 

2-17. Calls are repeatedly placed to a busy phone line until 
a connection is achieved. 


2-18. Inamagnetic storage device, three attempts are made 
to read data before an error recovery procedure that reposi- 
tions the magnetic head is used. The error recovery procedure 
attempts three repositionings before an “abort” message is 
sent to the operator. Let 


s denote the success of a read operation 

f denote the failure of a read operation 

F denote the failure of an error recovery procedure 
S denote the success of an error recovery procedure 
A denote an abort message sent to the operator. 


Describe the sample space of this experiment with a tree 
diagram. 

2-19. Three events are shown on the Venn diagram in the 
following figure: 


Reproduce the figure and shade the region that corresponds to 
each of the following events. 

(a) A’ (b) ANB 

(d) (BUC)' (e) (AN B)'UC 
2-20. Three events are shown on the Venn diagram in the 
following figure: 


(c) (ANB)UC 


Reproduce the figure and shade the region that corresponds to 
each of the following events. 

(a) A’ (b) (AM B)U(ANB’) 

(c) (AN B)UC (d) (BUC)’ 

(e) (AN B)'UC 

2-21. A digital scale is used that provides weights to the 
nearest gram. 

(a) What is the sample space for this experiment? 


Let A denote the event that a weight exceeds 11 grams, let B 
denote the event that a weight is less than or equal to 15 grams, 
and let C denote the event that a weight is greater than or equal 
to 8 grams and less than 12 grams. 

Describe the following events. 


(b) AUB (c) ANB 

(d) A’ (ec) AUBUC 

(f) (AUC)! (gs ANBNC 

(hy) BNC (i) AU(BN C) 

2-22. In an injection-molding operation, the length and 


width, denoted as X and Y, respectively, of each molded part 
are evaluated. Let 


A denote the event of 48 < X < 52 centimeters 


B denote the event of 9 < Y < 11 centimeters 


Construct a Venn diagram that includes these events. Shade 

the areas that represent the following: 

(a) A (b) ANB 

(c) AAUB (d) AUB 

(e) If these events were mutually exclusive, how successful 
would this production operation be? Would the process pro- 
duce parts with X = 50 centimeters and Y = 10 centimeters? 


2-23. Four bits are transmitted over a digital communica- 
tions channel. Each bit is either distorted or received without 
distortion. Let A; denote the event that the ith bit is distorted, 
i=1,...,4. 

(a) Describe the sample space for this experiment. 

(b) Are the A;’s mutually exclusive? 

Describe the outcomes in each of the following events: 


(c) Ay (d) Ay’ 
(e) Aj M Ad M A3 M A4 (f) (A, M Az) Wy (A3 M A4) 
2-24. In light-dependent photosynthesis, light quality refers 


to the wavelengths of light that are important. The wavelength 
of a sample of photosynthetically active radiations (PAR) is 
measured to the nearest nanometer. The red range is 675-700 nm 
and the blue range is 450-500 nm. Let A denote the event that 
PAR occurs in the red range and let B denote the event that 
PAR occurs in the blue range. Describe the sample space and 
indicate each of the following events: 


(a) A (b) B (c) ANB (4) AUB 


2-25. In control replication, cells are replicated over a 
period of two days. Not until mitosis is completed can 
freshly synthesized DNA be replicated again. Two control 
mechanisms have been identified—one positive and one 
negative. Suppose that a replication is observed in three 
cells. Let A denote the event that all cells are identified as 
positive and let B denote the event that all cells are negative. 
Describe the sample space graphically and display each of 
the following events: 

(a) A (b) B 

(c) ANB (d) AUB 
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2-26. Disks of polycarbonate plastic from a supplier are 
analyzed for scratch and shock resistance. The results from 
100 disks are summarized below: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Let A denote the event that a disk has high shock resistance, and 
let B denote the event that a disk has high scratch resistance. 
Determine the number of disks in A M B, A’, and A U B. 
2-27. Samples of a cast aluminum part are classified on the 
basis of surface finish (in microinches) and edge finish. The 
results of 100 parts are summarized as follows: 


edge finish 
excellent good 
surface excellent 80 2 
finish good 10 8 


(a) Let A denote the event that a sample has excellent surface 
finish, and let B denote the event that a sample has excel- 
lent edge finish. Determine the number of samples in 
A'()B, B’, and A UB. 

(b) Assume that each of two samples is to be classified on the 
basis of surface finish, either excellent or good, and on the 
basis of edge finish, either excellent or good. Use a tree 
diagram to represent the possible outcomes of this 
experiment. 

2-28. Samples of emissions from three suppliers are classi- 


fied for conformance to air-quality specifications. The results 
from 100 samples are summarized as follows: 


conforms 
yes no 
22 8 
supplier 2 25 5 
3 30 10 


Let A denote the event that a sample is from supplier 1, and let 
B denote the event that a sample conforms to specifications. 
Determine the number of samples in A’ M B, B’, and A U B. 


2-29. The rise time of a reactor is measured in minutes 
(and fractions of minutes). Let the sample space be positive, 
real numbers. Define the events A and B as follows: 
A = {x|x < 72.5} and B = {x|x > 52.5}. 

Describe each of the following events: 

(a) A’ (b) B' 

(c) ANB (d) AUB 
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2-30. A sample of two items is selected without replace- 
ment from a batch. Describe the (ordered) sample space for 
each of the following batches: 

(a) The batch contains the items {a, b, c, d}. 

(b) The batch contains the items {a, b, c, d, e, f; g}. 

(c) The batch contains 4 defective items and 20 good items. 
(d) The batch contains 1 defective item and 20 good items. 


2-31. A sample of two printed circuit boards is selected 
without replacement from a batch. Describe the (ordered) sample 
space for each of the following batches: 

(a) The batch contains 90 boards that are not defective, eight 
boards with minor defects, and two boards with major 
defects. 

(b) The batch contains 90 boards that are not defective, eight 
boards with minor defects, and one board with major 
defects. 


2-32. Counts of the Web pages provided by each of two 

computer servers in a selected hour of the day are recorded. 

Let A denote the event that at least 10 pages are provided by 

server | and let B denote the event that at least 20 pages are 

provided by server 2. 

(a) Describe the sample space for the numbers of pages for 
the two servers graphically in an x — y plot. 


Show each of the following events on the sample space graph: 

(b) A (c) B 

(d) ANB (e) AUB 

2-33. Therise time of a reactor is measured in minutes (and 

fractions of minutes). Let the sample space for the rise time of 

each batch be positive, real numbers. Consider the rise times 

of two batches. Let A denote the event that the rise time of 

batch 1| is less than 72.5 minutes, and let B denote the event 

that the rise time of batch 2 is greater than 52.5 minutes. 
Describe the sample space for the rise time of two batches 

graphically and show each of the following events on a two- 

dimensional plot: 

(a) A (b) B' 

(c) ANB (d) AUB 

2-34. A wireless garage door opener has a code determined 

by the up or down setting of 12 switches. How many out- 

comes are in the sample space of possible codes? 


2-35. An order for a computer can specify any one of five 
memory sizes, any one of three types of displays, and any one of 
four sizes of a hard disk, and can either include or not include a 
pen tablet. How many different systems can be ordered? 


2-36. In a manufacturing operation, a part is produced by 
machining, polishing, and painting. If there are three machine 
tools, four polishing tools, and three painting tools, how many 
different routings (consisting of machining, followed by pol- 
ishing, and followed by painting) for a part are possible? 


2-37. New designs for a wastewater treatment tank have 
proposed three possible shapes, four possible sizes, three loca- 
tions for input valves, and four locations for output valves. 
How many different product designs are possible? 


2-38. A manufacturing process consists of 10 operations 
that can be completed in any order. How many different pro- 
duction sequences are possible? 


2-39. A manufacturing operation consists of 10 operations. 
However, five machining operations must be completed before 
any of the remaining five assembly operations can begin. 
Within each set of five, operations can be completed in any 
order. How many different production sequences are possible? 


2-40. Ina sheet metal operation, three notches and four 

bends are required. If the operations can be done in any order, 

how many different ways of completing the manufacturing are 

possible? 

2-41. A batch of 140 semiconductor chips is inspected by 

choosing a sample of five chips. Assume 10 of the chips do not 

conform to customer requirements. 

(a) How many different samples are possible? 

(b) How many samples of five contain exactly one noncon- 
forming chip? 

(c) How many samples of five contain at least one noncon- 
forming chip? 

2-42. In the layout of a printed circuit board for an elec- 

tronic product, there are 12 different locations that can accom- 

modate chips. 

(a) If five different types of chips are to be placed on the 
board, how many different layouts are possible? 

(b) If the five chips that are placed on the board are of the 
same type, how many different layouts are possible? 


2-43. Inthe laboratory analysis of samples from a chemical 
process, five samples from the process are analyzed daily. In 
addition, a control sample is analyzed two times each day to 
check the calibration of the laboratory instruments. 

(a) How many different sequences of process and control 
samples are possible each day? Assume that the five 
process samples are considered identical and that the two 
control samples are considered identical. 

(b) How many different sequences of process and control 
samples are possible if we consider the five process samples 
to be different and the two control samples to be identical? 

(c) For the same situation as part (b), how many sequences 
are possible if the first test of each day must be a control 
sample? 

2-44. In the design of an electromechanical product, 12 

components are to be stacked into a cylindrical casing in a 

manner that minimizes the impact of shocks. One end of the 

casing is designated as the bottom and the other end is the top. 

(a) If all components are different, how many different de- 
signs are possible? 

(b) If seven components are identical to one another, but the oth- 
ers are different, how many different designs are possible? 

(c) If three components are of one type and identical to one 
another, and four components are of another type and 
identical to one another, but the others are different, how 
many different designs are possible? 
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2-45. Consider the design of a communication system. 

(a) How many three-digit phone prefixes that are used to rep- 
resent a particular geographic area (such as an area code) 
can be created from the digits 0 through 9? 

(b) As in part (a), how many three-digit phone prefixes are 
possible that do not start with 0 or 1, but contain 0 or 1 as 
the middle digit? 

(c) How many three-digit phone prefixes are possible in 
which no digit appears more than once in each prefix? 


2-46. A byte is a sequence of eight bits and each bit is 

either 0 or 1. 

(a) How many different bytes are possible? 

(b) If the first bit of a byte is a parity check, that is, the first 
byte is determined from the other seven bits, how many 
different bytes are possible? 


2-47. Inachemical plant, 24 holding tanks are used for final 
product storage. Four tanks are selected at random and without 
replacement. Suppose that six of the tanks contain material in 
which the viscosity exceeds the customer requirements. 

(a) What is the probability that exactly one tank in the sample 
contains high-viscosity material? 

(b) What is the probability that at least one tank in the sample 
contains high-viscosity material? 

(c) In addition to the six tanks with high viscosity levels, four 
different tanks contain material with high impurities. 
What is the probability that exactly one tank in the sample 
contains high-viscosity material and exactly one tank in 
the sample contains material with high impurities? 

2-48. Plastic parts produced by an injection-molding opera- 
tion are checked for conformance to specifications. Each tool 
contains 12 cavities in which parts are produced, and these 
parts fall into a conveyor when the press opens. An inspector 
chooses three parts from among the 12 at random. Two cavi- 
ties are affected by a temperature malfunction that results in 
parts that do not conform to specifications. 

(a) What is the probability that the inspector finds exactly one 
nonconforming part? 

(b) What is the probability that the inspector finds at least one 
nonconforming part? 

2-49. A bin of 50 parts contains five that are defective. A 

sample of two parts is selected at random, without replace- 

ment. Determine the probability that both parts in the sample 
are defective. 

2-50. The following table summarizes 204 endothermic re- 

actions involving sodium bicarbonate. 


Final Temperature Heat Absorbed (cal) 
Conditions Below Target Above Target 
266 K 12 40 
271K 44 16 
274K 56 36 


Let A denote the event that a reaction final temperature is 271 K 
or less. Let B denote the event that the heat absorbed is below 
target. Determine the number of reactions in each of the fol- 
lowing events. 

(a) ANB (b) A’ (c) AUB ()AUB' Ce) A'NB 
2-51. A Web ad can be designed from four different colors, 
three font types, five font sizes, three images, and five text 
phrases. How many different designs are possible? 


2-52. Consider the hospital emergency department data in 
Example 2-8. Let A denote the event that a visit is to Hospital 
1 and let B denote the event that a visit results in admittance to 
any hospital. Determine the number of persons in each of the 
following events. 

(a) ANB (b) A’ (c) AUB (A) AUB’ Ce) A'NB 
2-53. An article in The Journal of Data Science [“A 
Statistical Analysis of Well Failures in Baltimore County” 
(2009, Vol. 7, pp. 111-127)] provided the following table of 
well failures for different geological formation groups in 
Baltimore County. 


Wells 
Geological Formation Group Failed Total 
Gneiss 170 1685 
Granite 2 28 
Loch raven schist 443 3733 
Mafic 14 363 
Marble 29 309 
Prettyboy schist 60 1403 
Other schists 46 933 
Serpentine 3 39 


Let A denote the event that the geological formation has more 
than 1000 wells and let B denote the event that a well failed. 
Determine the number of wells in each of the following events. 
(a) ANB (b) A’ (c) AUB AUB’ C)A'NB 


2-2. INTERPRETATIONS AND AXIOMS OF PROBABILITY 


In this chapter, we introduce probability for discrete sample spaces—those with only a finite 
(or countably infinite) set of outcomes. The restriction to these sample spaces enables us to 
simplify the concepts and the presentation without excessive mathematics. 

Probability is used to quantify the likelihood, or chance, that an outcome of a random 
experiment will occur. “The chance of rain today is 30%” is a statement that quantifies our 
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Figure 2-10 Relative 
frequency of corrupted 
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feeling about the possibility of rain. The likelihood of an outcome is quantified by assigning 
a number from the interval [0, 1] to the outcome (or a percentage from 0 to 100%). Higher 
numbers indicate that the outcome is more likely than lower numbers. A 0 indicates an outcome 
will not occur. A probability of 1 indicates an outcome will occur with certainty. 

The probability of an outcome can be interpreted as our subjective probability, or 
degree of belief, that the outcome will occur. Different individuals will no doubt assign dif- 
ferent probabilities to the same outcomes. Another interpretation of probability is based on 
the conceptual model of repeated replications of the random experiment. The probability of 
an outcome is interpreted as the limiting value of the proportion of times the outcome oc- 
curs in n repetitions of the random experiment as n increases beyond all bounds. For 
example, if we assign probability 0.2 to the outcome that there is a corrupted pulse in a dig- 
ital signal, we might interpret this assignment as implying that, if we analyze many pulses, 
approximately 20% of them will be corrupted. This example provides a relative frequency 
interpretation of probability. The proportion, or relative frequency, of replications of the ex- 
periment that result in the outcome is 0.2. Probabilities are chosen so that the sum of the 
probabilities of all outcomes in an experiment adds up to 1. This convention facilitates 
the relative frequency interpretation of probability. Figure 2-10 illustrates the concept of rel- 
ative frequency. 

Probabilities for a random experiment are often assigned on the basis of a reasonable 
model of the system under study. One approach is to base probability assignments on the 
simple concept of equally likely outcomes. 

For example, suppose that we will select one laser diode randomly from a batch of 100. 
Randomly implies that it is reasonable to assume that each diode in the batch has an equal chance 
of being selected. Because the sum of the probabilities must equal 1, the probability model for this 
experiment assigns probability of 0.01 to each of the 100 outcomes. We can interpret the proba- 
bility by imagining many replications of the experiment. Each time we start with all 100 diodes 
and select one at random. The probability 0.01 assigned to a particular diode represents the 
proportion of replicates in which a particular diode is selected. When the model of equally likely 
outcomes is assumed, the probabilities are chosen to be equal. 


Whenever a sample space consists of N possible outcomes that are equally likely, the 
probability of each outcome is 1/N. 


It is frequently necessary to assign probabilities to events that are composed of several 
outcomes from the sample space. This is straightforward for a discrete sample space. 
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Figure 2-11 
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EXAMPLE 2-15 Laser Diodes 

Assume that 30% of the laser diodes in a batch of 100 meet the 
minimum power requirements of a specific customer. If a laser 
diode is selected randomly, that is, each laser diode is equally 
likely to be selected, our intuitive feeling is that the probabil- 
ity of meeting the customer’s requirements is 0.30. 


Let E denote the subset of 30 diodes that meet the cus- 
tomer’s requirements. Because E contains 30 outcomes and 
each outcome has probability 0.01, we conclude that the 
probability of E is 0.3. The conclusion matches our intu- 
ition. Figure 2-11 illustrates this example. 


For a discrete sample space, the probability of an event can be defined by the reasoning 


used in the example above. 


Probability of 
an Event 


For a discrete sample space, the probability of an event E, denoted as P(E), equals the 
sum of the probabilities of the outcomes in E. 


EXAMPLE 2-16 


A random experiment can result in one of the outcomes {a, b, c, d} 
with probabilities 0.1, 0.3, 0.5, and 0.1, respectively. Let A denote 
the event {a, b}, B the event {b, c, d}, and C the event {d}.Then, 


P(A) =0.1+ 0.3 =04 
P(B) =0.3+05+0.1 =09 
P(C) =0.1 


EXAMPLE 2-17 Contamination Particles 
A visual inspection of a location on wafers from a semicon- 
ductor manufacturing process resulted in the following table: 


Number of 
Contamination 
Particles Proportion of Wafers 
0 0.40 
1 0.20 
2 0.15 
3 0.10 
4 0.05 
5 or more 0.10 


If one wafer is selected randomly from this process and the 
location is inspected, what is the probability that it contains no 
particles? If information were available for each wafer, we could 
define the sample space as the set of all wafers inspected and 
proceed as in the example with diodes. However, this level of 


Also, P(A’) = 0.6, P(B’) = 0.1, and P(C’) = 0.9. Further- 
more, because AM B = {b}, P(AM B) = 0.3. Because 
AUB ={a,b,c,d}, (AUB) =0.1+03+05+01=1, 
Because A 1 C is the null set, P(A M C) = 0. 


detail is not needed in this case. We can consider the sample 
space to consist of the six categories that summarize the number 
of contamination particles on a wafer. Each category has proba- 
bility equal to the proportion of wafers in the category. The event 
that there is no contamination particle in the inspected location 
on the wafer, denoted as E, can be considered to be comprised of 
the single outcome, namely, E = {0}. Therefore, 


What is the probability that a wafer contains three or 
more particles in the inspected location? Let FE denote the 
event that a wafer contains three or more particles in the in- 
spected location. Then, E consists of the three outcomes {3, 4, 
5 or more}. Therefore, 


P(E) = 0.10 + 0.05 + 0.10 = 0.25 


Practical Interpretation: Contamination levels affect the yield of 
functional devices in semiconductor manufacturing so that 
probabilities such as these are regularly studied. 
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Often more than one item is selected from a batch without replacement when production 
is inspected. In this case, randomly selected implies that each possible subset of items is 
equally likely. 


EXAMPLE 2-18 

Suppose a batch contains six parts {a, b, c, d, e, f} and two ation there are 15 outcomes. Let E denote the event that part f 
parts are selected randomly, without replacement. Suppose is in the sample. Then E can be written as E = {{a, f}, {b, f}, 
that part f is defective, but the others are good. What is the {c, f}, {d, f}, {e, f}}. Because each outcome is equally likely, 
probability that part f appears in the sample? P(E) = 5/15 = 1/3. 


The sample space consists of all possible (unordered) pairs 
selected without replacement. From Equation 2-4 or by enumer- 


Axioms of 
Probability 


Now that the probability of an event has been defined, we can collect the assumptions that 
we have made concerning probabilities into a set of axioms that the probabilities in any ran- 
dom experiment must satisfy. The axioms ensure that the probabilities assigned in an experi- 
ment can be interpreted as relative frequencies and that the assignments are consistent with 
our intuitive understanding of relationships between relative frequencies. For example, if 
event A is contained in event B, we should have P(A) = P(B). The axioms do not determine 
probabilities; the probabilities are assigned based on our knowledge of the system under 
study. However, the axioms enable us to easily calculate the probabilities of some events from 
knowledge of the probabilities of other events. 


Probability is a number that is assigned to each member of a collection of events 
from a random experiment that satisfies the following properties: 
If S is the sample space and E is any event in a random experiment, 


UD eS 
(2) OSP(E)=1 
(3) For two events E, and E, with E, 1 Ey = © 


P(E, U Ey) = P(E\) + P(E) 


The property that 0 = P(E) = 1 is equivalent to the requirement that a relative frequency 
must be between 0 and 1. The property that P(S) = 1 is a consequence of the fact that an 
outcome from the sample space occurs on every trial of an experiment. Consequently, the 
relative frequency of S is 1. Property 3 implies that if the events E, and E, have no outcomes 
in common, the relative frequency of outcomes in E; U E; is the sum of the relative frequen- 
cies of the outcomes in FE, and EF). 

These axioms imply the following results. The derivations are left as exercises at the end 
of this section. Now, 


and for any event E, 
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For example, if the probability of the event E is 0.4, our interpretation of relative 
frequency implies that the probability of £’ is 0.6. Furthermore, if the event E, is contained 


in the event E,, 


EXERCISES FOR SECTION 2-2 


P(E,) = P(E) 


2-54. Each of the possible five outcomes of a random ex- 
periment is equally likely. The sample space is {a, b, c, d, e}. 
Let A denote the event {a, b}, and let B denote the event 
{c, d, e}. Determine the following: 


(a) P(A) (b) PCB) 

(c) P(A’) @) P(AUB) 

(e) P(A MB) 

2-55. The sample space of a random experiment is {a, b, c, 


d, e} with probabilities 0.1, 0.1, 0.2, 0.4, and 0.2, respectively. 
Let A denote the event {a, b, c}, and let B denote the event 
{c, d, e}. Determine the following: 


(a) P(A) (b) P(B) 
(c) P(A’) (d) P(AUB) 
(ce) P(A NB) 


2-56. Orders for a computer are summarized by the op- 
tional features that are requested as follows: 


proportion of orders 
no optional features 0.3 
one optional feature 0.5 
more than one optional feature 0.2 


(a) What is the probability that an order requests at least one 
optional feature? 

(b) What is the probability that an order does not request 
more than one optional feature? 


2-57. Tf the last digit of a weight measurement is equally 

likely to be any of the digits 0 through 9, 

(a) What is the probability that the last digit is 0? 

(b) What is the probability that the last digit is greater than or 
equal to 5? 

2-58. A part selected for testing is equally likely to have 

been produced on any one of six cutting tools. 

(a) What is the sample space? 

(b) What is the probability that the part is from tool 1? 

(c) What is the probability that the part is from tool 3 or tool 5? 

(d) What is the probability that the part is not from tool 4? 


2-59. An injection-molded part is equally likely to be ob- 

tained from any one of the eight cavities on a mold. 

(a) What is the sample space? 

(b) What is the probability a part is from cavity 1 or 2? 

(c) What is the probability that a part is from neither cavity 3 
nor 4? 


2-60. In an acid-base titration, a base or acid is gradually 
added to the other until they have completely neutralized 
each other. Because acids and bases are usually colorless (as 
are the water and salt produced in the neutralization reac- 
tion), pH is measured to monitor the reaction. Suppose that 
the equivalence point is reached after approximately 100 mL 
of a NaOH solution have been added (enough to react with 
all the acetic acid present) but that replicates are equally 
likely to indicate from 95 to 104 mL to the nearest mL. 
Assume that volumes are measured to the nearest mL and 
describe the sample space. 
(a) What is the probability that equivalence is indicated at 
100 mL? 
(b) What is the probability that equivalence is indicated at 
less than 100 mL? 
(c) What is the probability that equivalence is indicated be- 
tween 98 and 102 mL (inclusive)? 
2-61. Ina NiCd battery, a fully charged cell is composed 
of Nickelic Hydroxide. Nickel is an element that has multiple 
oxidation states that is usually found in the following states: 


nickel charge proportions found 


0 0.17 
+2 0.35 
oo 0.33 
+4 0.15 


(a) What is the probability that a cell has at least one of the 
positive nickel-charged options? 

(b) What is the probability that a cell is not composed of a 
positive nickel charge greater than +3? 


2-62. <A credit card contains 16 digits between 0 and 9. 
However, only 100 million numbers are valid. If a number is 
entered randomly, what is the probability that it is a valid 
number? 


2-63. Suppose your vehicle is licensed in a state that issues 
license plates that consist of three digits (between 0 and 9) fol- 
lowed by three letters (between A and Z). If a license number 
is selected randomly, what is the probability that yours is the 
one selected? 


2-64. A message can follow different paths through servers 
on a network. The sender’s message can go to one of five 
servers for the first step; each of them can send to five servers 
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at the second step; each of those can send to four servers at the 

third step; and then the message goes to the recipient’s server. 

(a) How many paths are possible? 

(b) If all paths are equally likely, what is the probability that a 
message passes through the first of four servers at the 
third step? 

2-65. Magnesium alkyls are used as homogenous catalysts 
in the production of linear low-density polyethylene (LLDPE), 
which requires a finer magnesium powder to sustain a reaction. 
Redox reaction experiments using four different amounts of 
magnesium powder are performed. Each result may or may not 
be further reduced in a second step using three different mag- 
nesium powder amounts. Each of these results may or may not 
be further reduced in a third step using three different amounts 
of magnesium powder. 

(a) How many experiments are possible? 

(b) If all outcomes are equally likely, what is the probability that 
the best result is obtained from an experiment that uses all 
three steps? 

(c) Does the result in the previous question change if five or 
six or seven different amounts are used in the first step? 
Explain. 

2-66. Disks of polycarbonate plastic from a supplier are 

analyzed for scratch and shock resistance. The results from 

100 disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Average Frequencies and Operations in TPC-C 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch resist- 
ance. If a disk is selected at random, determine the following 
probabilities: 


(a) P(A) (b) P(B) 

(c) P(A’) ~~ (d) PAN B) 

(e) PAUB) (f) P(A’ UB) 

2-67. Samples of emissions from three suppliers are classi- 


fied for conformance to air-quality specifications. The results 
from 100 samples are summarized as follows: 


conforms 
yes no 
22 8 
supplier 2 25, 5 
3 30 10 


Let A denote the event that a sample is from supplier 1, and let 
B denote the event that a sample conforms to specifications. 
If a sample is selected at random, determine the following 
probabilities: 


(a) P(A) (b) P(B) 

(c) P(A’) (d) P(AMB) 

(e) PAUB) (f) P(A’ UB) 

2-68. An article in the Journal of Database Management 


(“Experimental Study of a Self-Tuning Algorithm for DBMS 
Buffer Pools” (2005, Vol. 16, pp. 1—20)] provided the workload 
used in the TPC-C OLTP (Transaction Processing Performance 
Council’s Version C On-Line Transaction Processing) bench- 
mark, which simulates a typical order entry application. 


Transaction Frequency Selects Updates Inserts Deletes Non-Unique Selects Joins 
New order 43 23 11 12 0 0 0 
Payment 44 4.2 3 1 0 0.6 0 
Order status 4 11.4 0 0 0 0.6 0 
Delivery 5 130 120 0 10 0 0 
Stock level 4 0 0 0 0 0 1 


The frequency of each type of transaction (in the second 
column) can be used as the percentage of each type of trans- 
action. The average number of selects operations required for 
each type of transaction is shown. Let A denote the event of 
transactions with an average number of selects operations of 
12 or fewer. Let B denote the event of transactions with an 
average number of updates operations of 12 or fewer. 
Calculate the following probabilities. 
(a) P(A) (b) P(B) 

(d) P(AM B’) (e) P(A UB) 


(c) P(AMB) 


2-69. Use the axioms of probability to show the following: 
(a) For any event FE, P(E’) = 1 — P(E). 

(b) P(@) = 0 

(c) If A is contained in B, then P(A) = P(B). 

2-70. Consider the endothermic reactions in Exercise 2-50. 
Let A denote the event that a reaction final temperature is 
271 K or less. Let B denote the event that the heat absorbed is 
above target. 

Determine the following probabilities. 
(a) PANB) —(b) P(A’) 

(d) P(A U B’) (e) P(A’ MB’) 


(c) P(A UB) 


2-71. A Web ad can be designed from four different colors, 
three font types, five font sizes, three images, and five text 
phrases. A specific design is randomly generated by the Web 
server when you visit the site. If you visit the site five times, what 
is the probability that you will not see the same design? 


2-72. Consider the hospital emergency room data in 
Example 2-8. Let A denote the event that a visit is to Hospital 
4 and let B denote the event that a visit results in LWBS (at any 
hospital). Determine the following probabilities. 


2-3 ADDITION RULES 
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(a) P(ANM B) (b) P(A’) (c) P(A U B) 
(d) P(A UB’) (e) P(A’ MB’) 
2-73. Consider the well failure data in Exercise 2-53. Let A 


denote the event that the geological formation has more than 
1000 wells and let B denote the event that a well failed. 
Determine the following probabilities. 
(a) P(ANB) (b) P(A’) 

(d) P(A U B’) (e) P(A’ MB’) 


(c) P(A UB) 


Joint events are generated by applying basic set operations to individual events. Unions of 
events, such as A U B; intersections of events, such as A B; and complements of events, 
such as A’, are commonly of interest. The probability of a joint event can often be determined 
from the probabilities of the individual events that comprise it. Basic set operations are also 
sometimes helpful in determining the probability of a joint event. In this section the focus is 


on unions of events. 


EXAMPLE 2-19 Semiconductor Wafers 
Table 2-1 lists the history of 940 wafers in a semiconductor 
manufacturing process. Suppose one wafer is selected at 
random. Let H denote the event that the wafer contains high 
levels of contamination. Then, P(H) = 358/940. 

Let C denote the event that the wafer is in the center of a 
sputtering tool. Then, P(C) = 626/940. Also, P(H M C) is the 
probability that the wafer is from the center of the sputtering 
tool and contains high levels of contamination. Therefore, 


P(H NC) = 112/940 


The event H U C is the event that a wafer is from the 
center of the sputtering tool or contains high levels of contam- 


ination (or both). From the table, P(H U C) = 872/940. An 
alternative calculation of P(H U C) can be obtained as fol- 
lows. The 112 wafers that comprise the event HMC are 
included once in the calculation of P(H) and again in the cal- 
culation of P(C). Therefore, P(H U C) can be found to be 


P(H UC) = P(X) + P(C) — PH NC) 
= 358/940 + 626/940 — 112/940 = 872/940 


Practical Interpretation: To better understand the sources of 
contamination, yield from defferent locations on wafers are rou- 
tinely aggregated. 


The preceding example illustrates that the probability of A or B is interpreted as P(A U B) 
and that the following general addition rule applies. 


Table 2-1 Wafers in Semiconductor Manufacturing Classified 
by Contamination and Location 


Location in Sputtering Tool 


Contamination Center Edge Total 
Low 514 68 582 
High 112 246 358 
Total 626 314 
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Probability of 
a Union 


P(A U B) = P(A) + P(B) — P(A N B) 


(2-5) 


EXAMPLE 2-20 Semiconductor Wafers and Location 


The wafers in Example 2-19 were further classified by the 
degree of contamination. Table 2-2 shows the proportion of 
wafers in each category. What is the probability that a wafer 
was either at the edge or that it contains four or more particles? 
Let FE, denote the event that a wafer contains four or more par- 
ticles, and let E, denote the event that a wafer was at the edge. 

The requested probability is P(E, UE). Now, 
P(E;) =0.15 and P(E) = 0.28. Also, from the table, 
P(E, M Ey) = 0.04. Therefore, using Equation 2-1, we find that 


P(E, U Ey) = 0.15 + 0.28 — 0.04 = 0,39 


What is the probability that a wafer contains less than 
two particles or that it is both at the edge and contains more 
than four particles? Let E,; denote the event that a wafer 
contains less than two particles, and let E, denote the event 
that a wafer is both at the edge and contains more than four 
particles. The requested probability is P(E, U E,). Now, 
P(E,) = 0.60 and P(Ey) = 0.03. Also, E, and E, are mutually 
exclusive. Consequently, there are no wafers in the intersec- 
tion and P(E; M E,) = 0. Therefore, 


P(E; U Ey) = 0.60 + 0.03 = 0.63 


Recall that two events A and B are said to be mutually exclusive if AM B = ©. Then, 
P(A B) = 0, and the general result for the probability of A U B simplifies to the third 


axiom of probability. 


P(A U B) = P(A) + P(B) 


If A and B are mutually exclusive events, 


(2-6) 


Three or More Events 


More complicated probabilities, such as P(A U B U C), can be determined by repeated use 
of Equation 2-5 and by using some basic set operations. For example, 


P(AUBUC) = P[(AUB) UC] = P(AUB) + P(C) — P[(AUB)NC] 


Table 2-2 Wafers Classified by Contamination and Location 


Number of 
Contamination 

Particles Center Edge Totals 
0 0.30 0.10 0.40 

1 0.15 0.05 0.20 

2; 0.10 0.05 0.15 

3 0.06 0.04 0.10 

4 0.04 0.01 0.05 

5 or more 0.07 0.03 0.10 

Totals 0.72 0.28 1.00 


Mutually Exclusive 
Events 


Figure 2-12 Venn 
diagram of four mutu- 
ally exclusive events. 
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Upon expanding P(A U B) by Equation 2-5 and using the distributed rule for set opera- 
tions to simplify P[(A U B) MC], we obtain 


( 
= P(A) + P(B) — P(ANM B) + P( 
[ 
We have developed a formula for the probability of the union of three events. Formulas can be 


developed for the probability of the union of any number of events, although the formulas 
become very complex. As a summary, for the case of three events, 


P(A U BUC) = P(A) + P(B) + P(C) — P(ANB) 
— P(ANC) — P(BNC) + PIANBNC) (2-7) 


Results for three or more events simplify considerably if the events are mutually exclu- 
sive. In general, a collection of events, FE), E>,..., Ex, is said to be mutually exclusive if there 
is no overlap among any of them. The Venn diagram for several mutually exclusive events is 
shown in Fig. 2-12. By generalizing the reasoning for the union of two events, the following 
result can be obtained: 


A collection of events, E,, E>, ..., Ex,1s said to be mutually exclusive if for all pairs, 
For a collection of mutually exclusive events, 


Zz Ey 


E3 
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EXAMPLE 2-21 pH 

A simple example of mutually exclusive events will be used 
quite frequently. Let X denote the pH of a sample. Consider 
the event that X is greater than 6.5 but less than or equal to 7.8. 
This probability is the sum of any collection of mutually ex- 
clusive events with union equal to the same range for X. One 
example is 


P(6.5 < X =7.8) = P(6.5 < X = 7.0) + P(7.0 < X =7.5) 
+ P(7.5 < X = 7.8) 


EXERCISES FOR SECTION 2-3 


Another example is 


P(6.5 <X $7.8) = P(6.5 <X = 6.6) + P(6.6 <X $7.1) 
+ P(T.L<X S 7.4) + P(7.4<X =7.8) 


The best choice depends on the particular probabilities 
available. 

Practical Interpretation: The partition of an event into mu- 
tually exclusive subsets is widely used in later chapters to cal- 
culate probabilities. 


2-74. If P(A) = 0.3, P(B) = 0.2, and P(AM B) = 0.1, 
determine the following probabilities: 

(a) P(A’) (b) P(A UB) (c) P(A’ MB) 

(d) P(AM B’) (ce) P[(AUB)'] (ff) P(A’ UB) 

2-75. If A, B, and C are mutually exclusive events with 


P(A) = 0.2, P(B) = 0.3, and P(C) = 0.4, determine the fol- 
lowing probabilities: 


(a) PAUBUC) (b) PAN BNC) 


(c) PAM B) (d) P(AUB)NC] 
(e) P(A' NM B' NC’) 
2-76. In the article “ACL Reconstruction Using Bone- 


Patellar Tendon-Bone Press-Fit Fixation: 10-Year Clinical 
Results” in Knee Surgery, Sports Traumatology, Arthroscopy 
(2005, Vol. 13, pp. 248-255), the following causes for knee 
injuries were considered: 


Percentage of 


Activity Knee Injuries 
Contact sport 46% 
Noncontact sport 44% 
Activity of daily living 9% 
Riding motorcycle 1% 


(a) What is the probability a knee injury resulted from a sport 
(contact or noncontact)? 

(b) What is the probability a knee injury resulted from an ac- 
tivity other than a sport? 


2-77. Disks of polycarbonate plastic from a supplier are an- 
alyzed for scratch and shock resistance. The results from 100 
disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


(a) If a disk is selected at random, what is the probability that 
its scratch resistance is high and its shock resistance is high? 
(b) If a disk is selected at random, what is the probability that 
its scratch resistance is high or its shock resistance is high? 


(c) Consider the event that a disk has high scratch resistance 
and the event that a disk has high shock resistance. Are 
these two events mutually exclusive? 

2-78. Strands of copper wire from a manufacturer are ana- 

lyzed for strength and conductivity. The results from 100 

strands are as follows: 


strength 
high low 
high conductivity 74 8 
low conductivity 15 3 


(a) If a strand is randomly selected, what is the probability 
that its conductivity is high and its strength is high? 

(b) If a strand is randomly selected, what is the probability 
that its conductivity is low or the strength is low? 

(c) Consider the event that a strand has low conductivity and 
the event that the strand has a low strength. Are these two 
events mutually exclusive? 


2-79. The analysis of shafts for a compressor is summarized 
by conformance to specifications. 


roundness conforms 


yes no 
surface finish yes 345 5 
conforms no 12 8 


(a) If a shaft is selected at random, what is the probability that 
the shaft conforms to surface finish requirements? 

(b) What is the probability that the selected shaft conforms to 
surface finish requirements or to roundness requirements? 

(c) What is the probability that the selected shaft either con- 
forms to surface finish requirements or does not conform 
to roundness requirements? 

(d) What is the probability that the selected shaft conforms to 
both surface finish and roundness requirements? 


2-80. Cooking oil is produced in two main varieties: mono- 
and polyunsaturated. Two common sources of cooking oil are 


corm and canola. The following table shows the number of 
bottles of these oils at a supermarket: 


type of oil 
canola com 
type of mono 7 13 
unsaturation poly 93 77 


(a) If a bottle of oil is selected at random, what is the proba- 
bility that it belongs to the polyunsaturated category? 

(b) What is the probability that the chosen bottle is monoun- 
saturated canola oil? 

2-81. A manufacturer of front lights for automobiles tests 

lamps under a high-humidity, high-temperature environment 

using intensity and useful life as the responses of interest. 

The following table shows the performance of 130 lamps: 


useful life 


satisfactory unsatisfactory 


intensity satisfactory 117 3 


unsatisfactory 8 2 


(a) Find the probability that a randomly selected lamp will 
yield unsatisfactory results under any criteria. 

(b) The customers for these lamps demand 95% satisfactory 
results. Can the lamp manufacturer meet this demand? 


2-82. A computer system uses passwords that are six char- 
acters and each character is one of the 26 letters (a—z) or 10 in- 
tegers (0-9). Uppercase letters are not used. Let A denote the 
event that a password begins with a vowel (either a, e, i, 0, or 
u) and let B denote the event that a password ends with an even 
number (either 0, 2, 4, 6, or 8). Suppose a hacker selects a 
password at random. Determine the following probabilities: 
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(a) P(A) (b) P(B) 
(c) PAM B) (d) P(A UB) 
2-83. Consider the endothermic reactions in Exercise 2-50. 


Let A denote the event that a reaction final temperature is 
271 K or less. Let B denote the event that the heat absorbed is 
above target. Use the addition rules to calculate the following 
probabilities. 
(a) P(A U B) 
(c) P(A’ UB’) 
2-84. A Web ad can be designed from four different colors, 
three font types, five font sizes, three images, and five text 
phrases. A specific design is randomly generated by the Web 
server when you visit the site. Let A denote the event that the 
design color is red and let B denote the event that the font size 
is not the smallest one. Use the addition rules to calculate the 
following probabilities. 
(a) P(A U B) 

(c) P(A’ U B’) 

2-85. Consider the hospital emergency room data in Example 
2-8. Let A denote the event that a visit is to Hospital 4 and let B 
denote the event that a visit results in LWBS (at any hospital). 
Use the addition rules to calculate the following probabilities. 
(a) P(A U B) (b) P(A U B’) 

(c) P(A’ U B’) 

2-86. Consider the well failure data in Exercise 2-53. Let A 
denote the event that the geological formation has more than 
1000 wells and let B denote the event that a well failed. Use 
the addition rules to calculate the following probabilities. 

(a) P(A U B) (b) P(A U B’) 

(c) P(A’ U B’) 


(b) P(A U B’) 


(b) P(A UB’) 


Sometimes probabilities need to be reevaluated as additional information becomes available. 
A useful way to incorporate additional information into a probability model is to assume that 
the outcome that will be generated is a member of a given event. This event, say A, defines the 
conditions that the outcome is known to satisfy. Then probabilities can be revised to include 
this knowledge. The probability of an event B under the knowledge that the outcome will be 
in event A is denoted as 


P(BIA) 


and this is called the conditional probability of B given A. 

A digital communication channel has an error rate of one bit per every thousand trans- 
mitted. Errors are rare, but when they occur, they tend to occur in bursts that affect many con- 
secutive bits. If a single bit is transmitted, we might model the probability of an error as 
1/1000. However, if the previous bit was in error, because of the bursts, we might believe that 
the probability that the next bit is in error is greater than 1/1000. 

In a thin film manufacturing process, the proportion of parts that are not acceptable is 2%. 
However, the process is sensitive to contamination problems that can increase the rate of parts 
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Figure 2-13 
Conditional probabili- 
ties for parts with 
surface flaws. 


EXAMPLE 2-22 
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P(D|F) = 0.25 


25% 
defective { 


>~ 5% defective 
P(D|F') = 0.05 


parts with 
surface flaws 


F 


F'= parts without 
surface flaws 


that are not acceptable. If we knew that during a particular shift there were problems with the 
filters used to control contamination, we would assess the probability of a part being unac- 
ceptable as higher than 2%. 

In a manufacturing process, 10% of the parts contain visible surface flaws and 25% of the 
parts with surface flaws are (functionally) defective parts. However, only 5% of parts without sur- 
face flaws are defective parts. The probability of a defective part depends on our knowledge of 
the presence or absence of a surface flaw. Let D denote the event that a part is defective and let F 
denote the event that a part has a surface flaw. Then, we denote the probability of D given, or as- 
suming, that a part has a surface flaw as P(D| F). Because 25% of the parts with surface flaws are 
defective, our conclusion can be stated as P(D |F ) = 0.25. Furthermore, because F’ denotes the 
event that a part does not have a surface flaw and because 5% of the parts without surface flaws 
are defective, we have P(D| F’) = 0.05. These results are shown graphically in Fig. 2-13. 


Table 2-3 provides an example of 400 parts classified by 
surface flaws and as (functionally) defective. For this table the 
conditional probabilities match those discussed previously in 
this section. For example, of the parts with surface flaws 
(40 parts) the number of defective ones is 10. Therefore, 


P(D|F) = 10/40 = 0.25 


and of the parts without surface flaws (360 parts) the number 
of defective ones is 18. Therefore, 


P(D|F') = 18/360 = 0.05 


Practical Interpretation: The probability of defective is five 
times greater for parts with surface flaws. This calculation il- 
lustrates how probabilities are adjusted for additional informa- 
tion. The result also suggests that there may be a link between 
surface flaws and functionally defective parts that should be 
investigated. 


In Example 2-22 conditional probabilities were calculated directly. These probabilities can 
also be determined from the formal definition of conditional probability. 


Conditional 
Probability 


The conditional probability of an event B given an event A, denoted as P(B|A), is 


P(B|A) = P(A B)/P(A) (2-9) 
for P(A) > 0. 
Table 2-3 Parts Classified 
Surface Flaws 

Yes (event F) No Total 

Defective Yes (event D) 10 18 28 
No 30 342 372 

Total 40 360 400 
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This definition can be understood in a special case in which all outcomes of a random exper- 
iment are equally likely. If there are n total outcomes, 


P(A) = (number of outcomes in A)/n 


Also, 


P(A B) = (number of outcomes in A M B)/n 


Consequently, 


P(A B)/P(A) = 


number of outcomes in A / B 


number of outcomes in A 


Therefore, P(B|A) can be interpreted as the relative frequency of event B among the trials that 


produce an outcome in event A. 


EXAMPLE 2-23 Surface Flaws 
Again consider the 400 parts in Table 2-3. From this table, 


10 /40 _ 10 
PO SPOOR = <6 ag ag 


Note that in this example all four of the following probabilities 
are different: 


P(F) = 40/400 
P(D) = 28/400 


P(F|D) = 10/28 
P(D|F) = 10/40 


Here, P(D) and P(D|F) are probabilities of the same event, 
but they are computed under two different states of knowledge. 


Similarly, P(F) and P(F|D) are computed under two different 
states of knowledge. 

The tree diagram in Fig. 2-14 can also be used to display 
conditional probabilities. The first branch is on surface flaw. Of 
the 40 parts with surface flaws, 10 are functionally defective and 
30 are not. Therefore, 

P(D|F) = 10/40 and ~—P(D' | F) = 30/40 
Of the 360 parts without surface flaws, 18 are functionally de- 
fective and 342 are not. Therefore, 


P(D|F') = 18/360 and = P(D'|F’) = 342/360 


Random Samples and Conditional Probability 

Recall that to select one item randomly from a batch implies that each item is equally likely. 
If more than one item is selected, randomly implies that each element of the sample space is 
equally likely. For example, when sample spaces were presented earlier in this chapter, sam- 
pling with and without replacement was defined and illustrated for the simple case of a batch 
with three items {a, b, c}. If two items are selected randomly from this batch without replace- 
ment, each of the six outcomes in the ordered sample space {ab, ac, ba, bc, ca, cb} has prob- 
ability 1/6. If the unordered sample space is used, each of the three outcomes in { {a, b}, 


{a, c}, {b, c}} has probability 1/3. 


Surface flaw 


Defective — ;e: 


Figure 2-14 Tree 
diagram for parts 
classified 


342 18 
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Random Samples 


When a sample is selected randomly from a large batch, it is usually easier to avoid enu- 
meration of the sample space and calculate probabilities from conditional probabilities. For 
example, suppose that a batch contains 10 parts from tool | and 40 parts from tool 2. If two 
parts are selected randomly, without replacement, what is the conditional probability that a 
part from tool 2 is selected second given that a part from tool | is selected first? 

Although the answer can be determined from counts of outcomes, this type of question 
can be answered more easily with the following result. 


To select randomly implies that at each step of the sample, the items that remain in 
the batch are equally likely to be selected. 


If a part from tool 1 were selected with the first pick, 49 items would remain, 9 from tool 1 and 
40 from tool 2, and they would be equally likely to be picked. Therefore, the probability that 
a part from tool 2 would be selected with the second pick given this first pick is 


P(E>|E,) = 40/49 


In this manner, other probabilities can also be simplified. For example, let the event E 
consist of the outcomes with the first selected part from tool | and the second part from tool 2. 
To determine the probability of E, consider each step. The probability that a part from tool | 
is selected with the first pick is P(E,) = 10/50. The conditional probability that a part from 
tool 2 is selected with the second pick, given that a part from tool 1 is selected first, is 


P(E |E,) = 40/49. Therefore, 
40 10 8 
P(E) = P(E2| E\)P(E1) = 49 * 59 = 49 


Sometimes a partition of the question into successive picks is an easier method to solve the 
problem. 


EXAMPLE 2-24 
A day’s production of 850 manufactured parts contains 50 
parts that do not meet customer requirements. Two parts are 
selected randomly without replacement from the batch. What 
is the probability that the second part is defective given that 
the first part is defective? 

Let A denote the event that the first part selected is de- 
fective, and let B denote the event that the second part 


EXAMPLE 2-25 


Continuing the previous example, if three parts are selected at 
random, what is the probability that the first two are defective 
and the third is not defective? This event can be described in 
shorthand notation as simply P(ddn). We have 


50 | 
850 


49 
849 


800 


P(ddn) = 848 


= 0.0032 


selected is defective. The probability needed can be expressed 
as P(B|A). If the first part is defective, prior to selecting the 
second part, the batch contains 849 parts, of which 49 are 
defective; therefore, 


P(B|A) = 49/849 


The third term is obtained as follows. After the first two 
parts are selected, there are 848 remaining. Of the remain- 
ing parts, 800 are not defective. In this example, it is easy to 
obtain the solution with a conditional probability for each 
selection. 


EXERCISES FOR SECTION 2-4 


2-4 CONDITIONAL PROBABILITY 45 


2-87. Disks of polycarbonate plastic from a supplier are 
analyzed for scratch and shock resistance. The results from 
100 disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch resis- 
tance. Determine the following probabilities: 


(a) P(A) (b) P(B) 
(c) P(A|B) (d) P(BIA) 
2-88. Samples of skin experiencing desquamation are ana- 


lyzed for both moisture and melanin content. The results from 
100 skin samples are as follows: 


melanin content 


high low 
moisture high 13 7 
content low 48 32 


Let A denote the event that a sample has low melanin content, 
and let B denote the event that a sample has high moisture con- 
tent. Determine the following probabilities: 


(a) P(A) (b) P(B) 
(c) P(A|B) — (d) P(BIA) 
2-89. The analysis of results from a leaf transmutation 


experiment (turning a leaf into a petal) is summarized by type 
of transformation completed: 


total textural 


transformation 
yes no 
total color yes 243 26 
transformation no 13 18 


(a) If a leaf completes the color transformation, what is the 

probability that it will complete the textural transformation? 
(b) If aleaf does not complete the textural transformation, what 

is the probability it will complete the color transformation? 
2-90. Samples of a cast aluminum part are classified on the 
basis of surface finish (in microinches) and length measure- 
ments. The results of 100 parts are summarized as follows: 


length 
excellent good 
surface excellent 80 2 
finish good 10 8 


Let A denote the event that a sample has excellent surface 

finish, and let B denote the event that a sample has excellent 

length. Determine: 

(a) P(A) (b) P(B) 

(c) P(A|B) (d) P(BIA) 

(e) If the selected part has excellent surface finish, what is the 
probability that the length is excellent? 

(f) If the selected part has good length, what is the probabil- 
ity that the surface finish is excellent? 


2-91. The following table summarizes the analysis of samples 
of galvanized steel for coating weight and surface roughness: 


coating weight 


high low 
surface high 12 16 
roughness low 88 34 


(a) If the coating weight of a sample is high, what is the prob- 
ability that the surface roughness is high? 

(b) If the surface roughness of a sample is high, what is the 
probability that the coating weight is high? 

(c) If the surface roughness of a sample is low, what is the 
probability that the coating weight is low? 


2-92. Consider the data on wafer contamination and loca- 
tion in the sputtering tool shown in Table 2-2. Assume that one 
wafer is selected at random from this set. Let A denote the 
event that a wafer contains four or more particles, and let B de- 
note the event that a wafer is from the center of the sputtering 
tool. Determine: 


(a) P(A) (b) P(A|B) 

(c) P(B) (d) P(BIA) 

(e) PAMB) (f) P(AUB) 

2-93. The following table summarizes the number of de- 


ceased beetles under autolysis (the destruction of a cell after 
its death by the action of its own enzymes) and putrefaction 
(decomposition of organic matter, especially protein, by 
microorganisms, resulting in production of foul-smelling 
matter): 


autolysis 
high low 
: high 14 59 
putrefaction 
low 18 9 


(a) If the autolysis of a sample is high, what is the probability 
that the putrefaction is low? 

(b) If the putrefaction of a sample is high, what is the proba- 
bility that the autolysis is high? 

(c) If the putrefaction of a sample is low, what is the probability 
that the autolysis is low? 
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2-94. A maintenance firm has gathered the following infor- 
mation regarding the failure mechanisms for air conditioning 
systems: 


evidence of gas leaks 


yes no 
evidence of yes 55 17 
electrical failure no 32 3 


The units without evidence of gas leaks or electrical failure 

showed other types of failure. If this is a representative sample 

of AC failure, find the probability 

(a) That failure involves a gas leak 

(b) That there is evidence of electrical failure given that there 
was a gas leak 

(c) That there is evidence of a gas leak given that there is 
evidence of electrical failure 


2-95. A lot of 100 semiconductor chips contains 20 that are 

defective. Two are selected randomly, without replacement, 

from the lot. 

(a) What is the probability that the first one selected is 
defective? 

(b) What is the probability that the second one selected is 
defective given that the first one was defective? 

(c) What is the probability that both are defective? 

(d) How does the answer to part (b) change if chips selected 
were replaced prior to the next selection? 


2-96. A batch of 500 containers for frozen orange juice con- 

tains five that are defective. Two are selected, at random, with- 

out replacement from the batch. 

(a) What is the probability that the second one selected is 
defective given that the first one was defective? 

(b) What is the probability that both are defective? 

(c) What is the probability that both are acceptable? 


Three containers are selected, at random, without replace- 

ment, from the batch. 

(d) What is the probability that the third one selected is de- 
fective given that the first and second ones selected were 
defective? 

(e) What is the probability that the third one selected is 
defective given that the first one selected was defective 
and the second one selected was okay? 

(f) What is the probability that all three are defective? 


2-97. A batch of 350 samples of rejuvenated mitochondria 

contains eight that are mutated (or defective). Two are selected, 

at random, without replacement from the batch. 

(a) What is the probability that the second one selected is de- 
fective given that the first one was defective? 

(b) What is the probability that both are defective? 

(c) What is the probability that both are acceptable? 


2-98. A computer system uses passwords that are exactly 
seven characters and each character is one of the 26 letters 


(a—z) or 10 integers (0-9). You maintain a password for this 
computer system. Let A denote the subset of passwords that 
begin with a vowel (either a, e, i, 0, or u) and let B denote the 

subset of passwords that end with an even number (either 0, 2, 

4, 6, or 8). 

(a) Suppose a hacker selects a password at random. What is 
the probability that your password is selected? 

(b) Suppose a hacker knows your password is in event A and 
selects a password at random from this subset. What is the 
probability that your password is selected? 

(c) Suppose a hacker knows your password is in A and B and 
selects a password at random from this subset. What is the 
probability that your password is selected? 


2-99. If P(A|B) = 1, must A = B? Draw a Venn diagram to 
explain your answer. 


2-100. Suppose A and B are mutually exclusive events. 
Construct a Venn diagram that contains the three events A, B, 
and C such that P(A|C) = 1 and P(B|C) = 0. 


2-101. Consider the endothermic reactions in Exercise 2-50. 
Let A denote the event that a reaction final temperature is 
271 K or less. Let B denote the event that the heat absorbed is 
above target. Determine the following probabilities. 


(a) P(A|B) (b) P(A'|B) 
(c) P(A|B’) (d) P(BIA) 
2-102. Consider the hospital emergency room data in 


Example 2-8. Let A denote the event that a visit is to Hospital 
4 and let B denote the event that a visit results in LWBS (at any 
hospital). Determine the following probabilities. 


(a) P(A|B) (b) P(A’|B) 
(c) P(A|B’) (d) P(B|A) 
2-103. Consider the well failure data in Exercise 2-53. 


(a) What is the probability of a failure given there are more 
than 1000 wells in a geological formation? 

(b) What is the probability of a failure given there are fewer 
than 500 wells in a geological formation? 


2-104. An article in the The Canadian Entomologist 
(Harcourt et al., 1977, Vol. 109, pp. 1521-1534) studied the life 
of the alfalfa weevil from eggs to adulthood. The following 
table shows the number of larvae that survived at each stage of 
development from eggs to adults. 


Early Late Pre- Late 
Eggs Larvae Larvae pupae Pupae Adults 


421 412 306 45 35 31 


(a) What is the probability an egg survives to an adult? 

(b) What is the probability of survival to adult given survival 
to the late larvae stage? 

(c) What stage has the lowest probability of survival to the 
next stage? 


2-5 MULTIPLICATION AND TOTAL PROBABILITY RULES 47 
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The probability of the intersection of two events is often needed. The conditional probability 
definition in Equation 2-9 can be rewritten to provide a formula known as the multiplication 


rule for probabilities. 


Multiplication Rule 


P(A 1 B) = P(B|A)P(A) = P(A|B)P(B) 


(2-10) 


The last expression in Equation 2-10 is obtained by interchanging A and B. 


EXAMPLE 2-26 Machining Stages 
The probability that the first stage of a numerically controlled 
machining operation for high-rpm pistons meets specifications 
is 0.90. Failures are due to metal variations, fixture alignment, 
cutting blade condition, vibration, and ambient environmental 
conditions. Given that the first stage meets specifications, the 
probability that a second stage of machining meets specifica- 
tions is 0.95. What is the probability that both stages meet 
specifications? 

Let A and B denote the events that the first and second 
stages meet specifications, respectively. The probability re- 
quested is 


P(A B) = P(BJA)P(A) = 0.95(0.90) = 0.855 


Although it is also true that P(A M B) = P(A|B)P(B), the in- 
formation provided in the problem does not match this second 
formulation. 

Practical Interpretation: The probability that both stages meet 
specifications is approximately 0.85 and if additional stages 
were needed to complete a piston the probability would de- 
crease further. Consequently, the probability that each stage is 
completed successfully needs to be large in order for a piston 
to meet all specifications. 


Sometimes the probability of an event is given under each of several conditions. With enough 
of these conditional probabilities, the probability of the event can be recovered. For example, sup- 
pose that in semiconductor manufacturing the probability is 0.10 that a chip that is subjected to 
high levels of contamination during manufacturing causes a product failure. The probability is 
0.005 that a chip that is not subjected to high contamination levels during manufacturing causes a 
product failure. In a particular production run, 20% of the chips are subject to high levels of con- 
tamination. What is the probability that a product using one of these chips fails? 

Clearly, the requested probability depends on whether or not the chip was exposed to high 
levels of contamination. For any event B, we can write B as the union of the part of B in A and 


the part of B in A’. That is, 


= (AMB)U(A' NB) 


This result is shown in the Venn diagram in Fig. 2-15. Because A and A’ are mutually exclu- 
sive, AM B and A’ / B are mutually exclusive. Therefore, from the probability of the union 
of mutually exclusive events in Equation 2-6 and the Multiplication Rule in Equation 2-10, the 
following total probability rule is obtained. 


Figure 2-15 Partitioning 
an event into two mutually 
exclusive subsets. 


B=(BOE}) UV (BO Ed) U (BO E3) U (BO Eg) 


Figure 2-16 Partitioning an event into 
several mutually exclusive subsets. 
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Total Probability 


Rule (two events) For any events A and B, 


P(B) = P(BN A) + P(BO A’) = P(B|A)P(A) + P(B\A')P(A4') 


(2-11) 


EXAMPLE 2-27 Semiconductor Contamination 
Consider the contamination discussion at the start of this 
section. The information is summarized here. 


Probability of Level of Probability 
Failure Contamination of Level 
0.1 High 0.2 
0.005 Not High 0.8 


Let F denote the event that the product fails, and let H de- 
note the event that the chip is exposed to high levels of 


contamination. The requested probability is P(F), and the 
information provided can be represented as 


P(F|H)=0.10 and 
P(H)=0.20 and 


P(F|H') = 0.005 
P(H') = 0.80 
From Equation 2-11, 
P(F) = 0.10(0.20) + 0.005(0.80) = 0.024 


which can be interpreted as just the weighted average of the 
two probabilities of failure. 


The reasoning used to develop Equation 2-11 can be applied more generally. Because 


Total Probability 
Rule (multiple 


A UA' = Swe know (A M B) U (A’ M B) equals B, and because AM A’ = hb we knowA M B 
and A’ M B are mutually exclusive. In general, a collection of sets Ej, E>, ..., E, such that 
E,U E,U ... UE, =S is said to be exhaustive. A graphical display of partitioning an 
event B among a collection of mutually exclusive and exhaustive events is shown in Fig. 2-16. 


Assume EF), F5,... , E, are k mutually exclusive and exhaustive sets. Then 


events) 


P(B) = P(BM E;) ar P(B 1M E>) ap Se: gp P(B NM E,) 
= P(B|E,)P(E,) + P(B|E,)P(E2) + +++ + P(BIE,)P(E,) 


(2-12) 


EXAMPLE 2-28 Semiconductor Failures 
Continuing with semiconductor manufacturing, assume the 
following probabilities for product failure subject to levels of 
contamination in manufacturing: 


Probability of Failure Level of Contamination 


H denote the event that a chip is exposed to high levels of 
contamination 


M denote the event that a chip is exposed to medium levels 
of contamination 


L denote the event that a chip is exposed to low levels of 


0.10 High contamination 
0.01 Medium Then, 
0.001 Low 


In a particular production run, 20% of the chips are sub- 
jected to high levels of contamination, 30% to medium levels 
of contamination, and 50% to low levels of contamination. 
What is the probability that a product using one of these chips 
fails? Let 


P(F) = P(F|A)P(H) + P(F|M)P(M) + P(F|L)P(L) 
= 0.10(0.20) + 0.01(0.30) + 0.001(0.50) = 0.0235 


The calculations are conveniently organized with the tree 
diagram in Fig. 2-17. 
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Contamination 
e 


0.20 0.50 
0.30 
High Medium Low 
P(Fail|High) P(Not Fail| High) P(Fail|Medium) P(Not Fail|Medium)  P(Fail|Low) — P(Not Fail|Low) 
= 0.10 =0.90 =0.01 = 0.99 = 0.001 = 0.999 
P Ny \ “ Ss 
0.10(0.20) 0.90(0.20) 0.01(0.30) 0.99(0.30) 0.001(0.50) 0.999(0.50) 
Figure 2-17 Tree = 0.02 =0.18 = 0.003 = 0.297 = 0.0005 = 0.4995 
diagram for 
Example 2-28. P(Fail) = 0.02 + 0.003 + 0.0005 = 0.0235 
EXERCISES FOR SECTION 2-5 
2-105. Suppose that P(A|B)=0.4 and P(B) = 0.5. wires (35%), improper connections (13%), or poorly welded 
Determine the following: wires (52%). 
(a) P(ANM B) (a) Find the probability that a failure is due to loose keys. 
(b) P(A’ MB) (b) Find the probability that a failure is due to improperly 


2-106. Suppose that P(A|B) = 0.2, P(A|B’) = 0.3, and 
P(B) = 0.8. What is P(A)? 

2-107. The probability is 1% that an electrical connector 
that is kept dry fails during the warranty period of a portable 
computer. If the connector is ever wet, the probability of a fail- 
ure during the warranty period is 5%. If 90% of the connectors 
are kept dry and 10% are wet, what proportion of connectors 
fail during the warranty period? 


2-108. Suppose 2% of cotton fabric rolls and 3% of nylon 
fabric rolls contain flaws. Of the rolls used by a manufacturer, 
70% are cotton and 30% are nylon. What is the probability that a 
randomly selected roll used by the manufacturer contains flaws? 


2-109. The edge roughness of slit paper products increases 
as knife blades wear. Only 1% of products slit with new blades 
have rough edges, 3% of products slit with blades of average 
sharpness exhibit roughness, and 5% of products slit with worn 
blades exhibit roughness. If 25% of the blades in manufactur- 
ing are new, 60% are of average sharpness, and 15% are worn, 
what is the proportion of products that exhibit edge roughness? 


2-110. In the 2004 presidential election, exit polls from the 
critical state of Ohio provided the following results: 


total Bush, 2004 Kerry, 2004 
no college degree (62%) 50% 50% 
college graduate (38%) 53% 46% 


What is the probability a randomly selected respondent voted 
for Bush? 


2-111. Computer keyboard failures are due to faulty electrical 
connects (12%) or mechanical defects (88%). Mechanical 
defects are related to loose keys (27%) or improper assembly 
(73%). Electrical connect defects are caused by defective 


connected or poorly welded wires. 


2-112. Heart failures are due to either natural occurrences 

(87%) or outside factors (13%). Outside factors are related to 

induced substances (73%) or foreign objects (27%). Natural 

occurrences are caused by arterial blockage (56%), disease 

(27%), and infection (e.g., staph infection) (17%). 

(a) Determine the probability that a failure is due to induced 
substance. 

(b) Determine the probability that a failure is due to disease 
or infection. 


2-113. A batch of 25 injection-molded parts contains five 

that have suffered excessive shrinkage. 

(a) If two parts are selected at random, and without replace- 
ment, what is the probability that the second part selected 
is one with excessive shrinkage? 

(b) If three parts are selected at random, and without replace- 
ment, what is the probability that the third part selected is 
one with excessive shrinkage? 

2-114. A lot of 100 semiconductor chips contains 20 that 

are defective. 

(a) Two are selected, at random, without replacement, from 
the lot. Determine the probability that the second chip se- 
lected is defective. 

(b) Three are selected, at random, without replacement, 
from the lot. Determine the probability that all are 
defective. 


2-115. An article in the British Medical Journal [“Com- 
parison of Treatment of Renal Calculi by Operative Surgery, 
Percutaneous Nephrolithotomy, and Extracorporeal Shock 
Wave Lithotripsy” (1986, Vol. 82, pp. 879-892)] provided the 
following discussion of success rates in kidney stone re- 
movals. Open surgery had a success rate of 78% (273/350) 
while a newer method, percutaneous nephrolithotomy (PN), 
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had a success rate of 83% (289/350). This newer method 
looked better, but the results changed when stone diameter 
was considered. For stones with diameters less than two cen- 
timeters, 93% (81/87) of cases of open surgery were success- 
ful compared with only 83% (234/270) of cases of PN. For 
stones greater than or equal to two centimeters, the success 
rates were 73% (192/263) and 69% (55/80) for open surgery 
and PN, respectively. Open surgery is better for both stone 
sizes, but less successful in total. In 1951, E. H. Simpson 
pointed out this apparent contradiction (known as Simpson’s 
Paradox) but the hazard still persists today. Explain how open 
surgery can be better for both stone sizes but worse in total. 


2-116. Consider the endothermic reactions in Exercise 2-50. 
Let A denote the event that a reaction final temperature is 
271 K or less. Let B denote the event that the heat absorbed is 
above target. Determine the following probabilities. 

(a) P(ANM B) (b) P(A U B) (c) P(A’ UB’) 

(d) Use the total probability rule to determine P(A) 

2-117. Consider the hospital emergency room data in 
Example 2-8. Let A denote the event that a visit is to Hospital 
4 and let B denote the event that a visit results in LWBS (at any 
hospital). Determine the following probabilities. 

(a) P(ANM B) (b) P(A U B) (c) P(A’ U B’) 

(d) Use the total probability rule to determine P(A) 
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2-118. Consider the hospital emergency room data in Example 

2-8. Suppose that three visits that resulted in LWBS are selected 

randomly (without replacement) for a follow-up interview. 

(a) What is the probability that all three are selected from 
Hospital 2? 

(b) What is the probability that all three are from the same 
hospital? 

2-119. Consider the well failure data in Exercise 2-53. Let A 

denote the event that the geological formation has more than 

1000 wells and let B denote the event that a well failed. 

Determine the following probabilities. 

(a) P(ANM B) (b) P(A UB) (c) P(A’ U B’) 

(d) Use the total probability rule to determine P(A) 

2-120. Consider the well failure data in Exercise 2-53. 

Suppose that two failed wells are selected randomly (without 

replacement) for a follow-up review. 

(a) What is the probability that both are from the gneiss geo- 
logical formation group? 

(b) What is the probability that both are from the same geo- 
logical formation group? 


2-121. A Webadcan be designed from four different colors, 
three font types, five font sizes, three images, and five text 
phrases. A specific design in randomly generated by the Web 
server when you visit the site. Determine the probability that 
the ad color is red and the font size is not the smallest one. 


In some cases, the conditional probability of P(B|A) might equal P(B). In this special case, 
knowledge that the outcome of the experiment is in event A does not affect the probability that 


the outcome is in event B. 


EXAMPLE 2-29 Sampling with Replacement 
Suppose a day’s production of 850 manufactured parts con- 
tains 50 parts that do not meet customer requirements. Sup- 
pose two parts are selected from the batch, but the first part is 
replaced before the second part is selected. What is the proba- 
bility that the second part is defective (denoted as B) given that 
the first part is defective (denoted as A)? The probability 
needed can be expressed as P(B|A). 

Because the first part is replaced prior to selecting the 
second part, the batch still contains 850 parts, of which 50 are 


EXAMPLE 2-30 Flaws and Functions 

The information in Table 2-3 related surface flaws to func- 
tionally defective parts. In that case, we determined that 
P(D|F) = 10/40 = 0.25 and P(D) = 28/400 = 0.07. Sup- 
pose that the situation is different and follows Table 2-4. Then, 
P(D|F) = 2/40 =0.05 = and ~— P(D) = 20/400 = 0.05 


That is, the probability that the part is defective does not de- 
pend on whether it has surface flaws. Also, 


P(F|D) = 2/20=0.10 and —P(F) = 40/400 = 0.10 


defective. Therefore, the probability of B does not depend on 
whether or not the first part was defective. That is, 


P(B|A) = 50/850 


Also, the probability that both parts are defective is 


P(AM B) = P(B|A)P(A) = (2) . (2) = 0.0035 


so the probability of a surface flaw does not depend on 
whether the part is defective. Furthermore, the definition of 
conditional probability implies that 

P(F ND) = P(D|F)P(F) 


but in the special case of this problem, 


P(F ND) = P(D)P(F) = — + = 


Table 2-4 Parts Classified 
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Surface Flaws 


Yes (event F) No Total 
Defective Yes (event D) Dy 18 20 
No 38 342 380 
Total 40 360 400 


The preceding example illustrates the following conclusions. In the special case that 


P(B|A) = P(B), we obtain 


P(A (1 B) = P(B\A)P(A) = P(B)P(A) 


and 


P(A B) 


OD eae 


These conclusions lead to an important definition. 


Independence 
(two events) 


(1) P(A|B) = P(A) 
(2) P(BIA) = P(B) 


Two events are independent if any one of the following equivalent statements is true: 


(3) P(A MB) = P(A)P(B) 


(2-13) 


It is left as a mind-expanding exercise to show that independence implies related results 


such as 


P(A’ 1 B’) = P(A')P(B’) 


The concept of independence is an important relationship between events and is used 


throughout this text. A mutually exclusive relationship between two events is based only on 
the outcomes that comprise the events. However, an independence relationship depends on the 
probability model used for the random experiment. Often, independence is assumed to be part 
of the random experiment that describes the physical system under study. 


EXAMPLE 2-31 

A day’s production of 850 manufactured parts contains 50 
parts that do not meet customer requirements. Two parts are 
selected at random, without replacement, from the batch. Let 
A denote the event that the first part is defective, and let B de- 
note the event that the second part is defective. 

We suspect that these two events are not independent be- 
cause knowledge that the first part is defective suggests that it 
is less likely that the second part selected is defective. Indeed, 
P(B|A) = 49/849. Now, what is P(B)? Finding the uncondi- 
tional P(B) is somewhat difficult because the possible values 
of the first selection need to be considered: 


P(B) = P(B|A)P(A) + P(B|A')P(A’) 
= (49/849)(50/850) + (50/849)(800/850) 
= 50/850 


Interestingly, P(B), the unconditional probability that the 
second part selected is defective, without any knowledge of 
the first part, is the same as the probability that the first part 
selected is defective. Yet, our goal is to assess independence. 
Because P(B|A) does not equal P(B), the two events are not 
independent, as we suspected. 
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When considering three or more events, we can extend the definition of independence 


with the following general result. 


Independence 


(multiple events) The events Ey, FE), .. 


events E,, E; ne 


ip in? >** in? 


.,£, are independent if and only if for any subset of these 


PE, OE ee W EL) — P(E, 


(2-14) 


This definition is typically used to calculate the probability that several events occur assuming 
that they are independent and the individual event probabilities are known. The knowledge 
that the events are independent usually comes from a fundamental understanding of the 


random experiment. 


EXAMPLE 2-32 Series Circuit 

The following circuit operates only if there is a path of func- 
tional devices from left to right. The probability that each de- 
vice functions is shown on the graph. Assume that devices fail 
independently. What is the probability that the circuit operates? 


0.8 0.2) | _ 


Let L and R denote the events that the left and right devices 
operate, respectively. There is only a path if both operate. The 


EXAMPLE 2-33 
Assume that the probability that a wafer contains a large par- 
ticle of contamination is 0.01 and that the wafers are inde- 
pendent; that is, the probability that a wafer contains a large 
particle is not dependent on the characteristics of any of the 
other wafers. If 15 wafers are analyzed, what is the probability 
that no large particles are found? 

Let E; denote the event that the ith wafer contains no large 
particles, i = 1, 2,..., 15. Then, P(E;) = 0.99. The probability 


EXAMPLE 2-34 Parallel Circuit 

The following circuit operates only if there is a path of func- 
tional devices from left to right. The probability that each de- 
vice functions is shown on the graph. Assume that devices fail 
independently. What is the probability that the circuit operates? 


0.95 


0.95 


Let T and B denote the events that the top and bottom de- 
vices operate, respectively. There is a path if at least one device 
operates. The probability that the circuit operates is 


probability the circuit operates is 
P(Land R) = P(LM R) = P(L)P(R) = 0.80(0.90) = 0.72 


Practical Interpretation: Notice that the probability that the 
circuit operates degrades to approximately 0.5 when all devices 
are required to be functional. The probability each device is func- 
tional needs to be large for a circuit to operate when many devices 
are connected in series. 


requested can be represented as P(E, ME, M +++ M Ej5). From 
the independence assumption and Equation 2-14, 


x P(E\5) = 0.995 = 0.86 


P(T or B) = 1 — P[(Tor B)'] = 1 — P(T' and B’) 


A simple formula for the solution can be derived from the 
complements 7’ and B’. From the independence assumption, 


P(T' and B') = P(T’)P(B’) = (1 — 0.95) = 0.057 
sO 
P(T or B) = 1 — 0.057 = 0.9975 


Practical Interpretation: Notice that the probability that the cir- 
cuit operates is larger than the probability that either device is 
functional. This is an advantage of a parallel architecture. A dis- 
advantage is that multiple devices are needed. 


EXAMPLE 2-35 Advanced Circuit 

The following circuit operates only if there is a path of func- 
tional devices from left to right. The probability that each de- 
vice functions is shown on the graph. Assume that devices 
fail independently. What is the probability that the circuit 
operates? 


0.9 
O195 
a 0.9 0.99 b 
0.95 
0.9 


The solution can be obtained from a partition of the 
graph into three columns. Let L denote the event that there 
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is a path of functional devices only through the three units 
on the left. From the independence and based upon the pre- 
vious example, 


P(L) =1-0.17 


Similarly, let M denote the event that there is a path of functional 
devices only through the two units in the middle. Then, 


P(M) = 1 — 0.05? 
The probability that there is a path of functional devices only 
through the one unit on the right is simply the probability that 


the device functions, namely, 0.99. Therefore, with the inde- 
pendence assumption used again, the solution is 


(1 — 0.13)(1 — 0.057)(0.99) = 0.987 


2-122. If P(A|B) = 0.4, P(B) = 0.8, and P(A) = 0.5, are 
the events A and B independent? 


2-123. If P(A|B) = 0.3, P(B) = 0.8, and P(A) = 0.3, are 
the events B and the complement of A independent? 


2-124. If P(A) = 0.2, P(B) = 0.2, and A and B are mutu- 
ally exclusive, are they independent? 


2-125. <A batch of 500 containers for frozen orange juice 
contains five that are defective. Two are selected, at random, 
without replacement, from the batch. Let A and B denote the 
events that the first and second containers selected are defec- 
tive, respectively. 

(a) Are A and B independent events? 

(b) If the sampling were done with replacement, would A and 

B be independent? 


2-126. Disks of polycarbonate plastic from a supplier are 
analyzed for scratch and shock resistance. The results from 
100 disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch resis- 
tance. Are events A and B independent? 


2-127. Samples of emissions from three suppliers are clas- 
sified for conformance to air-quality specifications. The re- 
sults from 100 samples are summarized as follows: 


conforms 
yes no 
1 22 
supplier 2 25 
3 30 10 


Let A denote the event that a sample is from supplier 1, and let 
B denote the event that a sample conforms to specifications. 
(a) Are events A and B independent? 

(b) Determine P(B|A). 


2-128. Redundant Array of Inexpensive Disks (RAID) is a 
technology that uses multiple hard drives to increase the speed 
of data transfer and provide instant data backup. Suppose that 
the probability of any hard drive failing in a day is 0.001 and 
the drive failures are independent. 

(a) A RAID 0 scheme uses two hard drives, each containing a 
mirror image of the other. What is the probability of data 
loss? Assume that data loss occurs if both drives fail 
within the same day. 

(b) A RAID 1 scheme splits the data over two hard drives. 
What is the probability of data loss? Assume that data loss 
occurs if at least one drive fails within the same day. 
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2-129. The probability that a lab specimen contains high 

levels of contamination is 0.10. Five samples are checked, and 

the samples are independent. 

(a) What is the probability that none contains high levels of 
contamination? 

(b) What is the probability that exactly one contains high 
levels of contamination? 

(c) What is the probability that at least one contains high 
levels of contamination? 


2-130. Ina test of a printed circuit board using a random 

test pattern, an array of 10 bits is equally likely to be 0 or 1. 

Assume the bits are independent. 

(a) What is the probability that all bits are 1s? 

(b) What is the probability that all bits are 0s? 

(c) What is the probability that exactly five bits are 1s and five 
bits are Os? 


2-131. Six tissues are extracted from an ivy plant infested 

by spider mites. The plant in infested in 20% of its area. Each 

tissue is chosen from a randomly selected area on the ivy 

plant. 

(a) What is the probability that four successive samples show 
the signs of infestation? 

(b) What is the probability that three out of four successive 
samples show the signs of infestation? 


2-132. A player of a video game is confronted with a series of 

four opponents and an 80% probability of defeating each oppo- 

nent. Assume that the results from opponents are independent 

(and that when the player is defeated by an opponent the game 

ends). 

(a) What is the probability that a player defeats all four oppo- 
nents in a game? 

(b) What is the probability that a player defeats at least two 
opponents in a game? 

(c) If the game is played three times, what is the probability 
that the player defeats all four opponents at least once? 


2-133. In an acid-base titration, a base or acid is gradually 
added to the other until they have completely neutralized each 
other. Since acids and bases are usually colorless (as are the 
water and salt produced in the neutralization reaction), pH is 
measured to monitor the reaction. Suppose that the equiva- 
lence point is reached after approximately 100 mL of a NaOH 
solution has been added (enough to react with all the acetic 
acid present) but that replicates are equally likely to indicate 
from 95 to 104 mL, measured to the nearest mL. Assume that 
two technicians each conduct titrations independently. 
(a) What is the probability that both technicians obtain equiv- 
alence at 100 mL? 
(b) What is the probability that both technicians obtain equiv- 
alence between 98 and 104 mL (inclusive)? 
(c) What is the probability that the average volume at equiva- 
lence from the technicians is 100 mL? 


2-134. A credit card contains 16 digits. It also contains a 
month and year of expiration. Suppose there are one million 


credit card holders with unique card numbers. A hacker ran- 

domly selects a 16-digit credit card number. 

(a) What is the probability that it belongs to a user? 

(b) Suppose a hacker has a 25% chance of correctly guessing 
the year your card expires and randomly selects one of 
the 12 months. What is the probability that the hacker 
correctly selects the month and year of expiration? 


2-135. Eight cavities in an injection-molding tool produce 

plastic connectors that fall into a common stream. A sample is 

chosen every several minutes. Assume that the samples are 

independent. 

(a) What is the probability that five successive samples were 
all produced in cavity one of the mold? 

(b) What is the probability that five successive samples were 
all produced in the same cavity of the mold? 

(c) What is the probability that four out of five successive 
samples were produced in cavity one of the mold? 


2-136. The following circuit operates if and only if there is 
a path of functional devices from left to right. The probability 
that each device functions is as shown. Assume that the prob- 
ability that a device is functional does not depend on whether 
or not other devices are functional. What is the probability that 
the circuit operates? 


0.9 || 0.8 
0.95 i 0.25; 


2-137. The following circuit operates if and only if there is 
a path of functional devices from left to right. The probability 
each device functions is as shown. Assume that the probabil- 
ity that a device functions does not depend on whether or not 
other devices are functional. What is the probability that the 
circuit operates? 


(O).7/ 


0.95 


0.9 0.9 0.8 


0.95 0.95 0.9 


2-138. Consider the endothermic reactions in Exercise 2-50. 
Let A denote the event that a reaction final temperature is 271 K 
or less. Let B denote the event that the heat absorbed is above 
target. Are these events independent? 

2-139. Consider the hospital emergency room data in 
Example 2-8. Let A denote the event that a visit is to Hospital 4 
and let B denote the event that a visit results in LWBS (at any 
hospital). Are these events independent? 


2-140. Consider the well failure data in Exercise 2-53. Let A 
denote the event that the geological formation has more than 
1000 wells and let B denote the event that a well failed. Are 
these events independent? 

2-141. A Web ad can be designed from four different colors, 
three font types, five font sizes, three images, and five text 
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phrases. A specific design is randomly generated by the Web 
server when you visit the site. Let A denote the event that the 
design color is red and Let B denote the event that the font size 
is not the smallest one. Are A and B independent events? 
Explain why or why not. 


The examples in this chapter indicate that information is often presented in terms of con- 
ditional probabilities. These conditional probabilities commonly provide the probability of 
an event (such as failure) given a condition (such as high or low contamination). But after 
a random experiment generates an outcome, we are naturally interested in the probability 
that a condition was present (high contamination) given an outcome (a semiconductor fail- 
ure). Thomas Bayes addressed this essential question in the 1700s and developed the fun- 
damental result known as Bayes’ theorem. Do not let the simplicity of the mathematics 
conceal the importance. There is extensive interest in such probabilities in modern statistical 


analysis. 


From the definition of conditional probability, 


P(A 1 B) = P(A|B)P(B) = P(BM A) = P(B|A)P(A) 


Now, considering the second and last terms in the expression above, we can write 


ala) = PIAA) 


for P(B)>0 (2-15) 


P(B) 


This is a useful result that enables us to solve for P(A|B) in terms of P(B| A). 


EXAMPLE 2-36 

Reconsider Example 2-27. The conditional probability a high 
level of contamination was present when a failure occurred is 
to be determined. The information from Example 2-27 is sum- 
marized here. 


Probability of Level of Probability of 
Failure Contamination Level 
0.1 High 0.2 
0.005 Not High 0.8 


The probability of P(H| F) is determined from 


_ P(F|H)P(H) — 0.10(0.20) _ 


ree) P(F) 0.024 


0.83 


The value of P(F) in the denominator of our solution was 
found from P(F) = P(F|H)P(H) + P(F|H')P(H’). 


In general, if P(B) in the denominator of Equation 2-15 is written using the Total 
Probability Rule in Equation 2-12, we obtain the following general result, which is known as 


Bayes’ Theorem. 
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Bayes’ Theorem 
If Ej, Eo, .. 


., E; are k mutually exclusive and exhaustive events and B is any event, 


P(B|E\)P(E)) 


P(E, |B) = P 


(B|E,)P(E,) + P(B|E>)P(Ex) + = 


+ P(BIE)P(Ep Cy, 


for P(B) > 0 


Notice that the numerator always equals one of the terms in the sum in the denominator. 


EXAMPLE 2-37 Medical Diagnostic 

Because a new medical procedure has been shown to be effec- 
tive in the early detection of an illness, a medical screening of 
the population is proposed. The probability that the test cor- 
rectly identifies someone with the illness as positive is 0.99, and 
the probability that the test correctly identifies someone without 
the illness as negative is 0.95. The incidence of the illness in the 
general population is 0.0001. You take the test, and the result is 
positive. What is the probability that you have the illness? 

Let D denote the event that you have the illness, and let $ 
denote the event that the test signals positive. The probability 
requested can be denoted as P(D|S). The probability that the 
test correctly signals someone without the illness as negative 
is 0.95. Consequently, the probability of a positive test without 
the illness is 


P(S|D') = 0.05 


EXAMPLE 2-38 Bayesian Network 

Bayesian networks are used on the Web sites of high- 
technology manufacturers to allow customers to quickly diag- 
nose problems with products. An oversimplified example is 
presented here. A printer manufacturer obtained the following 
probabilities from a database of test results. Printer failures are 
associated with three types of problems: hardware, software, 
and other (such as connectors), with probabilities 0.1, 0.6, and 
0.3, respectively. The probability of a printer failure given a 
hardware problem is 0.9, given a software problem is 0.2, and 
given any other type of problem is 0.5. If a customer enters the 
manufacturer’s Web site to diagnose a printer failure, what is 
the most likely cause of the problem? 

Let the events H, S, and O denote a hardware, software, 
or other problem, respectively, and let F denote a printer fail- 
ure. The most likely cause of the problem is the one that cor- 
responds to the largest of P(H|F), P(S|F), and P(O|F). In 
Bayes’ Theorem the denominator is 
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From Bayes’ Theorem, 


P(D|S) = P(S|D)P(D)/[P(S|D)P(D) + P(S|D')P(D')] 
0.99(0.0001)/[0.99(0.0001) + 0.05(1 — 0.0001)] 
= 1/506 = 0.002 


Practical Interpretation: The probability of you having the 
illness given a positive result from the test is only 0.002. 
Surprisingly, even though the test is effective, in the sense that 
P(S|D) is high and P(S|D') is low, because the incidence of 
the illness in the general population is low, the chances are 
quite small that you actually have the disease even if the test is 
positive. 


P(F) = P(F|H)P(H) + P(F|S) P(S) + P(F|O)P(O) 
= 0.9(0.1) + 0.2(0.6) + 0.5(0.3) = 0.36 


Then, 


P(H|F) = P(F|H)P(H)/P(F) = 0.9(0.1)/0.36 = 0.250 
P(S|F) = P(F|S)P(S)/P(F) = 0.2(0.6)/0.36 = 0.333 
P(O|F) = P(F|O)P(O)/P(F) = 0.5(0.3)/0.36 = 0.417 


Notice that P(H|F) + P(S|F) + P(O|F) = 1 because 
one of the three types of problems is responsible for the failure. 
Because P(O|F ) is largest, the most likely cause of the problem 
is in the other category. A Web site dialog to diagnose the prob- 
lem quickly should start with a check into that type of problem. 

Practical Interpretation: Such networks are more com- 
monly used to diagnose problems in areas as diverse as 
electronic products and healthcare. 


2-142. Suppose that P(A|B) = 0.7, P(A) = 0.5, and 
P(B) = 0.2. Determine P(B|A). 
2-143. Suppose that P(A|B) = 0.4, P(A|B') = 0.2, and 
P(B) = 0.8. Determine P(B|A). 


2-144. Software to detect fraud in consumer phone cards 
tracks the number of metropolitan areas where calls originate 
each day. It is found that 1% of the legitimate users originate 
calls from two or more metropolitan areas in a single day. 
However, 30% of fraudulent users originate calls from two or 


more metropolitan areas in a single day. The proportion of 
fraudulent users is 0.01%. If the same user originates calls 
from two or more metropolitan areas in a single day, what is 
the probability that the user is fraudulent? 


2-145. Anew process of more accurately detecting anaer- 
obic respiration in cells is being tested. The new process is 
important due to its high accuracy, its lack of extensive ex- 
perimentation, and the fact that it could be used to identify 
five different categories of organisms: obligate anaerobes, 
facultative anaerobes, aerotolerant, microaerophiles, and 
nanaerobes instead of using a single test for each category. 
The process claims that it can identify obligate anaerobes 
with 97.8% accuracy, facultative anaerobes with 98.1% ac- 
curacy, aerotolerant with 95.9% accuracy, microaerophiles 
with 96.5% accuracy, and nanaerobes with 99.2% accuracy. 
If any category is not present, the process does not signal. 
Samples are prepared for the calibration of the process 
and 31% of them contain obligate anaerobes, 27% contain 
facultative anaerobes, 21% contain microaerophiles, 13% con- 
tain nanaerobes, and 8% contain aerotolerant. A test sample is 
selected randomly. 

(a) What is the probability that the process will signal? 

(b) If the test signals, what is the probability that mi- 

croaerophiles are present? 


2-146. In the 2004 presidential election, exit polls from 
the critical state of Ohio provided the following results: 


Bush Kerry 
no college degree (62%) 50% 50% 
college graduate (38%) 53% 46% 


If a randomly selected respondent voted for Bush, what is 
the probability that the person has a college degree? 


2-147. Customers are used to evaluate preliminary product 
designs. In the past, 95% of highly successful products 
received good reviews, 60% of moderately successful products 
received good reviews, and 10% of poor products received 
good reviews. In addition, 40% of products have been highly 
successful, 35% have been moderately successful, and 25% 
have been poor products. 

(a) What is the probability that a product attains a good review? 
(b) If a new design attains a good review, what is the proba- 

bility that it will be a highly successful product? 
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(c) If a product does not attain a good review, what is the 
probability that it will be a highly successful product? 


2-148. An inspector working for a manufacturing company 

has a 99% chance of correctly identifying defective items and 

a 0.5% chance of incorrectly classifying a good item as defec- 

tive. The company has evidence that its line produces 0.9% of 

nonconforming items. 

(a) What is the probability that an item selected for inspection 
is classified as defective? 

(b) If an item selected at random is classified as nondefective, 
what is the probability that it is indeed good? 


2-149. A new analytical method to detect pollutants in wa- 
ter is being tested. This new method of chemical analysis is 
important because, if adopted, it could be used to detect three 
different contaminants—organic pollutants, volatile solvents, 
and chlorinated compounds—instead of having to use a single 
test for each pollutant. The makers of the test claim that it can 
detect high levels of organic pollutants with 99.7% accuracy, 
volatile solvents with 99.95% accuracy, and chlorinated com- 
pounds with 89.7% accuracy. If a pollutant is not present, the 
test does not signal. Samples are prepared for the calibration 
of the test and 60% of them are contaminated with organic 
pollutants, 27% with volatile solvents, and 13% with traces of 
chlorinated compounds. A test sample is selected randomly. 
(a) What is the probability that the test will signal? 

(b) If the test signals, what is the probability that chlori- 

nated compounds are present? 


2-150. Consider the endothermic reactions in Exercise 2-50. 
Use Bayes’ Theorem to calculate the probability that a reaction 
final temperature is 271 K or less given that the heat absorbed 
is above target. 


2-151. Consider the hospital emergency room data in 
Example 2-8. Use Bayes’ Theorem to calculate the probability 
that a person visits Hospital 4 given they are LWBS. 


2-152. Consider the well failure data in Exercise 2-53. Use 
Bayes’ Theorem to calculate the probability that a randomly 
selected well is in the gneiss group given that the well is failed. 


2-153. Two Web colors are used for a site advertisement. If a 
site visitor arrives from an affiliate, the probabilities of the blue or 
green colors are 0.8 and 0.2, respectively. If the site visitor arrives 
from a search site, the blue and green colors are 0.4 and 0.6, re- 
spectively. The proportions of visitors from affiliates and search 
sites are 0.3 and 0.7, respectively. What is the probability that a 
visitor is from a search site given that the blue ad was viewed? 


We often summarize the outcome from a random experiment by a simple number. In many of 
the examples of random experiments that we have considered, the sample space has been a 
description of possible outcomes. In some cases, descriptions of outcomes are sufficient, but 
in other cases, it is useful to associate a number with each outcome in the sample space. 
Because the particular outcome of the experiment is not known in advance, the resulting 
value of our variable is not known in advance. For this reason, the variable that associates a 
number with the outcome of a random experiment is referred to as a random variable. 
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Random 
Variable 


Notation 


Discrete and 
Continuous 
Random 
Variables 


Examples 
of Random 
Variables 


A random variable is a function that assigns a real number to each outcome in the 
sample space of a random experiment. 


Notation is used to distinguish between a random variable and the real number. 


A random variable is denoted by an uppercase letter such as X. After an experiment 
is conducted, the measured value of the random variable is denoted by a lowercase 
letter such as x = 70 milliamperes. 


Sometimes a measurement (such as current in a copper wire or length of a machined part) 
can assume any value in an interval of real numbers (at least theoretically). Then arbitrary pre- 
cision in the measurement is possible. Of course, in practice, we might round off to the nearest 
tenth or hundredth of a unit. The random variable that represents this measurement is said to 
be a continuous random variable. The range of the random variable includes all values in an 
interval of real numbers; that is, the range can be thought of as a continuum. 

In other experiments, we might record a count such as the number of transmitted bits that 
are received in error. Then the measurement is limited to integers. Or we might record that a 
proportion such as 0.0042 of the 10,000 transmitted bits were received in error. Then the 
measurement is fractional, but it is still limited to discrete points on the real line. Whenever 
the measurement is limited to discrete points on the real line, the random variable is said to be 
a discrete random variable. 


A discrete random variable is a random variable with a finite (or countably infinite) 
range. 


A continuous random variable is a random variable with an interval (either finite or 
infinite) of real numbers for its range. 


In some cases, the random variable X is actually discrete but, because the range of possible 
values is so large, it might be more convenient to analyze X as a continuous random variable. For 
example, suppose that current measurements are read from a digital instrument that displays the 
current to the nearest one-hundredth of a milliampere. Because the possible measurements are 
limited, the random variable is discrete. However, it might be a more convenient, simple approx- 
imation to assume that the current measurements are values of a continuous random variable. 


Examples of continuous random variables: 
electrical current, length, pressure, temperature, time, voltage, weight 


Examples of discrete random variables: 
number of scratches on a surface, proportion of defective parts among 1000 
tested, number of transmitted bits received in error 


EXERCISES FOR SECTION 2-8 
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2-154. Decide whether a discrete or continuous random 
variable is the best model for each of the following 
variables: 

(a) The time until a projectile returns to earth. 


(b) The number of times a transistor in a computer memory 
changes state in one operation. 

(c) The volume of gasoline that is lost to evaporation during 
the filling of a gas tank. 


(d) The outside diameter of a machined shaft. 

(e) The number of cracks exceeding one-half inch in 10 miles 
of an interstate highway. 

(f) The weight of an injection-molded plastic part. 

(g) The number of molecules in a sample of gas. 

(h) The concentration of output from a reactor. 

(i) The current in an electronic circuit. 


Supplemental Exercises 


2-155. Samples of laboratory glass are in small, light pack- 
aging or heavy, large packaging. Suppose that 2 and 1% of the 
sample shipped in small and large packages, respectively, 
break during transit. If 60% of the samples are shipped in large 
packages and 40% are shipped in small packages, what pro- 
portion of samples break during shipment? 

2-156. A sample of three calculators is selected from a 
manufacturing line, and each calculator is classified as either 
defective or acceptable. Let A, B, and C denote the events 
that the first, second, and third calculators, respectively, are 
defective. 

(a) Describe the sample space for this experiment with a tree 

diagram. 
Use the tree diagram to describe each of the following events: 


(b) A (c) B 
(d) ANB (e) BUC 
2-157. Samples of a cast aluminum part are classified on the 


basis of surface finish (in microinches) and edge finish. The 
results of 100 parts are summarized as follows: 


edge finish 
excellent good 
surface excellent 80 Z 
finish good 10 8 


Let A denote the event that a sample has excellent surface fin- 
ish, and let B denote the event that a sample has excellent edge 
finish. If a part is selected at random, determine the following 
probabilities: 


(a) P(A) (b) P(B) 
(c) P(A’) (d) P(ANB) 
(ec) (AUB) (f) P(A’ UB) 


2-158. Shafts are classified in terms of the machine tool that 
was used for manufacturing the shaft and conformance to surface 
finish and roundness. 


2-159. IfA, B, and C are mutually exclusive events, is it possi- 
ble for P(A) = 0.3, P(B) = 0.4, and P(C) = 0.5? Why or why not? 


Tool | roundness conforms 


yes no 
surface finish yes 200 1 
conforms no 4 2 
Tool 2 roundness conforms 
yes no 
surface finish yes 145 4 
conforms no 8 6 


(a) If a shaft is selected at random, what is the probability that 
the shaft conforms to surface finish requirements or to 
roundness requirements or is from Tool 1? 

(b) If a shaft is selected at random, what is the probability that 
the shaft conforms to surface finish requirements or does 
not conform to roundness requirements or is from Tool 2? 

(c) If a shaft is selected at random, what is the probability that 
the shaft conforms to both surface finish and roundness 
requirements or the shaft is from Tool 2? 

(d) If a shaft is selected at random, what is the probability that 
the shaft conforms to surface finish requirements or the 
shaft is from Tool 2? 


2-160. The analysis of shafts for a compressor is summa- 
rized by conformance to specifications: 


roundness conforms 


yes no 
surface finish yes 345 3 
conforms no 12 8 


(a) If we know that a shaft conforms to roundness require- 
ments, what is the probability that it conforms to surface 
finish requirements? 

(b) If we know that a shaft does not conform to roundness 
requirements, what is the probability that it conforms to 
surface finish requirements? 


2-161. A researcher receives 100 containers of oxygen. Of 

those containers, have oxygen that is not ionized, and the rest 

are ionized. Two samples are randomly selected, without re- 

placement, from the lot. 

(a) What is the probability that the first one selected is not 
ionized? 

(b) What is the probability that the second one selected is not 
ionized given that the first one was ionized? 

(c) What is the probability that both are ionized? 

(d) How does the answer in part (b) change if samples se- 
lected were replaced prior to the next selection? 
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2-162. A lot contains 15 castings from a local supplier and 
25 castings from a supplier in the next state. Two castings are 
selected randomly, without replacement, from the lot of 40. 
Let A be the event that the first casting selected is from the 
local supplier, and let B denote the event that the second cast- 
ing is selected from the local supplier. Determine: 

(a) P(A) (b) P(B|A) 

(c) PAMB) (d) P(AUB) 


Suppose three castings are selected at random, without re- 
placement, from the lot of 40. In addition to the definitions of 
events A and B, let C denote the event that the third casting 
selected is from the local supplier. Determine: 

(e) PANBMC) 

(f) PANBNC') 


2-163. Inthe manufacturing of a chemical adhesive, 3% of 
all batches have raw materials from two different lots. This 
occurs when holding tanks are replenished and the remaining 
portion of a lot is insufficient to fill the tanks. 

Only 5% of batches with material from a single lot require 
reprocessing. However, the viscosity of batches consisting of 
two or more lots of material is more difficult to control, and 
40% of such batches require additional processing to achieve 
the required viscosity. 

Let A denote the event that a batch is formed from two dif- 
ferent lots, and let B denote the event that a lot requires 
additional processing. Determine the following probabilities: 


(a) P(A) (b) P(A’) 

(c) P(BIA) — (d) P(BIA’) 

(ec) PPAMB) (f) P(AMB’') 

(g) P(B) 

2-164. Incoming calls to a customer service center are classi- 


fied as complaints (75% of calls) or requests for information 

(25% of calls). Of the complaints, 40% deal with computer 

equipment that does not respond and 57% deal with incomplete 

software installation; in the remaining 3% of complaints the 
user has improperly followed the installation instructions. The 
requests for information are evenly divided on technical ques- 

tions (50%) and requests to purchase more products (50%). 

(a) What is the probability that an incoming call to the cus- 
tomer service center will be from a customer who has not 
followed installation instructions properly? 

(b) Find the probability that an incoming call is a request for 
purchasing more products. 


2-165. A congested computer network has a 0.002 proba- 

bility of losing a data packet and packet losses are independent 

events. A lost packet must be resent. 

(a) What is the probability that an e-mail message with 100 
packets will need any to be resent? 

(b) What is the probability that an e-mail message with 3 
packets will need exactly one to be resent? 

(c) If 10 e-mail messages are sent, each with 100 packets, 
what is the probability that at least one message will need 
some packets to be resent? 


2-166. Samples of a cast aluminum part are classified on 
the basis of surface finish (in microinches) and length mea- 
surements. The results of 100 parts are summarized as follows: 


length 
excellent good 
surface excellent 80 2 
finish good 10 8 


Let A denote the event that a sample has excellent surface fin- 
ish, and let B denote the event that a sample has excellent 
length. Are events A and B independent? 


2-167. An optical storage device uses an error recovery pro- 
cedure that requires an immediate satisfactory readback of any 
written data. If the readback is not successful after three writing 
operations, that sector of the disk is eliminated as unacceptable 
for data storage. On an acceptable portion of the disk, the proba- 
bility of a satisfactory readback is 0.98. Assume the readbacks 
are independent. What is the probability that an acceptable por- 
tion of the disk is eliminated as unacceptable for data storage? 


2-168. Semiconductor lasers used in optical storage prod- 
ucts require higher power levels for write operations than for 
read operations. High-power-level operations lower the useful 
life of the laser. 

Lasers in products used for backup of higher-speed mag- 
netic disks primarily write, and the probability that the useful 
life exceeds five years is 0.95. Lasers that are in products that 
are used for main storage spend approximately an equal 
amount of time reading and writing, and the probability that 
the useful life exceeds five years is 0.995. Now, 25% of the 
products from a manufacturer are used for backup and 75% of 
the products are used for main storage. 

Let A denote the event that a laser’s useful life exceeds five 
years, and let B denote the event that a laser is in a product that 
is used for backup. 

Use a tree diagram to determine the following: 


(a) P(B) (b) P(A|B) 
(c) P(A|B’) (d) P(AMB) 
(ce) PAMB') (f) P(A) 


(g) What is the probability that the useful life of a laser 
exceeds five years? 

(h) What is the probability that a laser that failed before five 
years came from a product used for backup? 


2-169. Energy released from three cells breaks the molecular 
bond and converts ATP (adenosine triphosphate) into ADP 
(adenosine diphosphate). Storage of ATP in muscle cells (even 
for an athlete) can only sustain maximal muscle power for less 
than five seconds (a short dash). Three systems are used to re- 
plenish ATP: phosphagen system; glycogen-lactic acid system 
(anaerobic); and aerobic respiration, but the first is only useful 
for less than 10 seconds, and even the second system provides 
less than two minutes of ATP. An endurance athlete needs to 
perform below the anaerobic threshold to sustain energy for 


extended periods. A sample of 100 individuals is described by 
the energy system used in exercise at different intensity levels. 


primarily aerobic 


period yes no 
1 50 7 
2 13 30 


Let A denote the event that an individual is in period 2, and 

let B denote the event that the energy is primarily aerobic. 

Determine the number of individuals in 

(a) A' NB (b)B’ (c) AUB 

2-170. A sample preparation for a chemical measurement 

is completed correctly by 25% of the lab technicians, com- 

pleted with a minor error by 70%, and completed with a major 

error by 5%. 

(a) If a technician is selected randomly to complete the prepa- 
ration, what is the probability it is completed without error? 

(b) What is the probability that it is completed with either a 
minor or a major error? 


2-171. In circuit testing of printed circuit boards, each 
board either fails or does not fail the test. A board that fails the 
test is then checked further to determine which one of five de- 
fect types is the primary failure mode. Represent the sample 
space for this experiment. 


2-172. The data from 200 machined parts are summarized 
as follows: 
depth of bore 
above below 
edge condition target target 
coarse 15 10 
moderate 25 20 
smooth 50 80 


(a) What is the probability that a part selected has a moderate 
edge condition and a below-target bore depth? 

(b) What is the probability that a part selected has a moderate 
edge condition or a below-target bore depth? 

(c) What is the probability that a part selected does not have a 
moderate edge condition or does not have a below-target 
bore depth? 

2-173. Computers in a shipment of 100 units contain a 

portable hard drive, DVD drive, or both, according to the 

following table: 


portable hard drive 


yes no 
DVD 
yes 15 80 
no 4 1 


Let A denote the event that a computer has a portable hard 
drive and let B denote the event that a computer has a DVD 
drive. If one computer is selected randomly, compute 
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(a) P(A) (b) PAN) 

(d) P(A’ MB) (e) P(A|B) 

2-174. The probability that a customer’s order is not 

shipped on time is 0.05. A particular customer places three 

orders, and the orders are placed far enough apart in time that 

they can be considered to be independent events. 

(a) What is the probability that all are shipped on time? 

(b) What is the probability that exactly one is not shipped on 
time? 

(c) What is the probability that two or more orders are not 
shipped on time? 

2-175. Let E,, E,, and E; denote the samples that conform 

to a percentage of solids specification, a molecular weight 

specification, and a color specification, respectively. A total of 

240 samples are classified by the E,, E,, and E; specifications, 

where yes indicates that the sample conforms. 


(c) P(A UB) 


E3 yes 
E, 
yes no Total 
E, yes 200 1 201 
no 3 4 9 
Total 205 5) 210 
E;no 
Ey 
yes no 
E, yes 20 4 24 
no 6 0 6 
Total 26 4 30 


(a) Are E,, E>, and E; mutually exclusive events? 

(b) Are E’,, E’,, and E’; mutually exclusive events? 

(c) What is P(E’; or E’, or E’;)? 

(d) What is the probability that a sample conforms to all three 
specifications? 

(e) What is the probability that a sample conforms to the FE, or 
E; specification? 

(f) What is the probability that a sample conforms to the E, or 
E, or E; specification? 

2-176. Transactions to a computer database are either new 

items or changes to previous items. The addition of an item 

can be completed in less than 100 milliseconds 90% of the 

time, but only 20% of changes to a previous item can be com- 

pleted in less than this time. If 30% of transactions are changes, 

what is the probability that a transaction can be completed in 

less than 100 milliseconds? 

2-177. Asteel plate contains 20 bolts. Assume that five bolts 

are not torqued to the proper limit. Four bolts are selected at 

random, without replacement, to be checked for torque. 

(a) What is the probability that all four of the selected bolts 
are torqued to the proper limit? 

(b) What is the probability that at least one of the selected 
bolts is not torqued to the proper limit? 
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2-178. The following circuit operates if and only if there is 
a path of functional devices from left to right. Assume devices 
fail independently and that the probability of failure of each 
device is as shown. What is the probability that the circuit 
operates? 


2-179. The probability of getting through by telephone to 
buy concert tickets is 0.92. For the same event, the probability 
of accessing the vendor’s Web site is 0.95. Assume that these 
two ways to buy tickets are independent. What is the proba- 
bility that someone who tries to buy tickets through the 
Internet and by phone will obtain tickets? 


2-180. The British government has stepped up its informa- 
tion campaign regarding foot-and-mouth disease by mailing 
brochures to farmers around the country. It is estimated that 
99% of Scottish farmers who receive the brochure possess 
enough information to deal with an outbreak of the disease, 
but only 90% of those without the brochure can deal with an 
outbreak. After the first three months of mailing, 95% of the 
farmers in Scotland received the informative brochure. 
Compute the probability that a randomly selected farmer will 
have enough information to deal effectively with an outbreak 
of the disease. 


2-181. In an automated filling operation, the probability of 
an incorrect fill when the process is operated at a low speed is 
0.001. When the process is operated at a high speed, the prob- 
ability of an incorrect fill is 0.01. Assume that 30% of the 
containers are filled when the process is operated at a high 
speed and the remainder are filled when the process is 
operated at a low speed. 
(a) What is the probability of an incorrectly filled container? 
(b) If an incorrectly filled container is found, what is the prob- 
ability that it was filled during the high-speed operation? 


2-182. An encryption-decryption system consists of three 

elements: encode, transmit, and decode. A faulty encode 

occurs in 0.5% of the messages processed, transmission errors 

occur in 1% of the messages, and a decode error occurs in 

0.1% of the messages. Assume the errors are independent. 

(a) What is the probability of a completely defect-free 
message? 

(b) What is the probability of a message that has either an 
encode or a decode error? 


2-183. It is known that two defective copies of a commer- 
cial software program were erroneously sent to a shipping lot 
that now has a total of 75 copies of the program. A sample of 
copies will be selected from the lot without replacement. 


(a) If three copies of the software are inspected, determine 
the probability that exactly one of the defective copies will 
be found. 

(b) If three copies of the software are inspected, determine the 
probability that both defective copies will be found. 

(c) If 73 copies are inspected, determine the probability that 
both copies will be found. Hint: Work with the copies that 
remain in the lot. 


2-184. A robotic insertion tool contains 10 primary compo- 
nents. The probability that any component fails during the 
warranty period is 0.01. Assume that the components fail 
independently and that the tool fails if any component fails. 
What is the probability that the tool fails during the warranty 
period? 

2-185. An e-mail message can travel through one of two 
server routes. The probability of transmission error in each of 
the servers and the proportion of messages that travel each 
route are shown in the following table. Assume that the 
servers are independent. 


probability of error 
percentage 
of messages server 1 server2 server3 server 4 
route 1 30 0.01 0.015 
route 2 70 0.02 0.003 


(a) What is the probability that a message will arrive without 
error? 

(b) If a message arrives in error, what is the probability it was 
sent through route 1? 


2-186. A machine tool is idle 15% of the time. You request 
immediate use of the tool on five different occasions during 
the year. Assume that your requests represent independent 
events. 


(a) What is the probability that the tool is idle at the time of all 
of your requests? 

(b) What is the probability that the machine is idle at the time 
of exactly four of your requests? 

(c) What is the probability that the tool is idle at the time of at 
least three of your requests? 


2-187. A lot of 50 spacing washers contains 30 washers 
that are thicker than the target dimension. Suppose that three 
washers are selected at random, without replacement, from 
the lot. 

(a) What is the probability that all three washers are thicker 
than the target? 

(b) What is the probability that the third washer selected is 
thicker than the target if the first two washers selected are 
thinner than the target? 

(c) What is the probability that the third washer selected is 
thicker than the target? 


2-188. Continuation of Exercise 2-187. Washers are se- 
lected from the lot at random, without replacement. 


(a) What is the minimum number of washers that need to be 
selected so that the probability that all the washers are 
thinner than the target is less than 0.10? 

(b) What is the minimum number of washers that need to be 
selected so that the probability that one or more washers 
are thicker than the target is at least 0.90? 


2-189. The following table lists the history of 940 orders for 
features in an entry-level computer product. 


extra memory 


no yes 
optional high- no 514 68 
speed processor yes 112 246 


Let A be the event that an order requests the optional high- 
speed processor, and let B be the event that an order requests 
extra memory. Determine the following probabilities: 

(a) PAUB) (b) P(AMB) 

(c) P(A'UB) (d) P(A'N B’) 

(e) What is the probability that an order requests an optional 
high-speed processor given that the order requests extra 
memory? 

(f) What is the probability that an order requests extra mem- 
ory given that the order requests an optional high-speed 
processor? 


2-190. The alignment between the magnetic tape and head 
in a magnetic tape storage system affects the performance of 
the system. Suppose that 10% of the read operations are 
degraded by skewed alignments, 5% of the read operations are 
degraded by off-center alignments, and the remaining read op- 
erations are properly aligned. The probability of a read error is 
0.01 from a skewed alignment, 0.02 from an off-center align- 
ment, and 0.001 from a proper alignment. 

(a) What is the probability of a read error? 

(b) If a read error occurs, what is the probability that it is due 

to a skewed alignment? 


2-191. The following circuit operates if and only if there is 
a path of functional devices from left to right. Assume that de- 
vices fail independently and that the probability of failure of 
each device is as shown. What is the probability that the 
circuit does not operate? 


0.02 


0.01 }— 0.01 


0.01 }—— 0.01 


0.02 
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2-192. A company that tracks the use of its web site deter- 
mined that the more pages a visitor views, the more likely the 
visitor is to provide contact information. Use the following 
tables to answer the questions: 


Number of 


pages viewed: 1 2 3 4 or more 


Percentage of 
visitors: 40 30 20 10 


Percentage of visitors 
in each page-view 
category that provide 


contact information: 10 10 20 40 


(a) What is the probability that a visitor to the Web site 
provides contact information? 

(b) If a visitor provides contact information, what is the 
probability that the visitor viewed four or more pages? 
2-193. An article in Genome Research [“An Assessment 
of Gene Prediction Accuracy in Large DNA Sequences” 
(2000, Vol. 10, pp. 1631—1642)], considered the accuracy of 
commercial software to predict nucleotides in gene se- 
quences. The following table shows the number of se- 
quences for which the programs produced predictions and 
the number of nucleotides correctly predicted (computed 
globally from the total number of prediction successes and 

failures on all sequences). 


Number of 

Sequences Proportion 
GenScan 177 0.93 
Blastx default 175 0.91 
Blastx topcomboN 174 0.97 
Blastx 2 stages 175 0.90 
Gene Wise 177 0.98 
Procrustes 177 0.93 


Assume the prediction successes and failures are independent 

among the programs. 

(a) What is the probability that all programs predict a nucleo- 
tide correctly? 

(b) What is the probability that all programs predict a nu- 
cleotide incorrectly? 

(c) What is the probability that at least one Blastx program 
predicts a nucleotide correctly? 

2-194. A batch contains 36 bacteria cells. Assume that 12 of 

the cells are not capable of cellular replication. Six cells are 

selected at random, without replacement, to be checked for 

replication. 

(a) What is the probability that all six cells of the selected 
cells are able to replicate? 

(b) What is the probability that at least one of the selected 
cells is not capable of replication? 
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2-195. A computer system uses passwords that are exactly 

seven characters and each character is one of the 26 letters 

(a—z) or 10 integers (0-9). Uppercase letters are not used. 

(a) How many passwords are possible? 

(b) If a password consists of exactly six letters and one num- 
ber, how many passwords are possible? 

(c) If a password consists of five letters followed by two num- 
bers, how many passwords are possible? 

2-196. Natural red hair consists of two genes. People with 

red hair have two dominant genes, two regressive genes, or 

one dominant and one regressive gene. A group of 1000 people 

was categorized as follows: 


Gene 2 
Gene 1 Dominant Regressive Other 
Dominant o) 25 30 
Regressive 7 63 35 
Other 20 15 800 


Let A denote the event that a person has a dominant red hair 
gene and let B denote the event that a person has a regressive 
red hair gene. If a person is selected at random from this 
group, compute the following. 


(a) P(A) (b) P(A B) 

(c) PAUB) (d) P(A'NB) (e) P(A| B) 

(f ) Probability that the selected person has red hair 

2-197. Two suppliers each supplied 2000 parts and these 
were evaluated for conformance to specifications. One part 
type was of greater complexity than the other. The proportion 
of nonconforming parts of each type are shown in the table. 


Simple Complex 


Supplier Component Assembly Total 
1 Nonconforming 2 10 12 
Total 1000 1000 2000 

2 Nonconforming 4 6 10 
Total 1600 400 2000 


One part is selected at random from each supplier. For each 

supplier, separately calculate the following probabilities: 

(a) What is the probability a part conforms to specifications? 

(b) What is the probability a part conforms to specifications 
given it is a complex assembly? 

(c) What is the probability a part conforms to specifications 
given it is a simple component? 

(d) Compare your answers for each supplier in part (a) to 
those in parts (b) and (c) and explain any unusual results. 


MIND-EXPANDING EXERCISES 


2-198. Suppose documents in a lending organization 
are selected randomly (without replacement) for review. 
In a set of 50 documents, suppose that two actually con- 
tain errors. 

(a) What is the minimum sample size such that the 
probability exceeds 0.90 that at least one document 
in error is selected? 

(b) Comment on the effectiveness of sampling inspection 
to detect errors. 

2-199. Suppose that a lot of washers is large enough 

that it can be assumed that the sampling is done with re- 

placement. Assume that 60% of the washers exceed the 
target thickness. 

(a) What is the minimum number of washers that need 
to be selected so that the probability that none are 
thicker than the target is less than 0.10? 

(b) What is the minimum number of washers that need to 
be selected so that the probability that one or more 
washers are thicker than the target is at least 0.90? 


2-200. A biotechnology manufacturing firm can pro- 
duce diagnostic test kits at a cost of $20. Each kit for 
which there is a demand in the week of production can be 
sold for $100. However, the half-life of components in 
the kit requires the kit to be scrapped if it is not sold in 
the week of production. The cost of scrapping the kit is 
$5. The weekly demand is summarized as follows: 


weekly demand 


Number of 

units 0 50 100 200 
Probability of 

demand 0.05 0.4 0.3 0.25 


How many kits should be produced each week to maxi- 
mize the mean earnings of the firm? 

2-201. Assume the following characteristics of the 
inspection process in Exercise 2-177. If an operator 
checks a bolt, the probability that an incorrectly torqued 
bolt is identified is 0.95. If a checked bolt is correctly 
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MIND-EXPANDING EXERCISES 


torqued, the operator’s conclusion is always correct. 
What is the probability that at least one bolt in the sam- 
ple of four is identified as being incorrectly torqued? 


2-202. 
that A’ and B’ are independent. 


2-203. 
as follows: 


conforms 


supplier 1 ka 


IMPORTANT TERMS AND CONCEPTS 


If the events A and B are independent, show 


Suppose that a table of part counts is generalized 


no 
kb 


where a, b, and k are positive integers. Let A denote the 
event that a part is from supplier 1 and let B denote the 
event that a part conforms to specifications. Show that 
A and B are independent events. 

This exercise illustrates the result that whenever the 
rows of a table (with r rows and c columns) are propor- 
tional, an event defined by a row category and an event 
defined by a column category are independent. 
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Addition rule Independence 
Axioms of probability Multiplication rule 
Bayes’ theorem Mutually exclusive 
Combination events 
Conditional probability Outcome 

Equally likely outcomes Permutation 
Event Probability 


Random experiment 

Random variables— 
discrete and 
continuous 

Sample spaces—discrete 
and continuous 

Simpson’s paradox 


Tree diagram 

Venn diagram 

With or without 
replacement 


Total probability rule 
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Discrete Random Variables 
and Probability Distributions 


A redundant array of independent disks (RAID) uses multiple physical disk drives as one log- 
ical unit in a computer system. The array can increase performance and robustness to a disk 
failure. Data copies can be written simultaneously to multiple drives (known as mirroring) to 
provide immediate backup and the ability to recover from failures, but with less storage ca- 
pacity than would otherwise be available. Alternatively, to increase performance, the data can 
be distributed among multiple disks with only a fraction of the data on each one (known as 
striping). But a failure to even a single disk can lead to loss of data. An intermediate design is 
to distribute the source data along with additional data (known as parity data) across multiple 
disks. With the parity data the source data can be recovered even with disk failures. In partic- 
ular, a RAID 5 design uses striping and parity to be able to recover the source data if one disk 
in the array fails, while a RAID 6 design allows for data recovery even if two disks fail. Disk 
failures due to hardware malfunction are often assumed to be independent with constant prob- 
ability. With a large number of disks in an array the risk of data loss and the appropriate array 
design to meet the system performance, availability, and cost criteria are important. The num- 
ber of failed drives can be modeled as a discrete random variable and the risk of data loss in a 
redundant system is only one example of the use of topics in this chapter. 
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3-2. PROBABILITY DISTRIBUTIONS DISTRIBUTION 
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After careful study of this chapter you should be able to do the following: 
1. Determine probabilities from probability mass functions and the reverse 


2. Determine probabilities from cumulative distribution functions and cumulative distribution func- 
tions from probability mass functions, and the reverse 


Nm BW 


distributions 


. Calculate means and variances for discrete random variables 

. Understand the assumptions for some common discrete probability distributions 

. Select an appropriate discrete probability distribution to calculate probabilities in specific applications 
. Calculate probabilities, determine means and variances for some common discrete probability 


3-1 DISCRETE RANDOM VARIABLES 


Many physical systems can be modeled by the same or similar random experiments and random 
variables. The distribution of the random variables involved in each of these common systems 
can be analyzed, and the results can be used in different applications and examples. In this chap- 
ter, we present the analysis of several random experiments and discrete random variables that 
frequently arise in applications. We often omit a discussion of the underlying sample space of 
the random experiment and directly describe the distribution of a particular random variable. 


EXAMPLE 3-1 Voice Lines 

A voice communication system for a business contains 48 
external lines. At a particular time, the system is observed, and 
some of the lines are being used. Let the random variable X 


EXAMPLE 3-2 

In a semiconductor manufacturing process, two wafers from a 
lot are tested. Each wafer is classified as pass or fail. Assume 
that the probability that a wafer passes the test is 0.8 and that 
wafers are independent. The sample space for the experiment 
and associated probabilities are shown in Table 3-1. For exam- 
ple, because of the independence, the probability of the out- 
come that the first wafer tested passes and the second wafer 


EXAMPLE 3-3 

Define the random variable X to be the number of contamina- 
tion particles on a wafer in semiconductor manufacturing. 
Although wafers possess a number of characteristics, the ran- 
dom variable X summarizes the wafer only in terms of the 
number of particles. 


denote the number of lines in use. Then, X can assume any of 
the integer values 0 through 48. When the system is observed, 
if 10 lines are in use, x = 10. 


tested fails, denoted as pf, is 
P( pf) = 0.8(0.2) = 0.16 


The random variable X is defined to be equal to the number of 
wafers that pass. The last column of the table shows the values 
of X that are assigned to each outcome in the experiment. 


The possible values of X are integers from zero up to 
some large value that represents the maximum number of par- 
ticles that can be found on one of the wafers. If this maximum 
number is very large, we might simply assume that the range 
of X is the set of integers from zero to infinity. 


Note that more than one random variable can be defined on a sample space. In Example 
3-3, we might also define the random variable Y to be the number of chips from a wafer that 


fail the final test. 


Table 3-1 Wafer Tests 


Outcome 
Wafer 1 Wafer 2 Probability x 
Pass Pass 0.64 D 
Fail Pass 0.16 1 
Pass Fail 0.16 1 
Fail Fail 0.04 0 
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EXERCISES FOR SECTION 3-1 


For each of the following exercises, determine the range 
(possible values) of the random variable. 

3-1. The random variable is the number of nonconforming sol- 
der connections on a printed circuit board with 1000 connections. 


3-2. Ina voice communication system with 50 lines, the ran- 
dom variable is the number of lines in use at a particular time. 


3-3. An electronic scale that displays weights to the nearest 
pound is used to weigh packages. The display shows only five 
digits. Any weight greater than the display can indicate is 
shown as 99999. The random variable is the displayed weight. 
3-4. A batch of 500 machined parts contains 10 that do not 
conform to customer requirements. The random variable is the 
number of parts in a sample of five parts that do not conform 
to customer requirements. 

3-5. A batch of 500 machined parts contains 10 that do not 
conform to customer requirements. Parts are selected suc- 
cessively, without replacement, until a nonconforming part is 
obtained. The random variable is the number of parts selected. 


3-6. The random variable is the moisture content of a lot of 
raw material, measured to the nearest percentage point. 


3-7. The random variable is the number of surface flaws in 
a large coil of galvanized steel. 


3-8. The random variable is the number of computer clock 
cycles required to complete a selected arithmetic calculation. 
3-9. An order for an automobile can select the base model or 
add any number of 15 options. The random variable is the 
number of options selected in an order. 


3-10. Wood paneling can be ordered in thicknesses of 1/8, 
1/4, or 3/8 inch. The random variable is the total thickness of 
paneling in two orders. 


3-11. A group of 10,000 people are tested for a gene 
called Ifi202 that has been found to increase the risk for lupus. 
The random variable is the number of people who carry the 
gene. 

3-12. Inan acid-base titration, the milliliters of base that are 
needed to reach equivalence are measured to the nearest milli- 
liter between 0.1 and 0.15 liters (inclusive). 

3-13. The number of mutations in a nucleotide sequence of 
length 40,000 in a DNA strand after exposure to radiation is 
measured. Each nucleotide may be mutated. 


3-2, PROBABILITY DISTRIBUTIONS AND PROBABILITY 
MASS FUNCTIONS 


Random variables are so important in random experiments that sometimes we essentially 
ignore the original sample space of the experiment and focus on the probability distribution 
of the random variable. For example, in Example 3-1, our analysis might focus exclusively on 
the integers {0, 1,...,48} in the range of X. In Example 3-2, we might summarize the ran- 
dom experiment in terms of the three possible values of X, namely {0, 1, 2}. In this manner, a 
random variable can simplify the description and analysis of a random experiment. 

The probability distribution of a random variable X is a description of the probabilities 
associated with the possible values of X. For a discrete random variable, the distribution is 
often specified by just a list of the possible values along with the probability of each. In some 
cases, it is convenient to express the probability in terms of a formula. 


EXAMPLE 3-4 Digital Channel 


There is a chance that a bit transmitted through a digital trans- 
mission channel is received in error. Let X equal the number of 
bits in error in the next four bits transmitted. The possible val- 
ues for Xare {0, 1, 2,3, 4}. Based on a model for the errors that 
is presented in the following section, probabilities for these val- 
ues will be determined. Suppose that the probabilities are 


P(X =0) = 0.6561 
P(X = 2) =0.0486 
P(X = 4) = 0.0001 


P(X = 1) = 0.2916 
P(X = 3) = 0.0036 


The probability distribution of X is specified by the possi- 
ble values along with the probability of each. A graphical 
description of the probability distribution of X is shown in 
Fig. 3-1. 

Practical Interpretation: A random experiment can often 
be summarized with a random variable and its distribution. 
The details of the sample space can often be omitted. 


Suppose a loading on a long, thin beam places mass only at discrete points. See Fig. 3-2. 
The loading can be described by a function that specifies the mass at each of the discrete 
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Figure 3-1 Probability distribution 


for bits in error. 


Figure 3-2 Loadings at discrete points on a 
long, thin beam. 


points. Similarly, for a discrete random variable X, its distribution can be described by a function 
that specifies the probability at each of the possible discrete values for X. 


Probability 
Mass Function 
Cy i =0 
(2) > Har 
GB) i) Fx = 3) 


For a discrete random variable X with possible values xj, x3, ... 
mass function is a function such that 


,X,, a probability 


(3-1) 


For the bits in error in Example 3-4, f(0) = 0.6561, f(1) = 0.2916, f(2) = 0.0486, f(3)= 
0.0036, and f(4) = 0.0001. Check that the probabilities sum to 1. 


EXAMPLE 3-5 Wafer Contamination 

Let the random variable X denote the number of semiconduc- 
tor wafers that need to be analyzed in order to detect a large 
particle of contamination. Assume that the probability that a 
wafer contains a large particle is 0.01 and that the wafers are 
independent. Determine the probability distribution of X. 

Let p denote a wafer in which a large particle is present, and 
let a denote a wafer in which it is absent. The sample space of the 
experiment is infinite, and it can be represented as all possible 
sequences that start with a string of a’s and end with p. That is, 


s = {p, ap, aap, aaap, aaaap, aaaaap, and so forth} 


Consider a few special cases. We have P(X = 1) = 
P(p) = 0.01. Also, using the independence assumption, 


P(X = 2) = P(ap) = 0.99(0.01) = 0.0099 


EXERCISES FOR SECTION 3-2 


A general formula is 


P(X = x) = P(aa... ap) =0.9%—'(0.01), for x = 1, 2,3... 


—— 
(x — 1)a’s 


Describing the probabilities associated with X in terms of this 
formula is a simple method to define the distribution of X in 
this example. Clearly f(x) = 0. The fact that the sum of the 
probabilities is 1 is left as an exercise. This is an example of a 
geometric random variable, and details are provided later in 
this chapter. 

Practical Interpretation: The random experiment here has 
an unbounded number of outcomes but it can still be conve- 
niently modeled with a discrete random variable with a (count- 
ably) infinite range. 


3-14. The sample space of a random experiment is {a, b,c, 
d, e, f}, and each outcome is equally likely. A random variable 
is defined as follows: 


outcome | @ | 4 ei ey oe | 
3 


Determine the probability mass function of X. Use the proba- 
bility mass function to determine the following probabilities: 
(a) P(X = 1.5) (b) P(0.5 < X < 2.7) 

(c) P(X > 3) (d) P(O =X <2) 

(ec) P(X =0 or X=2) 
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For Exercises 3-15 to 3-18, verify that the following functions 
are probability mass functions, and determine the requested 
probabilities. 


3-15. x 


(a) P(X <2) (b) P(X > —2) 

(c) P(-1<X<1) (dd) P(X<-1 or X=2) 
3-16. f(x) = (8/7\(1/2), x = 1,2,3 

(a) P(X < 1) (b) P(X > 1) 

(c) P(2<X<6) (d) (X<1 or X¥>1) 
3-17. f(x) = a Leo 3a 

(a) P(X = 4) (b) P(X < 1) 

(c) P(2<X<4) (d) P(X> 10) 

3-18. f(x) = (3/4)\(1/4), x = 0,1,2,... 
(a) P(X=2) (b) P(X <2) 

(c) (X>2) (@ P(X=1) 


3-19. An article in Knee Surgery, Sports Traumatology, 
Arthroscopy [“Arthroscopic Meniscal Repair with an Absorbable 
Screw: Results and Surgical Technique” (2005, Vol. 13, 
pp. 273-279)] cites a success rate of more than 90% for 
meniscal tears with a rim width of less than 3 mm, but only a 
67% success rate for tears of 3-6 mm. If you are unlucky 
enough to suffer a meniscal tear of less than 3 mm on your left 
knee and one of width 3—6 mm on your right knee, what is the 
probability mass function of the number of successful surgeries? 
Assume the surgeries are independent. 


3-20. An optical inspection system is to distinguish 
among different part types. The probability of a correct 
classification of any part is 0.98. Suppose that three parts 
are inspected and that the classifications are independent. 
Let the random variable X denote the number of parts that 
are correctly classified. Determine the probability mass 
function of X. 


3-21. In a semiconductor manufacturing process, three 
wafers from a lot are tested. Each wafer is classified as pass or 
fail. Assume that the probability that a wafer passes the test is 
0.8 and that wafers are independent. Determine the probabil- 
ity mass function of the number of wafers from a lot that pass 
the test. 


3-22. The space shuttle flight control system called PASS 
(Primary Avionics Software Set) uses four independent 
computers working in parallel. At each critical step, the 
computers “vote” to determine the appropriate step. The 
probability that a computer will ask for a roll to the left 
when a roll to the right is appropriate is 0.0001. Let X de- 
note the number of computers that vote for a left roll when a 
right roll is appropriate. What is the probability mass func- 
tion of X? 


3-23. A disk drive manufacturer sells storage devices with 
capacities of one terabyte, 500 gigabytes, and 100 gigabytes 
with probabilities 0.5, 0.3, and 0.2, respectively. The revenues 
associated with the sales in that year are estimated to be $50 
million, $25 million, and $10 million, respectively. Let X de- 
note the revenue of storage devices during that year. 
Determine the probability mass function of X. 


3-24. Marketing estimates that a new instrument for the 
analysis of soil samples will be very successful, moderately 
successful, or unsuccessful, with probabilities 0.3, 0.6, 
and 0.1, respectively. The yearly revenue associated with 
a very successful, moderately successful, or unsuccessful 
product is $10 million, $5 million, and $1 million, respec- 
tively. Let the random variable X denote the yearly revenue 
of the product. Determine the probability mass function 
of X. 


3-25. The distributor of a machine for cytogenics has 
developed a new model. The company estimates that when it 
is introduced into the market, it will be very successful with a 
probability 0.6, moderately successful with a probability 0.3, 
and not successful with probability 0.1. The estimated 
yearly profit associated with the model being very successful 
is $15 million and with it being moderately successful is 
$5 million; not successful would result in a loss of $500,000. 
Let X be the yearly profit of the new model. Determine the 
probability mass function of X. 


3-26. Anassembly consists of two mechanical components. 
Suppose that the probabilities that the first and second compo- 
nents meet specifications are 0.95 and 0.98. Assume that the 
components are independent. Determine the probability mass 
function of the number of components in the assembly that 
meet specifications. 


3-27. An assembly consists of three mechanical compo- 
nents. Suppose that the probabilities that the first, second, and 
third components meet specifications are 0.95, 0.98, and 0.99. 
Assume that the components are independent. Determine the 
probability mass function of the number of components in the 
assembly that meet specifications. 


3-28. The data from 200 endothermic reactions involving 
sodium bicarbonate are summarized as follows: 


Final Temperature Number 
Conditions of Reactions 
266 K 48 
271K 60 
274K 92 


Calculate the probability mass function of final temperature. 


3-29. Actual lengths of stay at a hospital’s emergency de- 
partment in 2009 are shown in the following table (rounded to 
the nearest hour). Length of stay is the total of wait and service 
times. Some longer stays are also approximated as 15 hours in 
this table. 


Hours Count Percent 
1 19 3.80 
2 51 10.20 
3 86 17.20 
4 102 20.40 
5 87 17.40 
6 62 12.40 
7 40 8.00 
8 18 3.60 
9 14 2.80 

10 11 2.20 
15 10 2.00 


Calculate the probability mass function of the wait time for 
service. 


3-30. The distribution of the time until a Web site changes is 
important to Web crawlers that are used by search engines to 
maintain current information about Web sites. The distribution 
of the time until change (in days) of a Web site is approxi- 
mated in the following table. 


Days until Changes Probability 
1.5 0.05 
3.0 0.25 
4.5 0.35 
5.0 0.20 
7.0 0.15 
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Calculate the probability mass function of the days until 
change. 


3-31. The following table shows the typical depth (rounded 
to the nearest foot) for non-failed wells in geological forma- 
tions in Baltimore County (Zhe Journal of Data Science, 
2009, Vol. 7, pp. 111-127). 


Geological 

Formation Number of Non-failed 
Group Non-failed Wells = Well Depth 
Gneiss 1,515 255 
Granite 26 218 
Loch Raven Schist 3,290 317 
Mafic 349 231 
Marble 280 267 
Prettyboy Schist 1,343 255 
Other schists 887 267 
Serpentine 36 217 
Total 7,726 2,027 


Calculate the probability mass function of depth for non-failed 
wells from the table. 
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EXAMPLE 3-6 Digital Channel 
In Example 3-4, we might be interested in the probability of 
three or fewer bits being in error. This question can be ex- 
pressed as P(X = 3). 

The event that {X <3} is the union of the events 
{X = 0}, {¥ = 1}, {¥ = 2}, and {X= 3}. Clearly, these 
three events are mutually exclusive. Therefore, 


P(X <3) = P(X = 0) + P(X =1) + P(X =2) + P(X =3) 
= 0.6561 + 0.2916 + 0.0486 + 0.0036 = 0.9999 


This approach can also be used to determine 


P(X = 3) = P(X <3) — P(X < 2) = 0.0036 


Example 3-6 shows that it is sometimes useful to be able to provide cumulative proba- 
bilities such as P(X S x) and that such probabilities can be used to find the probability mass 
function of a random variable. Therefore, using cumulative probabilities is an alternate 
method of describing the probability distribution of a random variable. 

In general, for any discrete random variable with possible values xj, x7,...,X;s 
the events {X =x}, {X¥ =xp},..., {X =x,} are mutually exclusive. Therefore, 


PXYS x) = Deel tp: 
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Cumulative 
Distribution 


Function F(x) is 


(2) 0<F(x) <1 
(3) Ifx sy), 


then F(x) S Fly) 


The cumulative distribution function of a discrete random variable X, denoted as 


Mx) = PX=x) = > fe) 


i 


For a discrete random variable X, F(x) satisfies the following properties. 


(1) F(x) = P(X <x) = Dr f(x) 


(G-2) 


Properties (1) and (2) of a cumulative distribution function follow from the definition. 
Property (3) follows from the fact that if x =< y, the event that {X =< x} is contained in the 
event {X < y}. Like a probability mass function, a cumulative distribution function pro- 
vides probabilities. Even if the random variable X can only assume integer values, the 
cumulative distribution function can be defined at noninteger values. In Example 3-6, 
F(.5) = P(X = 1.5) = P{X = 0} + P(X = 1) = 0.6561 + 0.2916 = 0.9477. The next ex- 
ample shows how the cumulative distribution function can be used to determine the probability 
mass function of a discrete random variable. 


EXAMPLE 3-7 Cumulative Distribution Function 
Determine the probability mass function of X from the follow- 
ing cumulative distribution function: 


2. The probability mass function at each point is the change 
in the cumulative distribution function at the point. 
Therefore, 


0 eS =? 
_jJo2 -2=x<0 fl-2) = 02 -0=02 
FO) = )o7  o<x<2 
; Ss AO) = 0.7 — 0.2 = 0.5 


Figure 3-3 displays a plot of F(x). From the plot, the 
only points that receive nonzero probability are —2, 0, and 


EXAMPLE 3-8 Sampling without Replacement 
Suppose that a day’s production of 850 manufactured parts 
contains 50 parts that do not conform to customer require- 
ments. Two parts are selected at random, without replacement, 
from the batch. Let the random variable X equal the number of 
nonconforming parts in the sample. What is the cumulative 
distribution function of X? 

The question can be answered by first finding the proba- 
bility mass function of X. 


800 799 
P(X = 0) =——-— =0. 
Ue) ag gag 
800 50 
P(X =1)=2+—-——=0111 
850 849 
50 49 
P(X = 2) = 0.003 


~ 850 849 


fQ2) = 1.0 -0.7=03 


Therefore, 


Al 
= 
] 
ty 


(X = 0) = 0.886 
F(1) = P(X = 1) = 0.886 + 0.111 = 0.997 


The cumulative distribution function for this example is 
graphed in Fig. 3-4. Note that F(x) is defined for all x from 
—% <x < and not only for 0, 1, and 2. 


F(x) 


0.2 
oo 
<—_———o 
-2 6) 


Figure 3-3 Cumulative distribution function for 


Example 3-7. 


EXERCISES FOR SECTION 3-3 
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F(x) 
eo——____»> 1.000 o—_————————_> 
0.997 eo——o 
0.886 e———° 
<—o 
2 x 0 1 2 x 


Figure 3-4 Cumulative distribution 
function for Example 3-8. 


3-32. Determine the cumulative distribution function of the 
random variable in Exercise 3-14. 

3-33. Determine the cumulative distribution function for 
the random variable in Exercise 3-15; also determine the fol- 
lowing probabilities: 

(a) P(X = 1.25) (b) P(X S 2.2) 

(c) P(-1.1<X=1) (d) P(X>0) 

3-34. Determine the cumulative distribution function for the 
random variable in Exercise 3-16; also determine the following 
probabilities: 


(a) P(X <1.5) (b) P(X S3) 
(c) P(X > 2) (d) Pl < XS 2) 
3-35. Determine the cumulative distribution function for 


the random variable in Exercise 3-21. 

3-36. Determine the cumulative distribution function for 
the random variable in Exercise 3-22. 

3-37. Determine the cumulative distribution function for 
the random variable in Exercise 3-23. 

3-38. Determine the cumulative distribution function for 
the variable in Exercise 3-24. 


Verify that the following functions are cumulative distribution 
functions, and determine the probability mass function and the 
requested probabilities. 


3-39. 0 x<l 
F(x) = 40.5 l=x<3 
1 32x 
(a) P(X = 3) (b) P(X = 2) 
() PL<X<2) @d) P(X>2) 


3-40. Errors in an experimental transmission channel are 
found when the transmission is checked by a certifier that de- 
tects missing pulses. The number of errors found in an eight- 
bit byte is a random variable with the following distribution: 


0 x<l1 
0.7 1sx<4 
0.9 4=x<7 
1 7=sx 


Determine each of the following probabilities: 


(a) P(X <4) (b) P(X>7) 
(c) P(X<5) (dd) P(X>4) 
(ce) P(X <2) 
3-41. 0 x<-10 
3025  -10<x<30 
FO) = ) 9.75 30 <x <50 
1 50 =x 
(a) P(X < 50) (b) P(X < 40) 
(c) P('40<X<60) (d) P(X¥<0) 


(e) PO = X < 10) (f) P(-10 < X < 10) 

3-42. The thickness of wood paneling (in inches) that a cus- 
tomer orders is a random variable with the following cumula- 
tive distribution function: 


0 x < 1/8 
0.2 1/8s<x<1/4 
0.9 1/4<=x < 3/8 
1 3/8 =x 


Determine the following probabilities: 

(a) P(X = 1/18) (b) P(X = 1/4) 

(c) P(X = 5/16) (d) P(X> 1/4) 

(e) P(X = 1/2) 

3-43. Determine the cumulative distribution function for 
the random variable in Exercise 3-28. 

3-44. Determine the cumulative distribution function for 
the random variable in Exercise 3-29. 

3-45. Determine the cumulative distribution function for 
the random variable in Exercise 3-30. 

3-46. Determine the cumulative distribution function for 
the random variable in Exercise 3-31. 
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3-4 MEAN AND VARIANCE OF A DISCRETE 
RANDOM VARIABLE 


Two numbers are often used to summarize a probability distribution for a random variable X. 
The mean is a measure of the center or middle of the probability distribution, and the variance 
is a measure of the dispersion, or variability in the distribution. These two measures do not 
uniquely identify a probability distribution. That is, two different distributions can have the 
same mean and variance. Still, these measures are simple, useful summaries of the probabil- 
ity distribution of X. 


Mean and 
Variance The mean or expected value of the discrete random variable X, denoted as w or 
E(X), is 


p= BX) = Daf) (3-3) 


The variance of X, denoted as o” or V(X), is 


o? = WX) = EX wh = SO — wl) = DA) - w? 


The standard deviation of X is o = Vo". 


The mean of a discrete random variable X is a weighted average of the possible values of 
X, with weights equal to the probabilities. If f(x) is the probability mass function of a loading 
on a long, thin beam, E(X) is the point at which the beam balances. Consequently, E(X) 
describes the “center” of the distribution of X in a manner similar to the balance point of a 
loading. See Fig. 3-5. 

The variance of a random variable X is a measure of dispersion or scatter in the possible 
values for X. The variance of X uses weight f(x) as the multiplier of each possible squared 
deviation (x — wh ae Figure 3-5 illustrates probability distributions with equal means but dif- 
ferent variances. Properties of summations and the definition of can be used to show the 
equality of the formulas for variance. 


VX) = Se — wha) = SPA) - 2m Sols) + w? DA) 


=>) = sr = See 


x 


et | 


0 2 4 YY 8 10 0 2 4 oh 8 10 


(a) (b) 
Figure 3-5 A probability distribution can be viewed as a loading with the mean equal 
to the balance point. Parts (a) and (b) illustrate equal means, but Part (a) illustrates a 
larger variance. 
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(b) 


Figure 3-6 The probability distributions illustrated in Parts (a) and (b) differ even 
though they have equal means and equal variances. 


Either formula for V(x) can be used. Figure 3-6 illustrates that two probability distributions 
can differ even though they have identical means and variances. 


EXAMPLE 3-9 Digital Channel 

In Example 3-4, there is a chance that a bit transmitted through a 
digital transmission channel is received in error. Let XY equal the 
number of bits in error in the next four bits transmitted. The pos- 
sible values for X are {0, 1, 2, 3, 4}. Based on a model for the er- 
rors that is presented in the following section, probabilities for 
these values will be determined. Suppose that the probabilities are 


P(X = 0) = 0.6561 P(X = 2) = 0.0486 P(X = 4) = 0.0001 
P(X = 1) = 0.2916 P(X = 3) = 0.0036 


Now 


pw. = E(X) = Of(0) + If(1) + 2f(2) + 3f(3) + 414) 
= 0(0.6561) + 1(0.2916) + 2(0.0486) 
+ 3(0.0036) + 4(0.0001) 


= 04 


Although X never assumes the value 0.4, the weighted average 
of the possible values is 0.4. 
To calculate V(X), a table is convenient. 


EXAMPLE 3-10 Marketing 
Two new product designs are to be compared on the basis of 
revenue potential. Marketing feels that the revenue from design 
A can be predicted quite accurately to be $3 million. The rev- 
enue potential of design B is more difficult to assess. Marketing 
concludes that there is a probability of 0.3 that the revenue from 
design B will be $7 million, but there is a 0.7 probability that the 
revenue will be only $2 million. Which design do you prefer? 

Let X denote the revenue from design A. Because there is 
no uncertainty in the revenue from design A, we can model the 
distribution of the random variable X as $3 million with prob- 
ability 1. Therefore, E(X) = $3 million. 

Let Y denote the revenue from design B. The expected 
value of Y in millions of dollars is 


E(Y) = $7(0.3) + $2(0.7) = $3.5 


x x-04 (x-04) f(x) S(x)\(x — 0.4) 
0 —0.4 0.16 0.6561 0.104976 
1 0.6 0.36 0.2916 0.104976 
D 1.6 2.56 0.0486 0.124416 
3 2.6 6.76 0.0036 0.024336 
4 3.6 12.96 0.0001 0.001296 


VX) =0 = S f(x)(x; — 0.4)? = 0.36 


The alternative formula for variance could also be used to 
obtain the same result. 

Practical Interpretation: The mean and variance summa- 
rize the distribution of a random variable. The mean is a 
weighted average of the values and the variance measures the 
dispersion of the values from the mean. Different distributions 
may have the same mean and variance. 


Because E(Y) exceeds E(X), we might prefer design B. However, 
the variability of the result from design B is larger. That is, 


o? = (7 — 3.5)°(0.3) + (2 — 3.5)°(0.7) 


= 5.25 millions of dollars squared 


Because the units of the variables in this example are millions of 
dollars, and because the variance of a random variable squares 
the deviations from the mean, the units of o” are millions of 
dollars squared. These units make interpretation difficult. 
Because the units of standard deviation are the same as the 
units of the random variable, the standard deviation o is easier 
to interpret. In this example, we can summarize our results as 
“the average deviation of Y from its mean is $2.29 million.” 
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EXAMPLE 3-11 Messages 


The number of messages sent per hour over a computer network 
has the following distribution: 


x = number of messages | 10 | 11 | 12 | 13 | 14 | 15 


T(x) |0.08 | 0.15 | 0.30 | 0.20 | 0.20 | 0.07 


Determine the mean and standard deviation of the number of 
messages sent per hour. 


E(X) = 10(0.08) + 11(0.15) + +++ + 15(0.07) = 12.5 


V(X) = 10°(0.08) + 117(0.15) + --- + 157(0.07) 
— 12.57 = 1.85 


o = V(X) = V 185 = 1.36 


The variance of a random variable X can be considered to be the expected value of a specific 
function of X, namely, h(X) = (X — «)’. In general, the expected value of any function A(X) 
of a discrete random variable is defined in a similar manner. 


Expected Value of a 
Function of a 
Discrete Random 
Variable 


If X is a discrete random variable with probability mass function f(x), 


E[HX)] = SY Ae)f(e) 


(3-4) 


EXAMPLE 3-12 Digital Channel 


In Example 3-9, X is the number of bits in error in the next 
four bits transmitted. What is the expected value of the square 
of the number of bits in error? Now, A(X) = X. Therefore, 


E[h(X)] = 0? X 0.6561 + 1? x 0.2916 + 27 x 0.0486 
+ 3? X 0.0036 + 47 X 0.0001 = 0.52 


Practical Interpretation: The expected value of a function 
of arandom variable is simply a weighted average of the func- 
tion evaluated at the values of the random variable. 


In the previous example, the expected value of X? does not equal E(X) squared. However, 
in the special case that A(X) = aX + b for any constants a and b, E[h(X)] =aE(X) + b. This 
can be shown from the properties of sums in the definition in Equation 3-4. 


EXERCISES FOR SECTION 3-4 


3-47. If the range of X is the set {0, 1, 2, 3, 4} and P(X = 
x) = 0.2, determine the mean and variance of the random 
variable. 

3-48. Determine the mean and variance of the random variable 
in Exercise 3-14. 

3-49. Determine the mean and variance of the random variable 
in Exercise 3-15. 

3-50. Determine the mean and variance of the random variable 
in Exercise 3-16. 

3-51. Determine the mean and variance of the random variable 
in Exercise 3-17. 

3-52. Determine the mean and variance of the random variable 
in Exercise 3-18. 

3-53. Determine the mean and variance of the random variable 
in Exercise 3-19. 


3-54. Determine the mean and variance of the random vari- 
able in Exercise 3-20. 

3-55. The range of the random variable X is [0, 1, 2, 3, x], 
where x is unknown. If each value is equally likely and the 
mean of X is 6, determine x. 


3-56. Ina NiCd battery, a fully charged cell is composed of 
nickelic hydroxide. Nickel is an element that has multiple 
oxidation states. Assume the following proportions of the 
states: 


Nickel Charge Proportions Found 
0 0.17 
+2 0.35 
+3 0.33 
+4 0.15 


(a) Determine the cumulative distribution function of nickel 
charge. 
(b) Determine the mean and variance of the nickel charge. 


3-57. The space shuttle flight control system called PASS 
(Primary Avionics Software Set) uses four independent com- 
puters working in parallel. At each critical step, the comput- 
ers “vote” to determine the appropriate step. The probability 
that a computer will ask for a roll to the left when a roll to the 
right is appropriate is 0.0001. Let X denote the number of 
computers that vote for a left roll when a right roll is appro- 
priate. What is the mean and variance of X? 


3-58. Trees are subjected to different levels of carbon dioxide 
atmosphere with 6% of the trees in a minimal growth condition 
at 350 parts per million (ppm), 10% at 450 ppm (slow growth), 
47% at 550 ppm (moderate growth), and 37% at 650 ppm (rapid 
growth). What are the mean and standard deviation of the car- 
bon dioxide atmosphere (in ppm) for these trees in ppm? 


3-59. An article in the Journal of Database Management 
(“Experimental Study of a Self-Tuning Algorithm for DBMS 
Buffer Pools” (2005, Vol. 16, pp. 1—20)] provided the workload 
used in the TPC-C OLTP (Transaction Processing Performance 


Average Frequencies and Operations in TPC-C 
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Council’s Version C On-Line Transaction Processing) bench- 
mark, which simulates a typical order entry application. 


The frequency of each type of transaction (in the second col- 

umn) can be used as the percentage of each type of transac- 

tion. The average number of se/ects operations required for 
each type transaction is shown. 

(a) Determine the mean and standard deviation of the number 
of selects operations for a transaction from the distribution 
of types shown in the table. 

(b) Determine the mean and standard deviation of the total 
number of operations (selects, updates, ..., and joins) for 
a transaction from the distribution of types shown in the 
table. 

3-60. Calculate the mean and variance for the random variable 

in Exercise 3-28. 

3-61. Calculate the mean and variance for the random variable 

in Exercise 3-29. 

3-62. Calculate the mean and variance for the random variable 

in Exercise 3-30. 

3-63. Calculate the mean and variance for the random variable 

in Exercise 3-31. 


Non-Unique 
Transaction Frequency Selects Updates Inserts Deletes Selects Joins 
New Order 43 23 11 12 0 0 0 
Payment 44 4.2 3 1 0 0.6 0 
Order Status 11.4 0 0 0 0.6 0 
Delivery 5 130 120 0 10 0 0 
Stock Level 4 0 0 0 0 0 1 
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The simplest discrete random variable is one that assumes only a finite number of possible 
values, each with equal probability. A random variable X that assumes each of the values 


X1,X2,-- 


., X,, with equal probability 1/n, is frequently of interest. 


Discrete 
Uniform 


Distribution its range, say, X1, X, .. 


A random variable _X has a discrete uniform distribution if each of the n values in 
. , Xp, has equal probability. Then, 


f(xi) = 1/n (3-5) 
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Figure 3-7 Probability f() 
mass function for a 01 e 6 “ec - a “e! SG am - ee GS 
discrete uniform random 
variable. 


EXAMPLE 3-13 
The first digit of a part’s serial number is equally likely to be value in R = {0, 1, 2, ..., 9}. That is, 
any one of the digits 0 through 9. If one part is selected from a f= 04 

large batch and_X is the first digit of the serial number, X has a 


discrete uniform distribution with probability 0.1 for each for each value in R. The probability mass function of X is 
shown in Fig. 3-7. 


Suppose the range of the discrete random variable X is the consecutive integers a, 
a+1,a+2,...,b, fora = b. The range of X contains b — a + 1 values each with proba- 
bility 1/(b — a + 1). Now, 


7 1 
w= SHG) 


2 b(b + 1)-(a—- l)a 
The algebraic identity DS k= ( ) 5) ( ) 
k=a 


w = (b + a)/2, The derivation of the variance is left as an exercise. 


can be used to simplify the result to 


Mean and 


Variance Suppose X is a discrete uniform random variable on the consecutive integers 
a,a+1l,a+2,...,b,fora = b. The mean of X is 


bt+a 
p = E(X) = —, 


The variance of X is 


o (3-6) 


EXAMPLE 3-14 Number of Voice Lines 

As in Example 3-1, let the random variable X denote the number and 
of the 48 voice lines that are in use at a particular time. 
Assume that X is a discrete uniform random variable with a 


o = {[(48 —0 + 1)? — 1]/12}'7 = 14.14 


range of 0 to 48. Then, Practical Interpretation: The average number of lines in use is 
24 but the dispersion (as measured by @) is large. Therefore, 
E(X) = (48 + 0)/2 = 24 at many times far more or fewer than 24 lines are in use. 


Equation 3-6 is more useful than it might first appear. If all the values in the range of a 
random variable _X are multiplied by a constant (without changing any probabilities), the mean 
and standard deviation of X are multiplied by the constant. You are asked to verify this result 
in an exercise. Because the variance of a random variable is the square of the standard devia- 
tion, the variance of X is multiplied by the constant squared. More general results of this type 
are discussed in Chapter 5. 


EXAMPLE 3-15 Proportion of Voice Lines 

Let the random variable Y denote the proportion of the 48 
voice lines that are in use at a particular time, and_X denote the 
number of lines that are in use at a particular time. Then, 
Y = X/48. Therefore, 


EXERCISES FOR SECTION 3-5 
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E(Y) = E(X)/48 = 0.5 
and 


V(Y) = V(X)/48? = 0.087 


3-64. Let the random variable X have a discrete uniform 

distribution on the integers 0 = x = 99. Determine the mean 

and variance of X. 

3-65. Let the random variable X have a discrete uniform 

distribution on the integers 1 = x = 3. Determine the mean 

and variance of X. 

3-66. Thickness measurements of a coating process are 

made to the nearest hundredth of a millimeter. The thickness 

measurements are uniformly distributed with values 0.15, 

0.16, 0.17, 0.18, and 0.19. Determine the mean and variance 

of the coating thickness for this process. 

3-67. Product codes of two, three, four, or five letters are 

equally likely. What is the mean and standard deviation of the 

number of letters in the codes? 

3-68. The lengths of plate glass parts are measured to the 

nearest tenth of a millimeter. The lengths are uniformly dis- 

tributed, with values at every tenth of a millimeter starting at 

590.0 and continuing through 590.9. Determine the mean and 

variance of the lengths. 

3-69. Assume that the wavelengths of photosynthetically 

active radiations (PAR) are uniformly distributed at integer 

nanometers in the red spectrum from 675 to 700 nm. 

(a) What is the mean and variance of the wavelength distribu- 
tion for this radiation? 

(b) If the wavelengths are uniformly distributed at integer 
nanometers from 75 to 100 nanometers, how does the 
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mean and variance of the wavelength distribution com- 
pare to the previous part? Explain. 


3-70. The probability of an operator entering alpha- 
numeric data incorrectly into a field in a database is equally 
likely. The random variable X is the number of fields on a 
data entry form with an error. The data entry form has 
28 fields. Is _X a discrete uniform random variable? Why or 
why not? 

3-71. Suppose that X has a discrete uniform distribution on 
the integers 0 through 9. Determine the mean, variance, and 
standard deviation of the random variable Y = 5X and com- 
pare to the corresponding results for _X. 


3-72. Show that for a discrete uniform random variable_X, 
if each of the values in the range of X is multiplied by the 
constant c, the effect is to multiply the mean of X by c and 
the variance of Y by c’. That is, show that E(cX) = cE(X) 
and V(cX) = c?V(X). 

3-73. The number of pages in a PDF document you create has 
a discrete uniform distribution from five to nine pages (including 
the end points). What are the mean and standard deviation of the 
number of pages in the document? 


3-74. Suppose that nine-digit Social security numbers are 
assigned at random. If you randomly select a number, what is 
the probability that it belongs to one of the 300 million people 
in the United States? 


Consider the following random experiments and random variables: 


1. Flip acoin 10 times. Let X¥ = number of heads obtained. 


2. A worn machine tool produces 1% defective parts. Let XY = number of defective parts 
in the next 25 parts produced. 


3. Each sample of air has a 10% chance of containing a particular rare molecule. Let 
X = the number of air samples that contain the rare molecule in the next 18 samples 


analyzed. 


4. Of all bits transmitted through a digital transmission channel, 10% are received in 
error. Let X¥ = the number of bits in error in the next five bits transmitted. 


5. A multiple-choice test contains 10 questions, each with four choices, and you guess 
at each question. Let Y = the number of questions answered correctly. 
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6. In the next 20 births at a hospital, let ¥ = the number of female births. 


7. Of all patients suffering a particular illness, 35% experience improvement from a 
particular medication. In the next 100 patients administered the medication, let 
X = the number of patients who experience improvement. 


These examples illustrate that a general probability model that includes these experiments as 
particular cases would be very useful. 

Each of these random experiments can be thought of as consisting of a series of repeated, 
random trials: 10 flips of the coin in experiment 1, the production of 25 parts in experiment 2, 
and so forth. The random variable in each case is a count of the number of trials that meet a 
specified criterion. The outcome from each trial either meets the criterion that XY counts or it 
does not; consequently, each trial can be summarized as resulting in either a success or a fail- 
ure. For example, in the multiple-choice experiment, for each question, only the choice that is 
correct is considered a success. Choosing any one of the three incorrect choices results in the 
trial being summarized as a failure. 

The terms success and failure are just labels. We can just as well use A and B or 0 or 1. 
Unfortunately, the usual labels can sometimes be misleading. In experiment 2, because X 
counts defective parts, the production of a defective part is called a success. 

A trial with only two possible outcomes is used so frequently as a building block of a ran- 
dom experiment that it is called a Bernoulli trial. It is usually assumed that the trials that con- 
stitute the random experiment are independent. This implies that the outcome from one trial 
has no effect on the outcome to be obtained from any other trial. Furthermore, it is often rea- 
sonable to assume that the probability of a success in each trial is constant. In the multiple- 
choice experiment, if the test taker has no knowledge of the material and just guesses at each 
question, we might assume that the probability of a correct answer is 1/4 for each question. 


EXAMPLE 3-16 Digital Channel 
The chance that a bit transmitted through a digital transmission 
channel is received in error is 0.1. Also, assume that the trans- 
mission trials are independent. Let XY = the number of bits in 
error in the next four bits transmitted. Determine P(X = 2). 
Let the letter F denote a bit in error, and let the letter O 
denote that the bit is okay, that is, received without error. We 
can represent the outcomes of this experiment as a list of four 
letters that indicate the bits that are in error and those that are 
okay. For example, the outcome OFOE indicates that the second 
and fourth bits are in error and the other two bits are okay. The 
corresponding values for x are 


Outcome x Outcome x 
OOOO 0 EOOO 1 
OOOE 1 EOOE 2, 
OOEO 1 EOQEO D 
OOEE 2 EQEE 3 
OEOO 1 EEOO D 
OEOE 2 EEOE 3 
OEEO D, EEEO 3 
OEEE 3 EEEE 4 


The event that Y = 2 consists of the six outcomes: 
{EEOO, EOEO, EOOE, OEEO, OEOE, OOEE} 


Using the assumption that the trials are independent, the 
probability of {EEOO} is 


P(EEOO) = P(E)P(E)P(O)P(O) = (0.1)°(0.9)? = 0.0081 


Also, any one of the six mutually exclusive outcomes for 
which X = 2 has the same probability of occurring. Therefore, 


P(X = 2) = 6(0.0081) = 0.0486 
In general, 


P(X = x) = (number of outcomes that result in x errors) 
x (0.1)(0.9)*-* 


To complete a general probability formula, only an expres- 
sion for the number of outcomes that contain x errors is 
needed. An outcome that contains x errors can be constructed 
by partitioning the four trials (letters) in the outcome into 
two groups. One group is of size x and contains the errors, 
and the other group is of size n — x and consists of the trials 
that are okay. The number of ways of partitioning four objects 
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4! 


4 4 
into two groups, one of which is of size x, is (‘) = m4 —x)! Notice that (5) = 4!/[2! 2!] = 6, as found above. The 


Therefore, in this example, probability mass function of X was shown in Example 3-4 and 


Binomial 
Distribution 


Fig. 3-1. 


The previous example motivates the following result. 


A random experiment consists of n Bernoulli trials such that 


(1) The trials are independent. 


(2) Each trial results in only two possible outcomes, labeled as “success” and 
“failure.” 


(3) The probability of a success in each trial, denoted as p, remains constant. 


The random variable X that equals the number of trials that result in a success 
has a binomial random variable with parameters 0 < p < 1 andn = 1, 2,.... The 
probability mass function of X is 


f(x) = @iac =p x 0 oat (3-7) 


As in Example 3-16, (") equals the total number of different sequences of trials that 
x 
contain x successes and n — x failures. The total number of different sequences that contain x 
successes and n — x failures times the probability of each sequence equals P(X = x). 
The probability expression above is a very useful formula that can be applied in a num- 
ber of examples. The name of the distribution is obtained from the binomial expansion. For 


constants a and b, the binomial expansion is 


Let p denote the probability of success on a single trial. Then, by using the binomial 
expansion with a = p and b = 1 — p:p, we see that the sum of the probabilities for a bino- 
mial random variable is 1. Furthermore, because each trial in the experiment is classified 
into two outcomes, {success, failure}, the distribution is called a “bi”-nomial. A more 
general distribution, which includes the binomial as a special case, is the multinomial 
distribution, and this is presented in Chapter 5. 

Examples of binomial distributions are shown in Fig. 3-8. For a fixed n, the distribution 
becomes more symmetric as p increases from 0 to 0.5 or decreases from 1 to 0.5. For a fixed 
p, the distribution becomes more symmetric as n increases. 
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Figure 3-8 Binomial distributions for selected values of n and p. 


EXAMPLE 3-17 


Several examples using the binomial coefficient (") follow. 
x 
10 
e ) = 10!/[3! 7!] = (10-9 + 8)/(3 + 2) = 120 


& = 15!/[10! 5!] = (15-14-13 + 12+ 11)/(5+4+ 3-2) 


= 3003 


EXAMPLE 3-18 Organic Pollution 
Each sample of water has a 10% chance of containing a partic- 
ular organic pollutant. Assume that the samples are independent 
with regard to the presence of the pollutant. Find the probability 
that in the next 18 samples, exactly 2 contain the pollutant. 

Let X = the number of samples that contain the pollutant 
in the next 18 samples analyzed. Then X is a binomial random 
variable with p = 0.1 and n = 18. Therefore, 


1 
now ("8) = 18!/[2! 16!] = 18(17)/2 = 153. Therefore, 


P(X = 2) = 153(0.1)(0.9)'® = 0.284 
Determine the probability that at least four samples 
contain the pollutant. The requested probability is 


18 


P(X = 4) = > ( 


x=4 


18 
x 


Joos 


e 
eo 
eo 
eo 
i) 
So8 
o 
N —_—e—o 
00 $A 
0 = e—S— — — ——— —  — — ————————————essSsSsSsSsssSsFé 
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BL-oe 


gk O—e 
areO 


1 
( -) = 100!/[4! 96!] = (100 - 99 98 + 97)/(4 +3 + 2) 
= 3,921,225 


Also recall that 0! = 1. 


However, it is easier to use the complementary event, 


P(X=4) =1-P(X<4)=1- S (‘Jono 


x=0 \*¥ 


= 1 — [0.150 + 0.300 + 0.284 + 0.168] = 0.098 


Determine the probability that 3 = ¥ < 7. Now 


PB=X<7)= >(‘") (0.1)(0.9)!8-~* 


x=3 
= 0.168 + 0.070 + 0.022 + 0.005 
= 0.265 


Practical Interpretation: Binomial random variables are 
used to model many physical systems and probabilities for all 
such models can be obtained from the binomial probability 
mass function. 
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A table of cumulative binomial probabilities is provided in Appendix A and this can sim- 
plify some calculations. For example, the binomial distribution in Example 3-16 has p = 0.1 
and n = 4. A probability such as P(X = 2) can be calculated from the table as 

P(X = 2) = P(X S 2) — P(X S 1) = 0.9963 — 0.9477 = 0.0486 
and this agrees with the result obtained previously. 


The mean and variance of a binomial random variable can be obtained from an analysis 
of the independent trials that comprise the binomial experiment. Define new random variables 


: O otherwise 


= \ if ith trial is a success 
fori = 1,2,...,n. Then, 
X=X, +X) ++: +X, 
Also, it is easy to derive the mean and variance of each_X; as 
E(X;) = Ip + 011 — p) =p 
and 
WX) = =pyp + O=pyd =p) =p —2) 


Sums of random variables are discussed in Chapter 5, and there, the intuitively reasonable 
result that 


E(X) = E(X) + E(X) + + EX) 


is derived. Furthermore, for the independent trials of a binomial experiment it is also shown 
in Chapter 5 that 


VX) = VX) + V(X) + + Xn) 


Because E(X;) =p and V(X;)=p(l1—p), we obtain the solution E(X) = np and 
WX) = np(l — p). 


Mean and 
Variance If X is a binomial random variable with parameters p and n, 
w=E(X)=np and = o% = KX) =np(1 —p) (3-8) 
EXAMPLE 3-19 
For the number of transmitted bits received in error in and these results match those obtained from a direct calcula- 
Example 3-16, n = 4 and p = 0.1, so tion in Example 3-9. 


E(X) = 4(0.1)=04 and V(X) = 4(0.1)(0.9) = 0.36 


84 CHAPTER 3 DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS 


EXERCISES FOR SECTION 3-6 


3-75. Foreach scenario described below, state whether or not 
the binomial distribution is a reasonable model for the random 
variable and why. State any assumptions you make. 

(a) A production process produces thousands of temperature 
transducers. Let X denote the number of nonconforming 
transducers in a sample of size 30 selected at random from 
the process. 

(b) From a batch of 50 temperature transducers, a sample of 
size 30 is selected without replacement. Let X denote the 
number of nonconforming transducers in the sample. 

(c) Four identical electronic components are wired to a con- 
troller that can switch from a failed component to one of 
the remaining spares. Let XY denote the number of compo- 
nents that have failed after a specified period of operation. 

(d) Let X denote the number of accidents that occur along 
the federal highways in Arizona during a one-month 
period. 

(e) Let X denote the number of correct answers by a student tak- 
ing a multiple-choice exam in which a student can eliminate 
some of the choices as being incorrect in some questions 
and all of the incorrect choices in other questions. 

(f) Defects occur randomly over the surface of a semiconduc- 
tor chip. However, only 80% of defects can be found by 
testing. A sample of 40 chips with one defect each is tested. 
Let X denote the number of chips in which the test finds a 
defect. 

(g) Reconsider the situation in part (f). Now, suppose the sample 
of 40 chips consists of chips with 1 and with 0 defects. 

(h) A filling operation attempts to fill detergent packages to the 
advertised weight. Let X denote the number of detergent 
packages that are underfilled. 

(i) Errors in a digital communication channel occur in bursts 
that affect several consecutive bits. Let_X denote the number 
of bits in error in a transmission of 100,000 bits. 

(j) Let X denote the number of surface flaws in a large coil of 
galvanized steel. 

3-76. Let X be a binomial random variable with p = 0.2 

and n = 20. Use the binomial table in Appendix A to deter- 

mine the following probabilities. 


(a) P(X S 3) (b) P(X > 10) 
(c) (X=6) (ad) P6=X= 11) 
3-77. Let X be a binomial random variable with p = 0.1 


and n = 10. Calculate the following probabilities from the 
binomial probability mass function and also from the binomial 
table in Appendix A and compare results. 

(a) P(X = 2) (b) P(X > 8) 

(c) (X=4)  @) PS =X=7) 

3-78. The random variable X has a binomial distribution 
with n = 10 and p = 0.5. Determine the following proba- 
bilities: 

(a) P(X = 5) 
(c) P(X = 9) 


(b) P(X < 2) 
(d) P(3 <X <5) 


3-79. The random variable X has a binomial distribution 
with n = 10 and p = 0.01. Determine the following proba- 
bilities. 

(a) P(X=5) (vb) P(X $2) 

(c) P(X = 9) (d) P(3 =X <5) 

3-80. The random variable X has a binomial distribution with 
n = 10 and p = 0.5. Sketch the probability mass function of X. 
(a) What value of X is most likely? 

(b) What value(s) of X is(are) least likely? 


3-81. Sketch the probability mass function of a binomial 

distribution with n = 10 and p = 0.01 and comment on the 

shape of the distribution. 

(a) What value of X is most likely? 

(b) What value of X is least likely? 

3-82. Determine the cumulative distribution function of a 

binomial random variable with n = 3 and p = 1/2. 

3-83. Determine the cumulative distribution function of a 

binomial random variable with n = 3 and p = 1/4. 

3-84. An electronic product contains 40 integrated circuits. 

The probability that any integrated circuit is defective is 0.01, 

and the integrated circuits are independent. The product oper- 

ates only if there are no defective integrated circuits. What is 

the probability that the product operates? 

3-85. The phone lines to an airline reservation system are 

occupied 40% of the time. Assume that the events that the lines 

are occupied on successive calls are independent. Assume that 

10 calls are placed to the airline. 

(a) What is the probability that for exactly three calls the lines 
are occupied? 

(b) What is the probability that for at least one call the lines 
are not occupied? 

(c) What is the expected number of calls in which the lines 
are all occupied? 


3-86. A multiple-choice test contains 25 questions, each 

with four answers. Assume a student just guesses on each 

question. 

(a) What is the probability that the student answers more than 
20 questions correctly? 

(b) What is the probability the student answers less than five 
questions correctly? 

3-87. A particularly long traffic light on your morning com- 

mute is green 20% of the time that you approach it. Assume 

that each morning represents an independent trial. 

(a) Over five mornings, what is the probability that the light is 
green on exactly one day? 

(b) Over 20 mornings, what is the probability that the light is 
green on exactly four days? 

(c) Over 20 mornings, what is the probability that the light is 
green on more than four days? 


3-88. Samples of rejuvenated mitochondria are mutated 
(defective) in 1% of cases. Suppose 15 samples are studied, 


and they can be considered to be independent for mutation. 
Determine the following probabilities. The binomial table in 
Appendix A can help. 

(a) No samples are mutated. 

(b) At most one sample is mutated. 

(c) More than half the samples are mutated. 


3-89. An article in Information Security Technical Report 
[Malicious Software—Past, Present and Future” (2004, Vol. 9, 
pp. 6-18)] provided the following data on the top ten mali- 
cious software instances for 2002. The clear leader in the num- 
ber of registered incidences for the year 2002 was the Internet 
worm “Klez,” and it is still one of the most widespread threats. 
This virus was first detected on 26 October 2001, and it has 
held the top spot among malicious software for the longest 
period in the history of virology. 


Place Name % Instances 
1 J-Worm.Klez 61.22% 
2 I-Worm.Lentin 20.52% 
3 J-Worm.Tanatos 2.09% 
4 I-Worm.BadtransII 1.31% 
5 Macro. Word97.Thus 1.19% 
6 I-Worm.Hybris 0.60% 
7 I-Worm.Bridex 0.32% 
8 I-Worm.Magistr 0.30% 
9 Win95.CIH 0.27% 

10 I-Worm.Sircam 0.24% 


The 10 most widespread malicious programs for 2002 
(Source—Kaspersky Labs). 


Suppose that 20 malicious software instances are reported. 

Assume that the malicious sources can be assumed to be inde- 

pendent. 

(a) What is the probability that at least one instance is “Klez’’? 

(b) What is the probability that three or more instances are 
“Klez”? 

(c) What are the mean and standard deviation of the number 
of “Klez” instances among the 20 reported? 

3-90. Heart failure is due to either natural occurrences 

(87%) or outside factors (13%). Outside factors are related to 

induced substances or foreign objects. Natural occurrences are 

caused by arterial blockage, disease, and infection. Suppose 

that 20 patients will visit an emergency room with heart failure. 

Assume that causes of heart failure between individuals are 

independent. 

(a) What is the probability that three individuals have condi- 
tions caused by outside factors? 

(b) What is the probability that three or more individuals have 
conditions caused by outside factors? 

(c) What are the mean and standard deviation of the number 
of individuals with conditions caused by outside factors? 
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3-91. A computer system uses passwords that are exactly 

six characters and each character is one of the 26 letters (a—z) 

or 10 integers (0-9). Suppose there are 10,000 users of the 

system with unique passwords. A hacker randomly selects 

(with replacement) one billion passwords from the potential 

set, and a match to a user’s password is called a hit. 

(a) What is the distribution of the number of hits? 

(b) What is the probability of no hits? 

(c) What are the mean and variance of the number of hits? 

3-92. Astatistical process control chart example. Samples 

of 20 parts from a metal punching process are selected every 

hour. Typically, 1% of the parts require rework. Let X denote 

the number of parts in the sample of 20 that require rework. A 

process problem is suspected if X exceeds its mean by more 

than three standard deviations. 

(a) If the percentage of parts that require rework remains at 
1%, what is the probability that X exceeds its mean by 
more than three standard deviations? 

(b) If the rework percentage increases to 4%, what is the 
probability that XY exceeds 1? 

(c) If the rework percentage increases to 4%, what is the 
probability that X exceeds 1 in at least one of the next five 
hours of samples? 


3-93. Because not all airline passengers show up for their 

reserved seat, an airline sells 125 tickets for a flight that holds 

only 120 passengers. The probability that a passenger does not 

show up is 0.10, and the passengers behave independently. 

(a) What is the probability that every passenger who shows 
up can take the flight? 

(b) What is the probability that the flight departs with empty 
seats? 


3-94. This exercise illustrates that poor quality can affect 
schedules and costs. A manufacturing process has 100 customer 
orders to fill. Each order requires one component part that is 
purchased from a supplier. However, typically, 2% of the com- 
ponents are identified as defective, and the components can be 
assumed to be independent. 

(a) If the manufacturer stocks 100 components, what is the 
probability that the 100 orders can be filled without 
reordering components? 

(b) If the manufacturer stocks 102 components, what is the 
probability that the 100 orders can be filled without 
reordering components? 

(c) If the manufacturer stocks 105 components, what is the 
probability that the 100 orders can be filled without 
reordering components? 


3-95. Consider the lengths of stay at a hospital’s emergency 

department in Exercise 3-29. Assume that five persons inde- 

pendently arrive for service. 

(a) What is the probability that the length of stay of exactly 
one person is less than or equal to 4 hours? 

(b) What is the probability that exactly two people wait more 
than 4 hours? 
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(c) What is the probability that at least one person waits more 
than 4 hours? 

3-96. Consider the visits that result in leave without being 

seen (LWBS) at an emergency department in Example 2-8. 

Assume that four persons independently arrive for service at 

Hospital 1. 

(a) What is the probability that exactly one person will LWBS? 

(b) What is the probability, that two or more two people will 
LWBS? 

(c) What is the probability that at least one person will 
LWBS? 

3-97. Assume a Web site changes its content according to 

the distribution in Exercise 3-30. Assume 10 changes are made 

independently. 

(a) What is the probability that the change is made in less than 
4 days in seven of the 10 updates? 


(b) What is the probability that the change is made in less 
than 4 days in two or fewer of the 10 updates? 

(c) What is the probability that at least one change is made in 
less than 4 days? 

(d) What is the expected number of the 10 updates that occur 
in less than 4 days? 


3-98. Consider the endothermic reactions in Exercise 3-28. 

A total of 20 independent reactions are to be conducted. 

(a) What is the probability that exactly 12 reactions result in a 
final temperature less than 272 K? 

(b) What is the probability that at least 19 reactions result in a 
final temperature less than 272 K? 

(c) What is the probability that at least 18 reactions result in a 
final temperature less than 272 K? 

(d) What is the expected number of reactions that result in a 
final temperature of less than 272 K? 


3-7 GEOMETRIC AND NEGATIVE BINOMIAL DISTRIBUTIONS 


Geometric Distribution 


Consider a random experiment that is closely related to the one used in the definition of a 
binomial distribution. Again, assume a series of Bernoulli trials (independent trials with con- 
stant probability p of a success on each trial). However, instead of a fixed number of trials, 
trials are conducted until a success is obtained. Let the random variable X denote the number 
of trials until the first success. In Example 3-5, successive wafers are analyzed until a large 
particle is detected. Then, X is the number of wafers analyzed. In the transmission of bits, X 
might be the number of bits transmitted until an error occurs. 


EXAMPLE 3-20 Digital Channel 
The probability that a bit transmitted through a digital trans- 
mission channel is received in error is 0.1. Assume the trans- 
missions are independent events, and let the random variable 
X denote the number of bits transmitted until the first error. 
Then, P(X = 5) is the probability that the first four bits 
are transmitted correctly and the fifth bit is in error. This event 
can be denoted as {OOOOE}, where O denotes an okay bit. 


Because the trials are independent and the probability of a 
correct transmission is 0.9, 


P(X = 5) = P(OOOOE) = 0.90.1 = 0.066 


Note that there is some probability that XY will equal any inte- 
ger value. Also, if the first trial is a success, X = 1. Therefore, 
the range of X is {1, 2, 3, ... }, that is, all positive integers. 


Geometric 
Distribution 


In aseries of Bernoulli trials (independent trials with constant probability p of a success), 
let the random variable X denote the number of trials until the first success. Then_X is 
a geometric random variable with parameter 0 < p < 1 and 


Ax) = (1 — py 'p 


Key Dee (3-9) 


Examples of the probability mass functions for geometric random variables are shown in 
Fig. 3-9. Note that the height of the line at x is (1 — p) times the height of the line atx — 1. 
That is, the probabilities decrease in a geometric progression. The distribution acquires its 


name from this result. 
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EXAMPLE 3-21 
The probability that a wafer contains a large particle of con- Let X denote the number of samples analyzed until a 
tamination is 0.01. If it is assumed that the wafers are inde- large particle is detected. Then X is a geometric random vari- 


pendent, what is the probability that exactly 125 wafers need able with p = 0.01. The requested probability is 


to be analyzed before a large particle is detected? bi 
P(X = 125) = (0.99)'**0.01 = 0.0029 


The mean of a geometric random variable is 
w= Sl — py =p dig 
k=1 k=1 


where g = p — 1. The right-hand side of the previous equation is recognized to be the partial 
derivative with respect to g of 


where the last equality is obtained from the known sum of a geometric series. Therefore, 


-2/#"|- Pp _p_i 
ogl1-—q] (l-qP p P 

and the mean is derived. To obtain the variance of a geometric random variable, we can first 
derive E(X°) by a similar approach. This can be obtained from partial second derivatives with 
respect to q. Then the formula V(X) = E(X’) — (EXY is applied. The details are a bit more 
work and this is left as a mind-expanding exercise. 
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Mean and 
Variance If X is a geometric random variable with parameter p, 
w=E(X)=1/p and 07 = HX) =(1-p)/p* (3-10) 
EXAMPLE 3-22 


Consider the transmission of bits in Example 3-20. Here, 
p = 0.1. The mean number of transmissions until the first 
error is 1/0.1 = 10. The standard deviation of the number 


Practical Interpretation: The standard deviation here is ap- 
proximately equal to the mean and this occurs when p is small. 
The number of trials until the first success may be much dif- 


of transmissions before the first error is 


ferent from the mean when p is small. 


o = [(1 — 0.1)/0.17]!? = 9.49 


Lack of Memory Property 
A geometric random variable has been defined as the number of trials until the first success. 
However, because the trials are independent, the count of the number of trials until the next 
success can be started at any trial without changing the probability distribution of the random 
variable. For example, in the transmission of bits, if 100 bits are transmitted, the probability 
that the first error, after bit 100, occurs on bit 106 is the probability that the next six outcomes 
are OOOOOE. This probability is (0.9)°(0.1) = 0.059, which is identical to the probability 
that the initial error occurs on bit 6. 

The implication of using a geometric model is that the system presumably will not wear 
out. The probability of an error remains constant for all transmissions. In this sense, the geo- 
metric distribution is said to lack any memory. The lack of memory property will be dis- 


cussed again in the context of an exponential random variable in Chapter 4. 


EXAMPLE 3-23 Lack of Memory 
In Example 3-20, the probability that a bit is transmitted in error 
is equal to 0.1. Suppose 50 bits have been transmitted. The mean 


Negative Binomial Distribution 


number of bits until the next error is 1/0.1 = 10—the same 
result as the mean number of bits until the first error. 


A generalization of a geometric distribution in which the random variable is the number 
of Bernoulli trials required to obtain 7 successes results in the negative binomial distri- 


bution. 


EXAMPLE 3-24 Digital Channel 

As in Example 3-20, suppose the probability that a bit trans- 
mitted through a digital transmission channel is received in er- 
ror is 0.1. Assume the transmissions are independent events, 
and let the random variable X¥ denote the number of bits trans- 
mitted until the fourth error. 

Then, X has a negative binomial distribution with r = 4. 
Probabilities involving X can be found as follows. The P(X = 10) 
is the probability that exactly three errors occur in the first 
nine trials and then trial 10 results in the fourth error. The 
probability that exactly three errors occur in the first nine trials 


is determined from the binomial distribution to be 


(3) (0.1)°(0.9)° 


Because the trials are independent, the probability that exactly 
three errors occur in the first 9 trials and trial 10 results in the 
fourth error is the product of the probabilities of these two 
events, namely, 


(3) o.1%0.9)%0. 7 (J Jonooy 


Negative 
Binomial 
Distribution 


Figure 3-10 Negative 
binomial distributions 
for selected values of the 
parameters r and p. 
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In a series of Bernoulli trials Gndependent trials with constant probability p of a suc- 
cess), let the random variable X denote the number of trials until 7 successes occur. 
Then X is a negative binomial random variable with parameters 0 < p < 1 and 
P= Mo, By cccg BUG! 


nies) =| Je — py 'p x=rr+1,r+2,... (3-11) 


Because at least 7 trials are required to obtain r successes, the range of X is from r to ~. In the 
special case that r = 1, a negative binomial random variable is a geometric random variable. 
Selected negative binomial distributions are illustrated in Fig. 3-10. 

The lack of memory property of a geometric random variable implies the following. Let 
X denote the total number of trials required to obtain r successes. Let X, denote the number of 
trials required to obtain the first success, let X, denote the number of extra trials required to 
obtain the second success, let X3 denote the number of extra trials to obtain the third success, 
and so forth. Then, the total number of trials required to obtain r successes is 
X = X, + X, +++: + X,. Because of the lack of memory property, each of the random vari- 
ables X1, Xz, ..., X,. has a geometric distribution with the same value of p. Consequently, a 
negative binomial random variable can be interpreted as the sum of r geometric random vari- 
ables. This concept is illustrated in Fig. 3-11. 

Recall that a binomial random variable is a count of the number of successes in n 
Bernoulli trials. That is, the number of trials is predetermined, and the number of successes is 
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Figure 3-11 Negative 
binomial random 
variable represented 

as a sum of geometric 
random variables. 


X =X, +X_+X3 


xX, X X3 
e oe e 
1 2 3 4 5 6 7 8 9 10 11 12 
Trials 


e indicates a trial that results in a "success." 


random. A negative binomial random variable is a count of the number of trials required to 
obtain r successes. That is, the number of successes is predetermined, and the number of trials 
is random. In this sense, a negative binomial random variable can be considered the opposite, 


or negative, of a binomial random variable. 
The description of a negative binomial random variable as a sum of geometric random 
variables leads to the following results for the mean and variance. Sums of random variables 


are studied in Chapter 5. 


Mean and 
Variance 


If X is a negative binomial random variable with parameters p and r, 


w= E(X) = r/p 


and = o% = V(X) =r(1 — p)/p’ (3-12) 


EXAMPLE 3-25 Web Servers 
A Web site contains three identical computer servers. Only 
one is used to operate the site, and the other two are spares that 
can be activated in case the primary system fails. The proba- 
bility of a failure in the primary computer (or any activated 
spare system) from a request for service is 0.0005. Assuming 
that each request represents an independent trial, what is the 
mean number of requests until failure of all three servers? 
Let X denote the number of requests until all three servers 
fail, and let X, X2, and X3 denote the number of requests be- 
fore a failure of the first, second, and third servers used, 
respectively. Now, X¥ = X, + X, + X3. Also, the requests are 
assumed to comprise independent trials with constant proba- 
bility of failure p = 0.0005. Furthermore, a spare server is not 
affected by the number of requests before it is activated. 
Therefore, X has a negative binomial distribution with 
p = 0.0005 and r = 3. Consequently, 


E(X) = 3/0.0005 = 6000 requests 


EXERCISES FOR SECTION 3-7 


What is the probability that all three servers fail within five 
requests? The probability is P(X = 5) and because X denotes 
the number of requests to the third failure P(X = 2) = 0. 
Therefore, 


P(X <5) = P(X = 3) + P(X = 4) + P(X =5) 


Il 


3 
0.00057 + () 0.00057(0.9995) 


4 
+ ) 0.0005? (0.9995) 


Il 


1.25 x 107!9 + 3.75 x 107!9 + 7.49 x 10719 
1.249 x 107° 


ll 


Practical Interpretation: Because the trials are independent the 
mean number of trials to the third failure is three times as large 
as the number of trials until the first failure. 


3-99. Suppose the random variable _X has a geometric distribu- 
tion with p = 0.5. Determine the following probabilities: 


(a) P(X¥=1) (b) P(X =4) 
(c) P(X=8) (dd) P(X <2) 
(ec) P(X > 2) 


3-100. Suppose the random variable X has a geometric distri- 
bution with a mean of 2.5. Determine the following probabilities: 
(a) P(X= 1) (b) P(X = 4) 


() (X=5) @) P(X <3) 

(e) P(X > 3) 

3-101. Consider a sequence of independent Bernoulli trials 

with p = 0.2. 

(a) What is the expected number of trials to obtain the first 
success? 


(b) After the eighth success occurs, what is the expected num- 
ber of trials to obtain the ninth success? 
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3-102. Suppose that X is a negative binomial random vari- 
able with p = 0.2 and r = 4. Determine the following: 


(a) E(X) (b) P(X = 20) 
(c) P(X = 19) (d) P(X = 21) 
(e) The most likely value for X 


3-103. The probability of a successful optical alignment in 

the assembly of an optical data storage product is 0.8. Assume 

the trials are independent. 

(a) What is the probability that the first successful alignment 
requires exactly four trials? 

(b) What is the probability that the first successful alignment 
requires at most four trials? 

(c) What is the probability that the first successful alignment 
requires at least four trials? 

3-104. Ina clinical study, volunteers are tested for a gene 

that has been found to increase the risk for a disease. The 

probability that a person carries the gene is 0.1. 

(a) What is the probability four or more people will have to be 
tested before two with the gene are detected? 

(b) How many people are expected to be tested before two 
with the gene are detected? 


3-105. Assume that each of your calls to a popular radio 

station has a probability of 0.02 of connecting, that is, of not 

obtaining a busy signal. Assume that your calls are independent. 

(a) What is the probability that your first call that connects is 
your tenth call? 

(b) What is the probability that it requires more than five calls 
for you to connect? 

(c) What is the mean number of calls needed to connect? 


3-106. A player of a video game is confronted with a series 

of opponents and has an 80% probability of defeating each one. 

Success with any opponent is independent of previous encoun- 

ters. The player continues to contest opponents until defeated. 

(a) What is the probability mass function of the number of 
opponents contested in a game? 

(b) What is the probability that a player defeats at least two 
opponents in a game? 

(c) What is the expected number of opponents contested in a 
game? 

(d) What is the probability that a player contests four or more 
opponents in a game? 

(e) What is the expected number of game plays until a player 
contests four or more opponents? 


3-107. Heart failure is due to either natural occurrences 

(87%) or outside factors (13%). Outside factors are related to 

induced substances or foreign objects. Natural occurrences are 

caused by arterial blockage, disease, and infection. Assume that 
causes of heart failure between individuals are independent. 

(a) What is the probability that the first patient with heart 
failure who enters the emergency room has the condition 
due to outside factors? 

(b) What is the probability that the third patient with heart 
failure who enters the emergency room is the first one due 
to outside factors? 


(c) What is the mean number of heart failure patients with the 
condition due to natural causes who enter the emergency 
room before the first patient with heart failure from out- 
side factors? 


3-108. A computer system uses passwords constructed 
from the 26 letters (a—z) or 10 integers (0-9). Suppose there 
are 10,000 users of the system with unique passwords. A 
hacker randomly selects (with replacement) passwords from 
the potential set. 

(a) Suppose there are 9900 users with unique six-character 
passwords and the hacker randomly selects six-character 
passwords. What is the mean and standard deviation of 
the number of attempts before the hacker selects a user 
password? 

(b) Suppose there are 100 users with unique three-character 
passwords and the hacker randomly selects three-character 
passwords. What is the mean and standard deviation of 
the number of attempts before the hacker selects a user 
password? 

(c) Comment on the security differences between six- and 
three-character passwords. 


3-109. A trading company has eight computers that it uses to 
trade on the New York Stock Exchange (NYSE). The probabil- 
ity of a computer failing in a day is 0.005, and the computers 
fail independently. Computers are repaired in the evening and 
each day is an independent trial. 

(a) What is the probability that all eight computers fail in a day? 


(b) What is the mean number of days until a specific com- 
puter fails? 

(c) What is the mean number of days until all eight computers 
fail in the same day? 


3-110. Assume that 20 parts are checked each hour and that 

X denotes the number of parts in the sample of 20 that require 

rework. Parts are assumed to be independent with respect to 

rework. 

(a) If the percentage of parts that require rework remains at 
1%, what is the probability that hour 10 is the first sample 
at which X exceeds 1? 

(b) If the rework percentage increases to 4%, what is the prob- 
ability that hour 10 is the first sample at which XY exceeds 1? 

(c) If the rework percentage increases to 4%, what is the 
expected number of hours until XY exceeds 1? 


3-111. A fault-tolerant system that processes transactions 
for a financial services firm uses three separate computers. If 
the operating computer fails, one of the two spares can be im- 
mediately switched online. After the second computer fails, 
the last computer can be immediately switched online. 
Assume that the probability of a failure during any transac- 
tion is 107° and that the transactions can be considered to be 
independent events. 

(a) What is the mean number of transactions before all com- 

puters have failed? 


(b) What is the variance of the number of transactions before 
all computers have failed? 
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3-112. Inthe process of meiosis, a single parent diploid cell 

goes through eight different phases. However, only 60% of the 

processes pass the first six phases and only 40% pass all eight. 

Assume the results from each phase are independent. 

(a) If the probability of a successful pass of each one of the 
first six phases is constant, what is the probability of a suc- 
cessful pass of a single one of these phases? 

(b) If the probability of a successful pass of each one of the 
last two phases is constant, what is the probability of a 
successful pass of a single one of these phases? 


3-113. Show that the probability density function of a nega- 
tive binomial random variable equals the probability density 
function of a geometric random variable when r = 1. Show 
that the formulas for the mean and variance of a negative 
binomial random variable equal the corresponding results for 
a geometric random variable when r = 1. 


3-114. Consider the endothermic reactions in Exercise 3-28. 

Assume independent reactions are conducted. 

(a) What is the probability that the first reaction to result in a 
final temperature less than 272 K is the tenth reaction? 

(b) What is the mean number of reactions until the first final 
temperature is less than 272 K? 

(c) What is the probability that the first reaction to result in a 
final temperature less than 272 K occurs within three or 
fewer reactions? 


(d) What is the mean number of reactions until two reactions 
result in final temperatures less than 272 K? 


3-115. A Web site randomly selects among 10 products to 
discount each day. The color printer of interest to you is dis- 
counted today. 

(a) What is the expected number of days until this product is 
again discounted? 

(b) What is the probability that this product is first discounted 
again exactly 10 days from now? 

(c) If the product is not discounted for the next five days, what 
is the probability that it is first discounted again 15 days 
from now? 

(d) What is the probability that this product is first discounted 
again within three or fewer days? 


3-116. Consider the visits that result in leave without being 

seen (LWBS) at an emergency department in Example 2-8. 

Assume that people independently arrive for service at Hospital 1. 

(a) What is the probability that the fifth visit is the first one to 
LWBS? 

(b) What is the probability that either the fifth or sixth visit is 
the first one to LWBS? 

(c) What is the probability that the first visit to LWBS is 
among the first four visits? 

(d) What is the expected number of visits until the third 
LWBS occurs? 


3-8 HYPERGEOMETRIC DISTRIBUTION 


In Example 3-8, a day’s production of 850 manufactured parts contains 50 parts that do not 
conform to customer requirements. Two parts are selected at random, without replacement 
from the day’s production. Let A and B denote the events that the first and second parts are 
nonconforming, respectively. In Chapter 2, we found P(B|A) = 49/849 and P(A) = 50/850. 
Consequently, knowledge that the first part is nonconforming suggests that it is less likely that 
the second part selected is nonconforming. 

Let X equal the number of nonconforming parts in the sample. Then, 


P(X = 0) = P(both parts conform) = (800/850)(799/849) = 0.886 


P(X = 1) = P(first part selected conforms and the second part selected 
does not, or the first part selected does not and the second part 
selected conforms) 


= (800/850)(50/849) + (50/850)(800/849) = 0.111 
P(X = 2) = P(both parts do not conform) = (50/850)(49/849) = 0.003 


This experiment is fundamentally different from the examples based on the binomial dis- 
tribution. In this experiment, the trials are not independent. Note that, in the unusual case that 
each unit selected is replaced before the next selection, the trials are independent and there is 
a constant probability of a nonconforming part on each trial. Then, the number of noncon- 
forming parts in the sample is a binomial random variable. 


Hypergeometric 
Distribution 
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But as in this example, samples are often selected without replacement. Although 
probabilities can be determined by the reasoning used in the example above, a general for- 
mula for computing probabilities when samples are selected without replacement is quite 
useful. The counting rules presented in Chapter 2 can be used to justify the formula given 
below. 


A set of N objects contains 


K objects classified as successes 

N — K objects classified as failures 
A sample of size n objects is selected randomly (without replacement) from the NV 
objects, where K = N andn = N. 

Let the random variable XY denote the number of successes in the sample. Then 
X is a hypergeometric random variable and 


Die van 
( 


The expression min{X, } is used in the definition of the range of X because the maximum 
number of successes that can occur in the sample is the smaller of the sample size, n, 
and the number of successes available, K. Also, ifn + K > N, at least n + K — N suc- 
cesses must occur in the sample. Selected hypergeometric distributions are illustrated in 
Fig. 3-12. 


x = max{0,n + K — N}to min{K, n} (3-13) 
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Figure 3-12 Hypergeometric distributions for 
selected values of parameters N, K, and n. 
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EXAMPLE 3-26 Sampling without Replacement 
The example at the start of this section can be reanalyzed by 
using the general expression in the definition of a hypergeo- 
metric random variable. That is, 


(PV) 
0/\ 2 / _ 319,600 


Gy 360,825 
2 


EXAMPLE 3-27 Parts from Suppliers 
A batch of parts contains 100 parts from a local supplier of 
tubing and 200 parts from a supplier of tubing in the next 
state. If four parts are selected randomly and without re- 
placement, what is the probability they are all from the local 
supplier? 

Let X equal the number of parts in the sample from the 
local supplier. Then, XY has a hypergeometric distribution and 
the requested probability is P(X = 4). Consequently, 


vie 
by 


What is the probability that two or more parts in the sample 
are from the local supplier? 


P(X = 0) = = 0.886 


P(X = 4) = = 0.0119 


(\(e) 

1 1 / _ 40,000 
850 360,825 
i 

ees, 

2/0 1,225 
e& ~ 360,825 
2 


= 0.111 


PX=1)= 


= 0.003 


P(X = 2) = 


vy LORD) CNT) CE) 
GG) 


= 0.298 + 0.098 + 0.0119 = 0.408 


What is the probability that at least one part in the sample is 


from the local supplier? 
( aloe, 
0 4 
=1 


ey 
4 
Practical Interpretation: Sampling without replacement is fre- 


quently used for inspection and the hypergeometric distribution 
simplifies the calculations. 


P(X = 1) = 1—- P(X = 0) = 0.804 


The mean and variance of a hypergeometric random variable can be determined from the 
trials that comprise the experiment. However, the trials are not independent, and so the cal- 
culations are more difficult than for a binomial distribution. The results are stated as follows. 


Mean and 
Variance If X is a hypergeometric random variable with parameters N, K, andn, then 
2 N-n 
w = E(X) = np and o = V(X) = np(1 — p) =a (3-14) 
where p = K/N. 
Here p is interpreted as the proportion of successes in the set of N objects. 
EXAMPLE 3-28 


In the previous example, the sample size is four. The random 
variable X is the number of parts in the sample from the local 
supplier. Then, p = 100/300 = 1/3. Therefore, 


E(X) = 4(100/300) = 1.33 


and 


V(X) = 4(1/3)(2/3)[(300 — 4)/299] = 0.88 


Figure 3-13 
Comparison of hyper- 
geometric and binomial 
distributions. 
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Population 
Correction 

Factor 
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0.3 + + 
0.2 
fo) + + 
0.1 
+ + 
0.0 
1 2 3 4 


e Hypergeometric N = 50,n=5, K=25 
+ Binomial n = 5, p =0.5 


Hypergeometric probability | 0.025 0.149 0.326 0.326 0.149 0.025 
Binomial probability 0.031 0.156 0.312 0.312 0.156 0.031 


For a hypergeometric random variable, E(X) is similar to the mean of a binomial random 
variable. Also, V(X) differs from the result for a binomial random variable only by the term 
shown below. 


The term in the variance of a hypergeometric random variable 
sia (3-15) 
el : 


is called the finite population correction factor. 


Sampling with replacement is equivalent to sampling from an infinite set because the propor- 
tion of success remains constant for every trial in the experiment. As mentioned previously, if 
sampling were done with replacement, X would be a binomial random variable and its vari- 
ance would be np(1 — p). Consequently, the finite population correction represents the cor- 
rection to the binomial variance that results because the sampling is without replacement from 
the finite set of size NV. 

If 1 is small relative to N, the correction is small and the hypergeometric distribution is 
similar to the binomial. In this case, a binomial distribution can effectively approximate the 
distribution of the number of units of a specified type in the sample. A case is illustrated in 
Fig. 3-13. 


EXAMPLE 3-29 Customer Sample 


A listing of customer accounts at a large corporation contains The sampling is without replacement. However, be- 
1000 customers. Of these, 700 have purchased at least one of cause the sample size of 50 is small relative to the number of 
the corporation’s products in the last three months. To evaluate customer accounts, 1000, the probability of selecting a cus- 
a new product design, 50 customers are sampled at random tomer who has purchased from the corporation in the last 
from the corporate listing. What is the probability that more three months remains approximately constant as the cus- 
than 45 of the sampled customers have purchased from the tomers are chosen. 


corporation in the last three months? 
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For example, let A denote the event that the first cus- 
tomer selected has purchased from the corporation in the last 
three months, and let B denote the event that the second 
customer selected has purchased from the corporation in 
the last three months. Then, P(A) = 700/1000 = 0.7 and 
P(B|A) =699/999 = 0.6997. That is, the trials are approxi- 
mately independent. 

Let X denote the number of customers in the sample who 
have purchased from the corporation in the last three months. 
Then, X is a hypergeometric random variable with N = 
1,000, n = 50, and K = 700. Consequently, p = K/N = 0.7. 


EXERCISES FOR SECTION 3-8 


The requested probability is P(X > 45). Because the sample 
size is small relative to the batch size, the distribution of X can be 
approximated as binomial with n = 50 and p = 0.7. Using the 
binomial approximation to the distribution of X results in 


50 

50 

P(X > 45) = > (Jor — 0.7) = 0.00017 
x=46 


The probability from the hypergeometric distribution is 
0.00013, but this requires computer software. The result 
agrees well with the binomial approximation. 


3-117. Suppose X has a hypergeometric distribution with 
N= 100, n = 4, and K = 20. Determine the following: 

(a) P(X = 1) 

(b) P(X = 6) 

(c) P(X = 4) 

(d) Determine the mean and variance of X. 

3-118. Suppose X has a hypergeometric distribution with 
N= 20,n = 4, and K = 4. Determine the following: 

(a) P(X = 1) 

(b) P(X = 4) 

(c) P(X = 2) 

(d) Determine the mean and variance of X. 

3-119. Suppose X has a hypergeometric distribution with 
N = 10,n = 3, and K = 4. Sketch the probability mass func- 
tion of X. Determine the cumulative distribution function for_X. 


3-120. A batch contains 36 bacteria cells and 12 of the cells 

are not capable of cellular replication. Suppose you examine 

three bacteria cells selected at random, without replacement. 

(a) What is the probability mass function of the number of 
cells in the sample that can replicate? 

(b) What are the mean and variance of the number of cells in 
the sample that can replicate? 

(c) What is the probability that at least one of the selected 
cells cannot replicate? 


3-121. A company employs 800 men under the age of 55. 
Suppose that 30% carry a marker on the male chromosome 
that indicates an increased risk for high blood pressure. 

(a) If 10 men in the company are tested for the marker in this 
chromosome, what is the probability that exactly one man 
has the marker? 

(b) If 10 men in the company are tested for the marker in this 
chromosome, what is the probability that more than one 
has the marker? 


3-122. Printed circuit cards are placed in a functional test 

after being populated with semiconductor chips. A lot contains 

140 cards, and 20 are selected without replacement for func- 

tional testing. 

(a) If 20 cards are defective, what is the probability that at 
least 1 defective card is in the sample? 


(b) If5 cards are defective, what is the probability that at least 
one defective card appears in the sample? 

3-123. The analysis of results from a leaf transmutation 

experiment (turning a leaf into a petal) is summarized by type 

of transformation completed: 


Total Textural 
Transformation 
Yes No 
Total Color Yes 243 26 
Transformation No 13 18 


A naturalist randomly selects three leaves from this set, without 
replacement. Determine the following probabilities. 

(a) Exactly one has undergone both types of transformations. 
(b) At least one has undergone both transformations. 

(c) Exactly one has undergone one but not both transformations. 
(d) At least one has undergone at least one transformation. 


3-124. Astate runs a lottery in which six numbers are randomly 

selected from 40, without replacement. A player chooses six 

numbers before the state’s sample is selected. 

(a) What is the probability that the six numbers chosen by a 
player match all six numbers in the state’s sample? 

(b) What is the probability that five of the six numbers chosen 
by a player appear in the state’s sample? 

(c) What is the probability that four of the six numbers chosen 
by a player appear in the state’s sample? 

(d) If a player enters one lottery each week, what is the ex- 
pected number of weeks until a player matches all six 
numbers in the state’s sample? 


3-125. Magnetic tape is slit into half-inch widths that are 
wound into cartridges. A slitter assembly contains 48 blades. 
Five blades are selected at random and evaluated each day for 
sharpness. If any dull blade is found, the assembly is replaced 
with a newly sharpened set of blades. 

(a) If 10 of the blades in an assembly are dull, what is the 
probability that the assembly is replaced the first day it is 
evaluated? 

(b) If 10 of the blades in an assembly are dull, what is the 
probability that the assembly is not replaced until the third 


day of evaluation? [Hint: Assume the daily decisions are 
independent, and use the geometric distribution. ] 

(c) Suppose on the first day of evaluation, two of the blades 
are dull; on the second day of evaluation, six are dull; and 
on the third day of evaluation, ten are dull. What is the 
probability that the assembly is not replaced until the 
third day of evaluation? [Hint: Assume the daily deci- 
sions are independent. However, the probability of re- 
placement changes every day.] 


3-126. 

(a) Calculate the finite population corrections for Exercises 
3-117 and 3-118. For which exercise should the binomial 
approximation to the distribution of X be better? 

(b) For Exercise 3-117, calculate P(X = 1) and P(X = 4) 
assuming that _X has a binomial distribution and compare 
these results to results derived from the hypergeometric 
distribution. 

(c) For Exercise 3-118, calculate P(X = 1) and P(X = 4) 
assuming that X has a binomial distribution and compare 
these results to the results derived from the hypergeometric 
distribution. 


3-9 POISSON DISTRIBUTION 


3-9 POISSON DISTRIBUTION 97 


(d) Use the binomial approximation to the hypergeometric dis- 
tribution to approximate the probabilities in Exercise 3-122. 
What is the finite population correction in this exercise? 


3-127. Consider the visits that result in leave without being 

seen (LWBS) at an emergency department in Example 2-8. 

Assume that four visits that result in LWBS are to be randomly 

selected (without replacement) for a follow-up interview. 

(a) What is the probability that all selected visits are from 
Hospital 4? 

(b) What is the probability that no selected visits are from 
Hospital 4? 

(c) What is the probability that all selected visits are from the 
same hospital? 


3-128. Consider the non-failed wells in Exercises 3-31. 

Assume that four wells are selected randomly (without re- 

placement) for inspection. 

(a) What is the probability that exactly two are selected from 
the Loch Raven Schist? 

(b) What is the probability that one or more is selected from 
the Loch Raven Schist? 

(c) What is the expected number selected from the Loch 
Raven Schist? 


A widely-used distribution emerges as the number of trials in a binomial experiment in- 
creases to infinity while the mean of the distribution remains constant. Consider the following 


example. 


EXAMPLE 3-30 

Consider the transmission of n bits over a digital communica- 
tion channel. Let the random variable X equal the number of 
bits in error. When the probability that a bit is in error is con- 
stant and the transmissions are independent, X has a binomial 
distribution. Let p denote the probability that a bit is in error. 
Let A = pn. Then, E(x) = pn = A and 


mena 2 


Now, suppose that the number of bits transmitted increases 
and the probability of an error decreases exactly enough that 
pn remains equal to a constant. That is, m increases and p 


decreases accordingly, such that E(X) = A remains constant. 
Then, with some work, it can be shown that 


(“\F4 O-4)'s1 0-40 


so that 


. e *A* 
lim,_,.0.P(X = x) = > 
x! 


x = 0,1, 2,... 


Also, because the number of bits transmitted tends to infinity, 
the number of errors can equal any nonnegative integer. 
Therefore, the range of X is the integers from zero to infinity. 


The distribution obtained as the limit in the previous example is more useful than the deri- 
vation implies. The following example illustrates the broader applicability. 


EXAMPLE 3-31 Wire Flaws 

Flaws occur at random along the length of a thin copper wire. 
Let X denote the random variable that counts the number of 
flaws in a length of Z millimeters of wire and suppose that the 
average number of flaws in Z millimeters is A. 


The probability distribution of X can be found by reasoning 
in a manner similar to the previous example. Partition the length 
of wire into n subintervals of small length, say, 1 micrometer 
each. If the subinterval chosen is small enough, the probability 
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that more than one flaw occurs in the subinterval is negligible. we obtain 

Furthermore, we can interpret the assumption that flaws occur 

at random to imply that every subinterval has the same proba- p=h/n 
bility of containing a flaw, say, p. Finally, if we assume that the 


probability that a subinterval contains a flaw is independent of 


That is, the probability that a subinterval contains a flaw is 


other subintervals, we can model the distribution of X as ap- A/n. With small enough subintervals, n is very large and p is 


proximately a binomial random variable. Because 


very small. Therefore, the distribution of X is obtained as in the 
previous example. 


E(X) = A = np 


Poisson 
Distribution 


Example 3-31 can be generalized to include a broad array of random experiments. The in- 
terval that was partitioned was a length of wire. However, the same reasoning can be applied 
to any interval, including an interval of time, an area, or a volume. For example, counts of 
(1) particles of contamination in semiconductor manufacturing, (2) flaws in rolls of textiles, 
(3) calls to a telephone exchange, (4) power outages, and (5) atomic particles emitted from a 
specimen have all been successfully modeled by the probability mass function in the following 
definition. 

In general, consider an interval T of real numbers partitioned into subintervals of small 
length At and assume that as Af tends to zero, 


(1) _ the probability of more than one event in a subinterval tends to zero, 
(2) the probability of one event in a subinterval tends to A A7/T, 


(3) the event in each subinterval is independent of other subintervals. 


A random experiment with these properties is called a Poisson process. 

These assumptions imply that the subintervals can be thought of as approximate inde- 
pendent Bernoulli trials with success probability p = AAt/T and the number of trials equal to 
n = T/At. Here, pn = A, and as At tends to zero, n tends to infinity, so the similarity to the 
limit in Example 3-30 appears. This leads to the following definition. 


The random variable X that equals the number of events in a Poisson process is 
a Poisson random variable with parameter 0 < A, and the probability mass 
function of X is 


= Ay\ 38 
e “A 
I(x) = x =0,1,2,... (3-16) 
x! 
The sum of the probabilities is one because 
(os) —Ayk oo k 
yr a 
mo fh ok! 


and the summation on the right-hand side of the previous equation is recognized to be Taylor’s 
expansion of e* evaluated at A. Therefore, the summation equals e* and the right-hand side 
equals e *e* = 1. 
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Historically, the term process has been used to suggest the observation of a system over 
time. In our example with the copper wire, we showed that the Poisson distribution could also 
apply to intervals such as lengths. Figure 3-14 provides graphs of selected Poisson distributions. 

It is important to use consistent units in the calculation of probabilities, means, and variances 
involving Poisson random variables. The following example illustrates unit conversions. For 
example, if the 


average number of flaws per millimeter of wire is 3.4, then the 
average number of flaws in 10 millimeters of wire is 34, and the 
average number of flaws in 100 millimeters of wire is 340. 


If a Poisson random variable represents the number of events in some interval, the mean of the 
random variable must equal the expected number of events in the same length of interval. 
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Figure 3-14 Poisson distributions for selected values of the parameters. 
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EXAMPLE 3-32 Calculations for Wire Flaws 
For the case of the thin copper wire, suppose that the number 
of flaws follows a Poisson distribution with a mean of 2.3 
flaws per millimeter. Determine the probability of exactly 
two flaws in 1 millimeter of wire. 

Let _X denote the number of flaws in | millimeter of wire. 
Then, E(X) = 2.3 flaws and 


ae 
2! 


P(X =2)= = 0.265 

Determine the probability of 10 flaws in 5 millimeters of 
wire. Let _X denote the number of flaws in 5 millimeters of 
wire. Then, X has a Poisson distribution with 


E(X) = 5mm X 2.3 flaws/mm = 11.5 flaws 


EXAMPLE 3-33 CDs 
Contamination is a problem in the manufacture of optical storage 
disks (CDs). The number of particles of contamination that 
occur on an optical disk has a Poisson distribution, and the aver- 
age number of particles per centimeter squared of media surface 
is 0.1. The area of a disk under study is 100 squared centimeters. 
Find the probability that 12 particles occur in the area of a disk 
under study. 

Let X denote the number of particles in the area of a disk 
under study. Because the mean number of particles is 0.1 par- 
ticles per cm’, 


E(X) = 100 cm? X 0.1 particles/em? = 10 particles 
Therefore, 


e 9192 
12! 


P(X = 12) = = 0.095 


Therefore, 


11.5!° 
10! 


P(X = 10) =e 118 = 0.113 


Determine the probability of at least one flaw in 2 millime- 
ters of wire. Let XY denote the number of flaws in 2 millimeters 
of wire. Then, _X has a Poisson distribution with 


E(X) = 2mm X 2.3 flaws/mm = 4.6 flaws 
Therefore, 


P(X=1)=1 


P(X = 0) = 1 — e ** = 0.9899 
Practical Interpretation: Notice that when a probability 

was requested for 2 mm of wire 2A was adjusted to 4.6, the 

mean number of flaws in 2 mm. With such adjustments, proba- 


bilities can be calculated for intervals of any size. 


The probability that zero particles occur in the area of the 
disk under study is 


P(X = 0) =e = 4.54 x 10> 


Determine the probability that 12 or fewer particles occur 
in the area of the disk under study. The probability is 


P(X = 12) = P(X = 0) + P(X = 1) +--+ + P(X = 12) 
12 e 910! 


= Ol! 


Because this sum is tedious to compute, many computer pro- 
grams calculate cumulative Poisson probabilities. From one 
such program, P(X = 12) = 0.792. 


The mean of a Poisson random variable is 


where the summation can start atk = 1 because the k = 0 term is zero. If a change of variable 
j =k — 1 is used, the summation on the right-hand side of the previous equation is recog- 
nized to be the sum of the probabilities of a Poisson random variable and this equals one. 
Therefore, the previous equation simplifies to y = A and the mean is derived. 

To obtain the variance of a Poisson random variable we can start with E(X7), and this 


equals 
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Write k = (k — 1) + 1 to obtain 


The summation in the first term on the right-hand side of the previous equation is recog- 
nized to be the mean of X and this equals A so that the first term is A”. The summation in the 
second term on the right-hand side is recognized to be the sum of the probabilities and this 
equals one. Therefore, the previous equation simplifies to E(X =) = \* + X. Because the 
V(X) = E(X’) — (EXY, we have 


VX)=M+A-N=A 


and the variance is derived. 


Mean and 


Variance If X is a Poisson random variable with parameter A, then 


ee = ean Ose —2) XX (3-17) 


The mean and variance of a Poisson random variable are equal. For example, if particle counts 
follow a Poisson distribution with a mean of 25 particles per square centimeter, the variance 
is also 25 and the standard deviation of the counts is five per square centimeter. Consequently, 
information on the variability is very easily obtained. Conversely, if the variance of count data 
is much greater than the mean of the same data, the Poisson distribution is not a good model 
for the distribution of the random variable. 


EXERCISES FOR SECTION 3-9 


3-129. Suppose X has a Poisson distribution with a mean 
of 4. Determine the following probabilities: 


(d) What is the probability that there are exactly five calls in 
30 minutes? 


(a) PX =0) (b) PX = 2) 3-133. Astronomers treat the number of stars in a given vol- 
(c) PX=4) (d) PX=8) ume of space as a Poisson random variable. The density in the 
3-130. Suppose X has a Poisson distribution with a mean Milky Way Galaxy in the vicinity of our solar system is one 


of 0.4. Determine the following probabilities: star per 16 cubic light-years. 


(a) P(X =0) (b) P(X $2) (a) What is the probability of two or more stars in 16 cubic 
(c) P(X =4) (d) P(X = 8) light-years? 
3-131. Suppose that the number of customers who enter (b) How many cubic light-years of space must be studied so 


a bank in an hour is a Poisson random variable, and suppose that that the probability of one or more stars exceeds 0.95? 


P(X = 0) = 0.05. Determine the mean and variance of X. 


3-132. The number of telephone calls that arrive at a phone 

exchange is often modeled as a Poisson random variable. 

Assume that on the average there are 10 calls per hour. 

(a) What is the probability that there are exactly five calls in 
one hour? 

(b) What is the probability that there are three or fewer calls 
in one hour? 

(c) What is the probability that there are exactly 15 calls in 
two hours? 


3-134. Data from www.centralhudsonlabs determined the 
mean number of insect fragments in 225-gram chocolate 
bars was 14.4, but three brands had insect contamination 
more than twice the average. See the U.S. Food and Drug 
Administration—Center for Food Safety and Applied Nutrition 
for Defect Action Levels for food products. Assume the num- 
ber of fragments (contaminants) follows a Poisson distribution. 
(a) If you consume a 225-gram bar from a brand at the mean 
contamination level, what is the probability of no insect 
contaminants? 
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(b) Suppose you consume a bar that is one-fifth the size tested 
(45 grams) from a brand at the mean contamination level. 
What is the probability of no insect contaminants? 

(c) If you consume seven 28.35-gram (one-ounce) bars this 
week from a brand at the mean contamination level, what 
is the probability that you consume one or more insect 
fragments in more than one bar? 

(d) Is the probability of contamination more than twice the 
mean of 14.4 unusual, or can it be considered typical vari- 
ation? Explain. 

3-135. In 1898 L. J. Bortkiewicz published a book entitled 

The Law of Small Numbers. He used data collected over 

20 years to show that the number of soldiers killed by horse 

kicks each year in each corps in the Prussian cavalry followed 

a Poisson distribution with a mean of 0.61. 

(a) What is the probability of more than one death in a corps 
in a year? 

(b) What is the probability of no deaths in a corps over five 
years? 

3-136. The number of flaws in bolts of cloth in textile man- 

ufacturing is assumed to be Poisson distributed with a mean of 

0.1 flaw per square meter. 

(a) What is the probability that there are two flaws in 1 square 
meter of cloth? 

(b) What is the probability that there is one flaw in 10 square 
meters of cloth? 

(c) What is the probability that there are no flaws in 20 square 
meters of cloth? 

(d) What is the probability that there are at least two flaws in 
10 square meters of cloth? 

3-137. When a computer disk manufacturer tests a disk, it 

writes to the disk and then tests it using a certifier. The certi- 

fier counts the number of missing pulses or errors. The number 
of errors on a test area on a disk has a Poisson distribution with 

A = 0.2. 

(a) What is the expected number of errors per test area? 

(b) What percentage of test areas have two or fewer errors? 

3-138. The number of cracks ina section of interstate highway 

that are significant enough to require repair is assumed to follow 

a Poisson distribution with a mean of two cracks per mile. 

(a) What is the probability that there are no cracks that require 
repair in 5 miles of highway? 

(b) What is the probability that at least one crack requires 
repair in 1/2 mile of highway? 

(c) If the number of cracks is related to the vehicle load on the 
highway and some sections of the highway have a heavy 
load of vehicles whereas other sections carry a light load, 
how do you feel about the assumption of a Poisson distri- 
bution for the number of cracks that require repair? 

3-139. The number of surface flaws in plastic panels used 

in the interior of automobiles has a Poisson distribution with 

a mean of 0.05 flaw per square foot of plastic panel. Assume 

an automobile interior contains 10 square feet of plastic 

panel. 


(a) What is the probability that there are no surface flaws in 
an auto’s interior? 

(b) If 10 cars are sold to a rental company, what is the proba- 
bility that none of the 10 cars has any surface flaws? 

(c) If 10 cars are sold to a rental company, what is the proba- 
bility that at most one car has any surface flaws? 


3-140. The number of failures of a testing instrument from 

contamination particles on the product is a Poisson random 

variable with a mean of 0.02 failure per hour. 

(a) What is the probability that the instrument does not fail in 
an eight-hour shift? 

(b) What is the probability of at least one failure in a 24-hour 
day? 

3-141. The number of content changes to a Web site follows 

a Poisson distribution with a mean of 0.25 per day. 

(a) What is the probability of two or more changes in a day? 

(b) What is the probability of no content changes in five days? 

(c) What is the probability of two or fewer changes in five 
days? 

3-142. The number of views of a page on a Web site follows 

a Poisson distribution with a mean of 1.5 per minute. 

(a) What is the probability of no views in a minute? 

(b) What is the probability of two or fewer views in 10 minutes? 

(c) Does the answer to the previous part depend on whether 
the 10-minute period is an uninterrupted interval? Explain. 


Supplemental Exercises 


3-143. Let the random variable XY be equally likely to as- 
sume any of the values 1/8, 1/4, or 3/8. Determine the mean 
and variance of X. 


3-144. Let X denote the number of bits received in error in 
a digital communication channel, and assume that X is a binomial 
random variable with p = 0.001. If 1000 bits are transmitted, 
determine the following: 


(a) P(X =1) (b) P(X =1) 
(c) P(X = 2) (d) mean and variance of X 
3-145. Batches that consist of 50 coil springs from a produc- 


tion process are checked for conformance to customer require- 

ments. The mean number of nonconforming coil springs in a 

batch is five. Assume that the number of nonconforming springs 

in a batch, denoted as_X, is a binomial random variable. 

(a) What are n and p? 

(b) What is P(X = 2)? 

(c) What is PY = 49)? 

3-146. An automated egg carton loader has a 1% probability 

of cracking an egg, and a customer will complain if more than 

one egg per dozen is cracked. Assume each egg load is an inde- 

pendent event. 

(a) What is the distribution of cracked eggs per dozen? Include 
parameter values. 

(b) What are the probability that a carton of a dozen eggs results 
in a complaint? 


(c) What are the mean and standard deviation of the number 
of cracked eggs in a carton of one dozen? 


3-147. A total of 12 cells are replicated. Freshly synthesized 
DNA cannot be replicated again until mitosis is completed. 
Two control mechanisms have been identified—one positive 
and one negative—that are used with equal probability. 
Assume that each cell independently uses a control mecha- 
nism. Determine the following probabilities. 

(a) All cells use a positive control mechanism. 

(b) Exactly half the cells use a positive control mechanism. 
(c) More than four, but less than seven, cells use a positive 

control mechanism. 


3-148. A congested computer network has a 1% chance of 

losing a data packet and packet losses are independent events. 

An e-mail message requires 100 packets. 

(a) What is the distribution of data packets that must be re- 
sent? Include the parameter values. 

(b) What is the probability that at least one packet must 
be re-sent? 

(c) What is the probability that two or more packets must 
be re-sent? 

(d) What are the mean and standard deviation of the number 
of packets that must be re-sent? 

(e) If there are 10 messages and each contains 100 packets, 
what is the probability that at least one message requires 
that two or more packets be re-sent? 


3-149. A particularly long traffic light on your morning 

commute is green 20% of the time that you approach it. 

Assume that each morning represents an independent trial. 

(a) What is the probability that the first morning that the light 
is green is the fourth morning that you approach it? 

(b) What is the probability that the light is not green for 10 
consecutive mornings? 


3-150. The probability is 0.6 that a calibration of a trans- 
ducer in an electronic instrument conforms to specifications 
for the measurement system. Assume the calibration attempts 
are independent. What is the probability that at most three 
calibration attempts are required to meet the specifications for 
the measurement system? 


3-151. Anelectronic scale in an automated filling operation 

stops the manufacturing line after three underweight packages 

are detected. Suppose that the probability of an underweight 

package is 0.001 and each fill is independent. 

(a) What is the mean number of fills before the line is stopped? 

(b) What is the standard deviation of the number of fills 
before the line is stopped? 


3-152. The probability that an eagle kills a jackrabbit in a 

day of hunting is 10%. Assume that results are independent 

between days. 

(a) What is the distribution of the number of days until a suc- 
cessful jackrabbit hunt? 

(b) What is the probability that the eagle must wait five days 
for its first successful hunt? 
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(c) What is the expected number of days until a successful 
hunt? 

(d) If the eagle can survive up to 10 days without food (it requires 
a successful hunt on the tenth day), what is the probability 
that the eagle is still alive 10 days from now? 


3-153. Traffic flow is traditionally modeled as a Poisson dis- 
tribution. A traffic engineer monitors the traffic flowing through 
an intersection with an average of six cars per minute. To set the 
timing of a traffic signal, the following probabilities are used. 

(a) What is the probability of no cars through the intersection 
within 30 seconds? 

(b) What is the probability of three or more cars through the 
intersection within 30 seconds? 

(c) Calculate the minimum number of cars through the inter- 
section so that the probability of this number or fewer cars 
in 30 seconds is at least 90%. 

(d) If the variance of the number of cars through the intersec- 
tion per minute is 20, is the Poisson distribution appropriate? 
Explain. 

3-154. A shipment of chemicals arrives in 15 totes. Three of 

the totes are selected at random, without replacement, for an 

inspection of purity. If two of the totes do not conform to 
purity requirements, what is the probability that at least one of 
the nonconforming totes is selected in the sample? 


3-155. The probability that your call to a service line is an- 

swered in less than 30 seconds is 0.75. Assume that your calls 

are independent. 

(a) If you call 10 times, what is the probability that exactly 
nine of your calls are answered within 30 seconds? 

(b) If you call 20 times, what is the probability that at least 16 
calls are answered in less than 30 seconds? 

(c) If you call 20 times, what is the mean number of calls that 
are answered in less than 30 seconds? 


3-156. Continuation of Exercise 3-155. 

(a) What is the probability that you must call four times to 
obtain the first answer in less than 30 seconds? 

(b) What is the mean number of calls until you are answered 
in less than 30 seconds? 


3-157. Continuation of Exercise 3-155. 

(a) What is the probability that you must call six times in 
order for two of your calls to be answered in less than 
30 seconds? 

(b) What is the mean number of calls to obtain two answers in 
less than 30 seconds? 


3-158. The number of messages sent to a computer bulletin 

board is a Poisson random variable with a mean of five mes- 

sages per hour. 

(a) What is the probability that five messages are received in 
1 hour? 

(b) What is the probability that 10 messages are received in 
1.5 hours? 

(c) What is the probability that less than two messages are 
received in one-half hour? 
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3-159. A Web site is operated by four identical computer 
servers. Only one is used to operate the site; the others are 
spares that can be activated in case the active server fails. The 
probability that a request to the Web site generates a failure in 
the active server is 0.0001. Assume that each request is an in- 
dependent trial. What is the mean time until failure of all four 
computers? 


3-160. The number of errors in a textbook follows a 
Poisson distribution with a mean of 0.01 error per page. What 
is the probability that there are three or less errors in 100 
pages? 

3-161. The probability that an individual recovers from 
an illness in a one-week time period without treatment is 
0.1. Suppose that 20 independent individuals suffering 
from this illness are treated with a drug and four recover in 
a one-week time period. If the drug has no effect, what is 
the probability that four or more people recover in a one- 
week time period? 


3-162. Patient response to a generic drug to control pain is 
scored on a 5-point scale, where a 5 indicates complete relief. 
Historically, the distribution of scores is 


1 2 3 4 5 
0.05 0.1 0.2 0.25 0.4 


Two patients, assumed to be independent, are each scored. 

(a) What is the probability mass function of the total score? 

(b) What is the probability mass function of the average 
score? 


3-163. In a manufacturing process that laminates several 

ceramic layers, 1% of the assemblies are defective. Assume 

that the assemblies are independent. 

(a) What is the mean number of assemblies that need to be 
checked to obtain five defective assemblies? 

(b) What is the standard deviation of the number of assem- 
blies that need to be checked to obtain five defective 
assemblies? 


3-164. Continuation of Exercise 3-163. Determine the 
minimum number of assemblies that need to be checked 
so that the probability of at least one defective assembly 
exceeds 0.95. 


3-165. Determine the constant c so that the following 
function is a probability mass function: f(x) = cx for x = 
1, 2,3, 4. 


3-166. A manufacturer of a consumer electronics product 

expects 2% of units to fail during the warranty period. A 

sample of 500 independent units is tracked for warranty 

performance. 

(a) What is the probability that none fails during the warranty 
period? 

(b) What is the expected number of failures during the 
watranty period? 


(c) What is the probability that more than two units fail 
during the warranty period? 


3-167. Messages that arrive at a service center for an in- 
formation systems manufacturer have been classified on the 
basis of the number of keywords (used to help route mes- 
sages) and the type of message, either e-mail or voice. Also, 
70% of the messages arrive via e-mail and the rest are voice. 


number of keywords 0 1 2 3 4 
e-mail 0.1 0.1 0.2 0.4 0.2 
voice 0.3 0.4 0.2 0.1 0 


Determine the probability mass function of the number of 
keywords in a message. 


3-168. The random variable X has the following probability 
distribution: 


x 2 3 5 8 
probability 0.2 0.4 0.3 0.1 


Determine the following: 
(a) P(X <3) 

(c) P(2.7<X <5.1) 
(e) V(X) 

3-169. Determine the probability mass function for the 
random variable with the following cumulative distribution 
function: 


(b) P(X > 2.5) 
(d) E(x) 


0 x<2 
0.2 2=x<57 

F(x) = §0.5 5.75x<65 
0.8 6.5 sx <85 
1 85x 


3-170. Each main bearing cap in an engine contains four 

bolts. The bolts are selected at random, without replacement, 

from a parts bin that contains 30 bolts from one supplier and 

70 bolts from another. 

(a) What is the probability that a main bearing cap contains 
all bolts from the same supplier? 

(b) What is the probability that exactly three bolts are from 
the same supplier? 


3-171. Assume the number of errors along a magnetic 

recording surface is a Poisson random variable with a mean of 

one error every 10° bits. A sector of data consists of 4096 

eight-bit bytes. 

(a) What is the probability of more than one error in a sector? 

(b) What is the mean number of sectors until an error is 
found? 


3-172. An installation technician for a specialized commu- 
nication system is dispatched to a city only when three or 
more orders have been placed. Suppose orders follow a 
Poisson distribution with a mean of 0.25 per week for a city 
with a population of 100,000, and suppose your city contains 
a population of 800,000. 


(a) What is the probability that a technician is required after a 
one-week period? 

(b) If you are the first one in the city to place an order, what is 
the probability that you have to wait more than two weeks 
from the time you place your order until a technician is 
dispatched? 


3-173. From 500 customers, a major appliance manufac- 
turer will randomly select a sample without replacement. The 
company estimates that 25% of the customers will provide 
useful data. If this estimate is correct, what is the probability 
mass function of the number of customers that will provide 
useful data? 

(a) Assume that the company samples five customers. 

(b) Assume that the company samples 10 customers. 


3-174. It is suspected that some of the totes containing 
chemicals purchased from a supplier exceed the moisture 
content target. Samples from 30 totes are to be tested for 
moisture content. Assume that the totes are independent. 
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Determine the proportion of totes from the supplier that must 
exceed the moisture content target so that the probability is 
0.90 that at least one tote in the sample of 30 fails the test. 


3-175. Messages arrive to a computer server according 
to a Poisson distribution with a mean rate of 10 per 
hour. Determine the length of an interval of time such that 
the probability that no messages arrive during this interval 
is 0.90. 


3-176. Flaws occur in the interior of plastic used for auto- 

mobiles according to a Poisson distribution with a mean of 

0.02 flaw per panel. 

(a) If 50 panels are inspected, what is the probability that 
there are no flaws? 

(b) What is the expected number of panels that need to be 
inspected before a flaw is found? 

(c) If 50 panels are inspected, what is the probability that the 
number of panels that have one or more flaws is less than or 
equal to 2? 


MIND-EXPANDING EXERCISES 


3-177. Derive the convergence results used to obtain a 
Poisson distribution as the limit of a binomial distribution. 


3-178. Show that the function f(x) in Example 3-5 
satisfies the properties of a probability mass function 
by summing the infinite series. 


3-179. Derive the formula for the mean and standard 
deviation of a discrete uniform random variable over the 
range of integersa,a + 1,...,b. 

3-180. Derive the expression for the variance of a 
geometric random variable with parameter p. 


3-181. An air flight can carry 120 passengers. A pas- 
senger with a reserved seat arrives for the flight with 
probability 0.95 Assume the passengers behave inde- 
pendently. (Computer software is expected.) 


(a) What is the minimum number of seats the airline 
should reserve for the probability of a full flight to 
be at least 0.90? 

(b) What is the maximum number of seats the airline 
should reserve for the probability that more passen- 
gers arrive than the flight can seat to be less than 0.10? 

(c) Discuss some reasonable policies the airline could 
use to reserve seats based on these probabilities. 


3-182. A company performs inspection on shipments 
from suppliers in order to defect nonconforming prod- 
ucts. Assume a lot contains 1000 items and 1% are 
nonconforming. What sample size is needed so that the 


probability of choosing at least one nonconforming item 
in the sample is at least 0.90? Assume the binomial 
approximation to the hypergeometric distribution is 
adequate. 


3-183. A company performs inspection on shipments 
from suppliers in order to detect nonconforming prod- 
ucts. The company’s policy is to use a sample size that is 
always 10% of the lot size. Comment on the effective- 
ness of this policy as a general rule for all sizes of lots. 


3-184. A manufacturer stocks components obtained 
from a supplier. Suppose that 2% of the components are 
defective and that the defective components occur inde- 
pendently. How many components must the manufacturer 
have in stock so that the probability that 100 orders can be 
completed without reordering components is at least 0.95? 
3-185. A large bakery can produce rolls in lots of 
either 0, 1000, 2000, or 3000 per day. The production 
cost per item is $0.10. The demand varies randomly 
according to the following distribution: 


demand for rolls 0 1000 2000 3000 
probability of demand 0.3 0.2 0.3 0.2 


Every roll for which there is a demand is sold for $0.30. 
Every roll for which there is no demand is sold in a sec- 
ondary market for $0.05. How many rolls should the 
bakery produce each day to maximize the mean profit? 


106 CHAPTER 3 DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS 


IMPORTANT TERMS AND CONCEPTS 


Bernoulli trial Finite population Mean-function of a Probability mass 
Binomial distribution correction factor discrete random function 
Cumulative probability Geometric distribution variable Standard deviation- 
distribution function- Hypergeometric distri- Negative binomial discrete random 
discrete random bution distribution variable 
variable Lack of memory Poisson distribution Variance-discrete 
Discrete uniform distri- property-discrete Poisson process random variable 
bution random variable Probability distribution- 
Expected value of a Mean-discrete random discrete random 
function of a random variable variable 


variable 
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Continuous Random Variables 
and Probability Distributions 


A link between statistics and physical phenomena is provided by the kinetic theory of 
gases. The physicist James Maxwell used some basic assumptions to determine the distri- 
bution of molecular velocity in a gas at equilibrium. As a result of molecular collisions all 
directions of rebound are equally likely. From this concept he assumed equal probabilities 
for velocities in all the x, y, and z directions and also independence of these components 
of velocity. This alone is sufficient to show that the probability distribution of the veloc- 
ity in a particular direction x is the continuous probability distribution known as the nor- 
mal distribution. This fundamental probability distribution can be derived from other di- 
rections (such as the central limit theorem to be discussed in a later chapter), but the 
kinetic theory may be the most parsimonious. This role for the normal distribution illustrates 
one example of the importance of continuous probability distribution distributions within 
science and engineering. 


CHAPTER OUTLINE 

4-1 CONTINUOUS RANDOM 4-4. MEAN AND VARIANCE OF A 
VARIABLES CONTINUOUS RANDOM 

4-2 PROBABILITY DISTRIBUTIONS VARIABLE 
AND PROBABILITY DENSITY 4-5 CONTINUOUS UNIFORM 
FUNCTIONS DISTRIBUTION 

4-3. CUMULATIVE DISTRIBUTION 4-6 NORMAL DISTRIBUTION 
FUNCTIONS 
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4-7 NORMAL APPROXIMATION TO 4-10 WEIBULL DISTRIBUTION 
THE BINOMIAL AND POISSON 4-11 LOGNORMAL DISTRIBUTION 


Pe UON: 4-12 BETA DISTRIBUTION 
4-8 EXPONENTIAL DISTRIBUTION 


4-9 ERLANG AND GAMMA 
DISTRIBUTIONS 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 

1. Determine probabilities from probability density functions 

2. Determine probabilities from cumulative distribution functions and cumulative distribution func- 
tions from probability density functions, and the reverse 

3. Calculate means and variances for continuous random variables 

4. Understand the assumptions for some common continuous probability distributions 

5. Select an appropriate continuous probability distribution to calculate probabilities in specific applications 

6. Calculate probabilities, determine means and variances for some common continuous probability 
distributions 

7. Standardize normal random variables 

8. Use the table for the cumulative distribution function of a standard normal distribution to calcu- 
late probabilities 


9. Approximate probabilities for some binomial and Poisson distributions 


4-1 CONTINUOUS RANDOM VARIABLES 


Suppose a dimensional length is measured on a manufactured part selected from a day’s 
production. In practice, there can be small variations in the measurements due to many causes, 
such as vibrations, temperature fluctuations, operator differences, calibrations, cutting tool 
wear, bearing wear, and raw material changes. In an experiment such as this, the measurement 
is naturally represented as a random variable X and it is reasonable to model the range of pos- 
sible values of X with an interval of real numbers. Recall from Chapter 2 that a continuous 
random variable is a random variable with an interval (either finite or infinite) of real num- 
bers for its range. The model provides for any precision in length measurements. 

Because the number of possible values of X is uncountably infinite, X has a distinctly dif- 
ferent distribution from the discrete random variables studied previously. But as in the discrete 
case, many physical systems can be modeled by the same or similar continuous random vari- 
ables. These random variables are described, and example computations of probabilities, 
means, and variances are provided in the remaining sections of this chapter. 


4-2 PROBABILITY DISTRIBUTIONS AND PROBABILITY 
DENSITY FUNCTIONS 


Density functions are commonly used in engineering to describe physical systems. For exam- 
ple, consider the density of a loading on a long, thin beam as shown in Fig. 4-1. For any point 
x along the beam, the density can be described by a function (in grams/cm). Intervals with 
large loadings correspond to large values for the function. The total loading between points a 
and b is determined as the integral of the density function from a to b. This integral is the area 
under the density function over this interval, and it can be loosely interpreted as the sum of all 
the loadings over this interval. 


Probability 
Density 
Function 


Figure 4-3 
Histogram approxi- 
mates a probability 
density function. 
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f (x) 
Lot} 
< Pia<X<b) 
no} 
oO 
E 2, aw 
* a b x 


Figure 4-1 Density function ofa Figure 4-2 Probability determined from the area 
loading on a long, thin beam. under f(x). 


Similarly, a probability density function f(x) can be used to describe the probability dis- 
tribution of a continuous random variable X. If an interval is likely to contain a value for X, 
its probability is large and it corresponds to large values for f(x). The probability that X is 
between a and b is determined as the integral of f(x) from a to b. See Fig. 4-2. 


For a continuous random variable X, a probability density function is a function 
such that 


(1) f(x) =0 
@ | fa=1 


b 
3) PiasXsb)= | 10) dx = area under f(x) from a to b 


for any a and b (4-1) 


A probability density function provides a simple description of the probabilities asso- 
ciated with a random variable. As long as f(x) is nonnegative and {~, f(x) dx = 1, 
0 = P(a < X <b) S1 so that the probabilities are properly restricted. A probability density 
function is zero for x values that cannot occur and it is assumed to be zero wherever it is not 
specifically defined. 

A histogram is an approximation to a probability density function. See Fig. 4-3. For each 
interval of the histogram, the area of the bar equals the relative frequency (proportion) of the 
measurements in the interval. The relative frequency is an estimate of the probability that a 
measurement falls in the interval. Similarly, the area under f(x) over any interval equals the 
true probability that a measurement falls in the interval. 

The important point is that f(x) is used to calculate an area that represents the probabil- 
ity that X assumes a value in [a, b]. For the current measurement example, the probability that 
X results in [14 mA, 15 mA] is the integral of the probability density function of X over this in- 
terval. The probability that X results in [14.5 mA, 14.6 mA] is the integral of the same function, 
f(x), over the smaller interval. By appropriate choice of the shape of f(x), we can represent the 


f(x) 
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probabilities associated with any continuous random variable X. The shape of f(x) determines 
how the probability that X assumes a value in [14.5 mA, 14.6 mA] compares to the probabil- 
ity of any other interval of equal or different length. 

For the density function of a loading on a long, thin beam, because every point has zero width, 
the loading at any point is zero. Similarly, for a continuous random variable X and any value x, 


P(X =x) =0 


Based on this result, it might appear that our model of a continuous random variable is use- 
less. However, in practice, when a particular current measurement is observed, such as 14.47 
milliamperes, this result can be interpreted as the rounded value of a current measurement that 
is actually in arange suchas 14.465 = x = 14.475. Therefore, the probability that the rounded 
value 14.47 is observed as the value for X is the probability that X assumes a value in the inter- 
val [14.465, 14.475], which is not zero. Similarly, because each point has zero probability, one 
need not distinguish between inequalities such as < or = for continuous random variables. 


If X is a continuous random variable, for any x, and x», 


P(x; = X Sx) = P(x, < X SX) = P(x) S X < xy) = Play < X <Q) (42) 


EXAMPLE 4-1 Electric Current 
Let the continuous random variable X denote the current meas- 
ured in a thin copper wire in milliamperes. Assume that the range 
of X is [0, 20 mA], and assume that the probability density func- 
tion of X is f(x) = 0.05 for 0 = x = 20. What is the probabil- 
ity that a current measurement is less than 10 milliamperes? 
The probability density function is shown in Fig. 4-4. It is 
assumed that f(x) = 0 wherever it is not specifically defined. 
The probability requested is indicated by the shaded area in 
Fig. 4-4. 


EXAMPLE 4-2 Hole Diameter 

Let the continuous random variable X denote the diameter of a 
hole drilled in a sheet metal component. The target diameter is 
12.5 millimeters. Most random disturbances to the process 
result in larger diameters. Historical data show that the distri- 
bution of X can be modeled by a probability density function 


10 10 
P(X < 10) = [19 dx = | 0.05 dx = 0.5 
0 0 
As another example, 
20 
P(5 < X < 20) = [709 dx = 0.75 


5 


function and the requested probability are shown in Fig. 4-5. 
A part is scrapped if X > 12.60. Now, 


oo 


P(X > 12.60) = | f(x) dx = | 20e-2%"= 125) dy 


f(x) = 20e7 200 12.5), x= 12.5. 12.6 12.6 
If a part with a diameter larger than 12.60 millimeters is = — 97 20(x-12.5) 7 = 0.135 
scrapped, what proportion of parts is scrapped? The density 12.6 : 
F(x) 
f (x) 
0.05 
6) 10 20 a 12.5 12:6 x 


Figure 4-4 Probability density 
function for Example 4-1. 


Figure 4-5 Probability density function for 
Example 4-2. 


What proportion of parts is between 12.5 and 12.6 mil- 
limeters? Now, 


12.6 
12.6 
P(12.5 < X < 12.6) = | fx) dx = —e-200-125 
ing 12.5 
= 0.865 


EXERCISES FOR SECTION 4-2 
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Because the total area under f(x) equals 1, we can also calcu- 
late P(12.5 < X < 12.6) =1 — P(X > 12.6) = 1 — 0.135 = 
0.865. 

Practical Interpretation: Because 0.135 is the proportion of 
parts with diameters greater than 12.60 mm a large proportion of 
parts are scrapped. Process improvements are needed to increase 
the proportion of parts with dimensions near 12.50 mm. 


4-1. Suppose that f(x) =e * for 0 <x. Determine the 
following: 


(a) P(1 <X) (b) P(1 < X < 2.5) 
(c) P(X = 3) (d) P(X < 4) 
(ec) P(3 =X) 


(f) Determine x such that P(x < X) = 0.10. 

(g) Determine x such that P(X = x) = 0.10. 

4-2. Suppose that f(x) = 3(8x — x°)/256 forO <x <8. 
Determine the following: 

(a) P(X <2) (b) P(X < 9) 

(c) P(2<X<4) (d) P(X > 6) 

(e) Determine x such that P(X < x) = 0.95. 

4-3. Suppose that f(x) =0.5cosxfor —w/2 <x <7q/2. 
Determine the following: 

(a) P(X <0) (b) P(X < —1/4) 

(c) P(-w/4<X<q/4)  (d) P(X > —a/4) 

(e) Determine x such that P(X < x) = 0.95. 

4-4. The diameter of a particle of contamination (in mi- 
crometers) is modeled with the probability density function 
f(x) = 2/2? for x > 1. Determine the following: 

(a) P(X <2) (b) P(X > 5) 

(c) P('4<X <8) (d) P(X < 40rX > 8) 

(e) Determine x such that P(X < x) = 0.95. 

4-5. Suppose that f(x) = x/8 for 3 < x < 5. Determine the 
following probabilities: 

(a) P(X <4) 

(c) P(4<X <5) 

(ec) P(X < 3.5 or X > 4.5) 
4-6. Suppose that f(x) = e “~“) for 4 < x. Determine the 
following: 

(a) P(1 <X) (b) P(2<X <5) 

(c) P(S < X) (d) P(8 < X < 12) 

(e) Determine x such that P(X < x) = 0.90. 

4-7. Suppose that f(x) = 1.5x” for -1 <x < 1. Determine 
the following: 

(a) P(O < X) 

(c) P(—0.5 = X $0.5) 


(b) P(X > 3.5) 
(d) P(X < 4.5) 


(b) P(0.5 < X) 
(d) P(X < —2) 


(ec) P(X < OorX > —0.5) 
(f) Determine x such that P(x < X) = 0.05. 


4-8. The probability density function of the time to failure 

of an electronic component in a copier (in hours) is f(x) = 

e */1099/1000 for x > 0. Determine the probability that 

(a) A component lasts more than 3000 hours before failure. 

(b) A component fails in the interval from 1000 to 2000 
hours. 

(c) A component fails before 1000 hours. 

(d) Determine the number of hours at which 10% of all com- 
ponents have failed. 


4-9. The probability density function of the net weight in 

pounds of a packaged chemical herbicide is f(x) = 2.0 for 

49.75 < x < 50.25 pounds. 

(a) Determine the probability that a package weighs more 
than 50 pounds. 

(b) How much chemical is contained in 90% of all packages? 


4-10. The probability density function of the length of a cut- 
ting blade is f(x) = 1.25 for 74.6 <x < 75.4 millimeters. 
Determine the following: 

(a) P(X < 74.8) 

(b) P(X < 74.8 or X > 75.2) 

(c) If the specifications for this process are from 74.7 
to 75.3 millimeters, what proportion of blades meets 
specifications? 

4-11. The probability density function of the length of a 

metal rod is f(x) = 2 for 2.3 <x < 2.8 meters. 

(a) If the specifications for this process are from 2.25 to 2.75 
meters, what proportion of rods fail to meet the specifica- 
tions? 

(b) Assume that the probability density function is f(x) = 2 
for an interval of length 0.5 meters. Over what value 
should the density be centered to achieve the greatest pro- 
portion of bars within specifications? 

4-12. If Xis acontinuous random variable, argue that P(x, = 

X =X) = P(x, < X SX) = PO, SX <x) = PO, <X <x). 


4-3 CUMULATIVE DISTRIBUTION FUNCTIONS 


An alternative method to describe the distribution of a discrete random variable can also be 
used for continuous random variables. 
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Cumulative 
Distribution 
Function 


for —-~<x< ©, 


The cumulative distribution function of a continuous random variable X is 


(4-3) 


Extending the definition of f(x) to the entire real line enables us to define the cumulative dis- 
tribution function for all real numbers. The following example illustrates the definition. 


EXAMPLE 4-3 Electric Current 

For the copper current measurement in Example 4-1, the 
cumulative distribution function of the random variable X con- 
sists of three expressions. If x < 0, f(x) = 0. Therefore, 


F(x) =0, for x<0 


and 
F(x) = | 10) du = 0.05x, for 0=x< 20 


0 


Finally, 
F(x) = | nw du=1, for 20=x 
0 
Therefore, 
0 x <0 
F(x) = §0.05x Ox < 20 
1 20 =x 


The plot of F(x) is shown in Fig. 4-6. 


Notice that in the definition of F(x), any < can be changed to < and vice versa. That is, 
F(x) can be defined as either 0.05x or O at the end-point x = 0, and F(x) can be defined as 
either 0.05x or 1 at the end-point x = 20. In other words, F(x) is a continuous function. For a 
discrete random variable, F(x) is not a continuous function. Sometimes, a continuous random 
variable is defined as one that has a continuous cumulative distribution function. 


EXAMPLE 4-4 Hole Diameter 
For the drilling operation in Example 4-2, F(x) consists of two 
expressions. 


F(x) =0 for x< 12.5 


and for 12.5 = x, 
F(x) = [ 20erae-29 du 
12.5 


= 1 — o7 20-125) 


Therefore, 


0 x< 12.5 
Fla) = {i =e MP5) 125 <% 


Figure 4-7 displays a graph of F(x). 

Practical Interpretation: The cumulative distribution 
function enables one to easily calculate the probability a di- 
ameter in less than a value (such as 12.60 mm). Therefore, the 
probability of a scrapped part can be easily determined. 


The probability density function of a continuous random variable can be determined from 
the cumulative distribution function by differentiating. Recall that the fundamental theorem of 


calculus states that 


F(x) 


0) 20 


Figure 4-6 Cumulative distribution 
function for Example 4-3. 


Then, given F(x), 


as long as the derivative exists. 


EXAMPLE 4-5 Reaction Time 
The time until a chemical reaction is complete (in millisec- 
onds) is approximated by the cumulative distribution function 


0 x<0 


1= oe 0-O1x O=<x 


Determine the probability density function of X. What propor- 
tion of reactions is complete within 200 milliseconds? Using 


EXERCISES FOR SECTION 4-3 
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a 0 12.5 x 


Figure 4-7 Cumulative distribution 
function for Example 4-4. 


the result that the probability density function is the deriva- 
tive of F(x), we obtain 


0 x<0 
f(x) = ee O<x 


The probability that a reaction completes within 200 milli- 
seconds is 


P(X < 200) = F(200) = 1 — e~? = 0.8647. 


4-13. Suppose the cumulative distribution function of the 
random variable X is 
0 x<0 
F(x) = §0.2x OSx<5 
1 5S=x 


Determine the following: 

(a) P(X < 2.8) (b) P(X > 1.5) 

(c) P(X <—2) (d) P(X> 6) 

4-14. Suppose the cumulative distribution function of the 
random variable X is 


0 x<—-2 
F(x) = §0.25x +05 -25x<2 
1 2sx 


Determine the following: 

(a) P(X < 1.8) (b) P(X > -1.5) 

(c) P(X <—2) (d) P(-1<X<1) 

4-15. Determine the cumulative distribution function for 
the distribution in Exercise 4-1. 

4-16. Determine the cumulative distribution function for 
the distribution in Exercise 4-2. 


4-17. Determine the cumulative distribution function for 

the distribution in Exercise 4-3. 

4-18. Determine the cumulative distribution function for 

the distribution in Exercise 4-4. 

4-19. Determine the cumulative distribution function for 

the distribution in Exercise 4-5. 

4-20. Determine the cumulative distribution function for 

the distribution in Exercise 4-8. Use the cumulative distribu- 

tion function to determine the probability that a component 

lasts more than 3000 hours before failure. 

4-21. Determine the cumulative distribution function for 

the distribution in Exercise 4-11. Use the cumulative distribu- 

tion function to determine the probability that a length 

exceeds 2.7 meters. 

4-22. The probability density function of the time customers 

arrive at a terminal (in minutes after 8:00 A.M.) is f(x) = 

e ~/'9/10 for 0 <x. Determine the probability that 

(a) The first customer arrives by 9:00 A.M. 

(b) The first customer arrives between 8:15 A.M. and 8:30 A.M. 

(c) Two or more customers arrive before 8:40 A.M. among five 
who arrive at the terminal. Assume customer arrivals are 
independent. 

(d) Determine the cumulative distribution function and use 
the cumulative distribution function to determine the 
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probability that the first customer arrives between 4-25. 

8:15 A.M. and 8:30 A.M. 0 x<0 
4-23. The gap width is an important property of a magnetic 0.2x 0O<x<4 
recording head. In coded units, if the width isa continuous (x) 0.04x +064 4<x<9 
random variable over the range from 0 < x < 2 with f(x) = 0.5x, 1 ge 
determine the cumulative distribution function of the gap =e 
width. 4-26. 

Determine the probability density function for each of the fol- 0 x< 2 

lowing cumulative distribution functions. ee 0.25x +05 —2<x<1 

4-24, F(x)=1—-e* x>0 MV o5,4025  1<x<15 
1 15=x 


4-4 MEAN AND VARIANCE OF A CONTINUOUS 
RANDOM VARIABLE 


The mean and variance can also be defined for a continuous random variable. Integration 
replaces summation in the discrete definitions. If a probability density function is viewed as a 
loading on a beam as in Fig. 4-1, the mean is the balance point. 


Mean 


: and Suppose X is a continuous random variable with probability density function f(x). 
Variance The mean or expected value of X, denoted as w or E(X), is 


p= 21x) = | fla) ax (4-4) 
The variance of X, denoted as V(X) or 0°, is 
o =x) = | (- wyedr= | Ae) ar~ Ww 


The standard deviation of X is o = Vo. 


The equivalence of the two formulas for variance can be derived from the same approach used 
for discrete random variables. 


EXAMPLE 4-6 Electric Current 


For the copper current measurement in Example 4-1, the mean The variance of X is 
of X is 
20 
20 es : 3 | 
E(X) = [ one) dx = 0.05x2/2 | = 10 Vix) = | (x — 10)f(x) dx = 0.05(x — 10)°/3 2 33.33 
0 0 
0 


The expected value of a function h(X) of a continuous random variable is also defined in a 
straightforward manner. 
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Expected 
Value of a 
Function of a 
Continuous 
Random 
Variable 


If X is a continuous random variable with probability density function f(x), 


(4-5) 


EXAMPLE 4-7 

In Example 4-1, X is the current measured in milliamperes. 
What is the expected value of the squared current? Now, 
h(X) = X?. Therefore, 


oo 20 


xe 20 
E[h(X)] = | x f(x) dx = | 0.05x? dx = 0.05 = 
—0 0 9 
= 133.33 


EXAMPLE 4-8 Hole Diameter 


For the drilling operation in Example 4-2, the mean of X is 


E(X) = | af(x) dx = | x 202% 12-5) dy 
2 2.5 
Integration by parts can be used to show that 


e720 12.5) 


20 


~20(x—12.5) 


Il 


= 12.5 + 0.05 
12.5 


—xe 


E(X) 


ll 


12.55 


EXERCISES FOR SECTION 4-4 


In the previous example, the expected value of X? does not equal 
E(X) squared. However, in the special case that h(X) = aX + b 
for any constants a and b, E[h(X)] = aE(X) + b. This can be 
shown from the properties of integrals. 


The variance of X is 


V(X) = | (x — 12.55)?f(x) dx 


Although more difficult, integration by parts can be used two 
times to show that V(X) = 0.0025 and o = 0.05. 

Practical Interpretation: The scrap limit at 12.60 mm is 
only one standard deviation greater than the mean. This is gen- 
erally a warning that the scraps may be unacceptably high. 


4-27. Suppose f(x) = 0.25 for 0 < x < 4. Determine the 

mean and variance of X. 

4-28. Suppose f(x) = 0.125x for 0 < x < 4. Determine 

the mean and variance of X. 

4-29. Suppose f(x) = 1.5x? for —1 <x < 1. Determine 

the mean and variance of X. 

4-30. Suppose that f(x) = x/8 for 3 <x <5. Determine 

the mean and variance of x. 

4-31. Determine the mean and variance of the random vari- 

able in Exercise 4-2. 

4-32. Determine the mean and variance of the random vari- 

able in Exercise 4-1. 

4-33. Suppose that contamination particle size (in microm- 

eters) can be modeled as f(x) = 2x”? for 1 < x. Determine 

the mean of X. 

4-34. Suppose the probability density function of the length 

of computer cables is f(x) = 0.1 from 1200 to 1210 millimeters. 

(a) Determine the mean and standard deviation of the cable 
length. 

(b) Ifthe length specifications are 1195 < x < 1205 millimeters, 
what proportion of cables are within specifications? 


4-35. The thickness of a conductive coating in micrometers 
has a density function of 600x~? for 100 wm < x < 120 pm. 
(a) Determine the mean and variance of the coating thickness. 
(b) If the coating costs $0.50 per micrometer of thickness on 

each part, what is the average cost of the coating per part? 


4-36. The probability density function of the weight of 

packages delivered by a post office is f(x) = 70/(69x7) for 

1 <x< 70 pounds. 

(a) Determine the mean and variance of weight. 

(b) If the shipping cost is $2.50 per pound, what is the aver- 
age shipping cost of a package? 

(c) Determine the probability that the weight of a package 
exceeds 50 pounds. 


4-37. Integration by parts is required. The probability den- 
sity function for the diameter of a drilled hole in millimeters is 
10e7!°C—9) for x > 5 mm. Although the target diameter is 
5 millimeters, vibrations, tool wear, and other nuisances pro- 
duce diameters larger than 5 millimeters. 

(a) Determine the mean and variance of the diameter of the holes. 
(b) Determine the probability that a diameter exceeds 5.1 

millimeters. 
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4-5 CONTINUOUS UNIFORM DISTRIBUTION 
The simplest continuous distribution is analogous to its discrete counterpart. 


Continuous 


Uniform A continuous random variable X with probability density function 
Distribution 


f(x) = 1/(b — a), axx<=b (4-6) 


is a continuous uniform random variable. 


The probability density function of a continuous uniform random variable is shown in Fig. 4-8. 
The mean of the continuous uniform random variable X is 


The variance of X is 


These results are summarized as follows. 


Mean and 


Variance If X is a continuous uniform random variable over a = x = b, 


(a + b) 


— ay 
pw = E(X) = 5 and ea 


12 


(a7) 


EXAMPLE 4-9 Uniform Current 

Let the continuous random variable X denote the current The mean and variance formulas can be applied with a = 0 

measured in a thin copper wire in milliamperes. Assume that and b = 20. Therefore, 

the range of X is [0, 20 mA], and assume that the probability 

density function of X is f(x) = 0.05,0 < x = 20. E(X)=10mA and V(X) = 207/12 = 33.33 mA” 
What is the probability that a measurement of current is 

between 5 and 10 milliamperes? The requested probability is Consequently, the standard deviation of X is 5.77 mA. 

shown as the shaded area in Fig. 4-9. 


P(5 < X < 10) = | f(x) dx 


—— Ss 


ll 
Noa 


(0.05) = 0.25 


f) 


a b 


Figure 4-8 Continuous uniform 
probability density function. 
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6) 5 10 15 20 % 
Figure 4-9 Probability for Example 4-9. 


The cumulative distribution function of a continuous uniform random variable is ob- 


tained by integration. Ifa < x < 5, 


a)du = x/(b — a) — a/(b — a) 


Therefore, the complete description of the cumulative distribution function of a continuous 


uniform random variable is 


0 x<a 
(x -—a)(b-—a) axsx<b 
1 bsx 


An example of F(x) for a continuous uniform random variable is shown in Fig. 4-6. 


EXERCISES FOR SECTION 4-5 


4-38. Suppose X has a continuous uniform distribution over 

the interval [1.5, 5.5]. 

(a) Determine the mean, variance, and standard deviation of X. 

(b) What is P(X < 2.5)? 

(c) Determine the cumulative distribution function. 

4-39. Suppose X has a continuous uniform distribution over 

the interval [—1, 1]. 

(a) Determine the mean, variance, and standard deviation 
of X. 

(b) Determine the value for x such that P(—x < X < x) = 0.90. 

(c) Determine the cumulative distribution function. 


4-40. The net weight in pounds of a packaged chemical 

herbicide is uniform for 49.75 < x < 50.25 pounds. 

(a) Determine the mean and variance of the weight of packages. 

(b) Determine the cumulative distribution function of the 
weight of packages. 

(c) Determine P(X < 50.1). 

4-41. The thickness of a flange on an aircraft component is 

uniformly distributed between 0.95 and 1.05 millimeters. 

(a) Determine the cumulative distribution function of flange 
thickness. 

(b) Determine the proportion of flanges that exceeds 1.02 
millimeters. 

(c) What thickness is exceeded by 90% of the flanges? 

(d) Determine the mean and variance of flange thickness. 


4-42. Suppose the time it takes a data collection operator to 

fill out an electronic form for a database is uniformly between 

1.5 and 2.2 minutes. 

(a) What is the mean and variance of the time it takes an op- 
erator to fill out the form? 

(b) What is the probability that it will take less than two min- 
utes to fill out the form? 

(c) Determine the cumulative distribution function of the time 
it takes to fill out the form. 


4-43. The thickness of photoresist applied to wafers in 

semiconductor manufacturing at a particular location on the 

wafer is uniformly distributed between 0.2050 and 0.2150 

micrometers. 

(a) Determine the cumulative distribution function of pho- 
toresist thickness. 

(b) Determine the proportion of wafers that exceeds 0.2125 
micrometers in photoresist thickness. 

(c) What thickness is exceeded by 10% of the wafers? 

(d) Determine the mean and variance of photoresist thickness. 


4-44. An adult can lose or gain two pounds of water in the 
course of a day. Assume that the changes in water weight are 
uniformly distributed between minus two and plus two pounds in 
a day. What is the standard deviation of your weight over a day? 


4-45. A show is scheduled to start at 9:00 A.M., 9:30 A.M., 
and 10:00 A.M. Once the show starts, the gate will be closed. 
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A visitor will arrive at the gate at a time uniformly distributed 

between 8:30 A.M. and 10:00 A.M. Determine 

(a) The cumulative distribution function of the time (in minutes) 
between arrival and 8:30 A.M. 

(b) The mean and variance of the distribution in the previous 
part. 

(c) The probability that a visitor waits less than 10 minutes 
for a show. 

(d) The probability that a visitor waits more than 20 minutes 
for a show. 


4-46. The volume of a shampoo filled into a container is uni- 

formly distributed between 374 and 380 milliliters. 

(a) What are the mean and standard deviation of the volume 
of shampoo? 

(b) What is the probability that the container is filled with less 
than the advertised target of 375 milliliters? 

(c) What is the volume of shampoo that is exceeded by 95% 
of the containers? 

(d) Every milliliter of shampoo costs the producer $0.002. 
Any more shampoo in the container than 375 milliliters is 
an extra cost to the producer. What is the mean extra cost? 
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4-47. An e-mail message will arrive at a time uniformly 

distributed between 9:00 A.M. and 11:00 A.m. You check e-mail 

at 9:15 A.M. and every 30 minutes afterward. 

(a) What is the standard deviation of arrival time (in minutes)? 

(b) What is the probability that the message arrives less than 
10 minutes before you view it? 

(c) What is the probability that the message arrives more than 
15 minutes before you view it? 


4-48. Measurement error that is continuous and uniformly 
distributed from —3 to +3 millivolts is added to the true voltage 
of a circuit. Then the measurement is rounded to the nearest mil- 
livolt so that it becomes discrete. Suppose that the true voltage 
is 250 millivolts. 

(a) What is the probability mass function of the measured 

voltage? 
(b) What is the mean and variance of the measured voltage? 


Undoubtedly, the most widely used model for the distribution of a random variable is a 
normal distribution. Whenever a random experiment is replicated, the random variable that 
equals the average (or total) result over the replicates tends to have a normal distribution as 
the number of replicates becomes large. De Moivre presented this fundamental result, known 
as the central limit theorem, in 1733. Unfortunately, his work was lost for some time, and 
Gauss independently developed a normal distribution nearly 100 years later. Although De 
Moivre was later credited with the derivation, a normal distribution is also referred to as a 
Gaussian distribution. 

When do we average (or total) results? Almost always. For example, an automotive engi- 
neer may plan a study to average pull-off force measurements from several connectors. If we 
assume that each measurement results from a replicate of a random experiment, the normal 
distribution can be used to make approximate conclusions about this average. These conclu- 
sions are the primary topics in the subsequent chapters of this book. 

Furthermore, sometimes the central limit theorem is less obvious. For example, assume 
that the deviation (or error) in the length of a machined part is the sum of a large number of 
infinitesimal effects, such as temperature and humidity drifts, vibrations, cutting angle 
variations, cutting tool wear, bearing wear, rotational speed variations, mounting and fixture 
variations, variations in numerous raw material characteristics, and variation in levels of 
contamination. If the component errors are independent and equally likely to be positive or 
negative, the total error can be shown to have an approximate normal distribution. 
Furthermore, the normal distribution arises in the study of numerous basic physical phenomena. 
For example, the physicist Maxwell developed a normal distribution from simple assumptions 
regarding the velocities of molecules. 

The theoretical basis of a normal distribution is mentioned to justify the somewhat com- 
plex form of the probability density function. Our objective now is to calculate probabilities 
for a normal random variable. The central limit theorem will be stated more carefully later. 


Normal 
Distribution A random variable X with probability density function 
1 ica) 
f(x) = e ww —w<x<0 (4-8) 
2710 
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x 
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Figure 4-10 Normal probability density functions for 
selected values of the parameters js and 0”. 


Random variables with different means and variances can be modeled by normal proba- 
bility density functions with appropriate choices of the center and width of the curve. The 
value of E(X) = p determines the center of the probability density function and the value of 
V(X) = o° determines the width. Figure 4-10 illustrates several normal probability density 
functions with selected values of jz and o”. Each has the characteristic symmetric bell-shaped 
curve, but the centers and dispersions differ. The following definition provides the formula for 
normal probability density functions. 


is a normal random variable with parameters , where —%© < pw < ©, and o > 0. 
Also, 
EOC and) Vx) — 0 (4-9) 


and the notation M(w, 0”) is used to denote the distribution. 


The mean and variance of X are shown to equal w and o”, respectively, in an exercise at the 
end of chapter 5. 


EXAMPLE 4-10 
Assume that the current measurements in a strip of wire fol- 
low a normal distribution with a mean of 10 milliamperes and 
a variance of 4 (milliamperes). What is the probability that a 
measurement exceeds 13 milliamperes? 

Let X denote the current in milliamperes. The requested 
probability can be represented as P(X > 13). This probability 


is shown as the shaded area under the normal probability 
density function in Fig. 4-11. Unfortunately, there is no 
closed-form expression for the integral of a normal probabil- 
ity density function, and probabilities based on the normal dis- 
tribution are typically found numerically or from a table (that 
we will later introduce). 


Some useful results concerning a normal distribution are summarized below and in 
Fig. 4-12. For any normal random variable, 


P(p-— ao <X <p +o) = 0.6827 
P(p — 20 < X < p + 20) = 0.9545 
P(p — 30 < X <p + 30) = 0.9973 


Also, from the symmetry of f(x), P(X > w) = P(X < p) = 0.5. Because f(x) is positive for 
all x, this model assigns some probability to each interval of the real line. However, the 
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f(x) 
f(x) 
p-30 U-20 p-o 1 eto pt+2o0 wt+30 x 
|<«— 63% —>| 
I< 95% >| 
10 13 x 99.7% al 

Figure 4-11 Probability that X > 13 for a normal Figure 4-12 Probabilities associated with a normal 
random variable with ~ = 10 and o” = 4. distribution. 


probability density function decreases as x moves farther from jz. Consequently, the probability 
that a measurement falls far from ws is small, and at some distance from yw the probability of an 
interval can be approximated as zero. 

The area under a normal probability density function beyond 30 from the mean is quite 
small. This fact is convenient for quick, rough sketches of a normal probability density function. 
The sketches help us determine probabilities. Because more than 0.9973 of the probability of a 
normal distribution is within the interval (uw — 30, p» + 30), 60 is often referred to as 
the width of a normal distribution. Advanced integration methods can be used to show that the 
area under the normal probability density function from —% < x < ois I. 


Standard 


Normal A normal random variable with 
Random 


Variable w=0 and ceil 


is called a standard normal random variable and is denoted as Z. The cumulative 
distribution function of a standard normal random variable is denoted as 


Dic PZ =z) 


Appendix Table III provides cumulative probabilities for a standard normal random vari- 
able. Cumulative distribution functions for normal random variables are also widely available 
in computer packages. They can be used in the same manner as Appendix Table III to obtain 
probabilities for these random variables. The use of Table III is illustrated by the following 
example. 


EXAMPLE 4-11 Standard Normal Distribution 

Assume Z is a standard normal random variable. Appendix The column headings refer to the hundredths digit of the 
Table III provides probabilities of the form ®(z) = P(Z = 2). value of zin P(Z = z). For example, P(Z = 1.53) is found by 
The use of Table III to find P(Z = 1.5) is illustrated in Fig. 4-13. reading down the z column to the row 1.5 and then selecting 
Read down the z column to the row that equals 1.5. The prob- the probability from the column labeled 0.03 to be 0.93699. 
ability is read from the adjacent column, labeled 0.00, to be 

0.93319. 


P(Z < 1.5) = (1.5) 
= shaded area 
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0.00 0.01 0.02 0.03 


Figure 4-13 Standard 
normal probability den- 


sity function. 0) 1.5 z 


1.5 


0.50000 0.50399 0.50398 0.51197 


0.93319 0.93448 0.93574 0.93699 


Probabilities that are not of the form P(Z = z) are found by using the basic rules of prob- 
ability and the symmetry of the normal distribution along with Appendix Table III. The fol- 


lowing examples illustrate the method. 


EXAMPLE 4-12 

The following calculations are shown pictorially in Fig. 4-14. 
In practice, a probability is often rounded to one or two signif- 
icant digits. 


(1) P(Z> 1.26) =1— P(Z< 1.26) = 1 — 0.89616 
= 0.10384 

(2) P(Z< —0.86) = 0.19490. 

(3) P(Z > 1.37) = P(Z < 1.37) = 0.91465 


(4) P(—1.25 < Z < 0.37). This probability can be found 
from the difference of two areas, P(Z < 0.37) — 
P(Z < —1.25). Now, 


(5) 


1.37 


-1.25 O 0.37 0 0.37 


Figure 4-14 Graphical displays for standard normal distributions. 


1.26 


P(Z < 0.37) = 0.64431 


and 

P(Z < —1.25) = 0.10565 

Therefore, 

P(-1.25 < Z < 0.37) = 0.64431 — 0.10565 


= 0.53866 


P(Z = —4.6) cannot be found exactly from Appendix 
Table III. However, the last entry in the table can be used 
to find that P(Z = —3.99) = 0.00003. Because 
P(Z = —4.6) < P(Z = —3.99), P(Z = —4.6) is nearly 
zero. 


(5) 
-4.6 -3.99 0 
(6) 
0.05 
0 z=1.65 
7) 0.99 
0.005 0.005 
-zZ 0 z2=2.58 
-1.25 0 
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(6) Find the value z such that P(Z > z) = 0.05. This proba- 
bility expression can be written as P(Z = z) = 0.95. 
Now, Table III is used in reverse. We search through the 
probabilities to find the value that corresponds to 0.95. 
The solution is illustrated in Fig. 4-14. We do not find 
0.95 exactly; the nearest value is 0.95053, corresponding 
to z= 1.65. 


(7) Find the value of z such that P(—z < Z < z) = 0.99. 
Because of the symmetry of the normal distribution, if 
the area of the shaded region in Fig. 4-14(7) is to equal 
0.99, the area in each tail of the distribution must equal 
0.005. Therefore, the value for z corresponds to a proba- 
bility of 0.995 in Table II. The nearest probability in 
Table III is 0.99506, when z = 2.58. 


The preceding examples show how to calculate probabilities for standard normal 
random variables. To use the same approach for an arbitrary normal random variable would 
require a separate table for every possible pair of values for . and o. Fortunately, all normal 
probability distributions are related algebraically, and Appendix Table III can be used to find 
the probabilities associated with an arbitrary normal random variable by first using a simple 


transformation. 


Standardizing a 


Normal If X is anormal random variable with E(X) = p and V(X) = o”, the random variable 
Random 
Variable a 
Us aa (4-10) 


normal random variable. 


is a normal random variable with E(Z) = O and V(Z) = 1. That is, Z is a standard 


Creating a new random variable by this transformation is referred to as standardizing. 
The random variable Z represents the distance of X from its mean in terms of standard 
deviations. It is the key step to calculating a probability for an arbitrary normal random 


variable. 


EXAMPLE 4-13 Normally Distributed Current 
Suppose the current measurements in a strip of wire are as- 
sumed to follow a normal distribution with a mean of 10 mil- 
liamperes and a variance of 4 (milliamperes)*. What is the 
probability that a measurement will exceed 13 milliamperes? 
Let X denote the current in milliamperes. The requested 
probability can be represented as P(X > 13). Let Z = (X — 
10)/2. The relationship between the several values of X and the 
transformed values of Z are shown in Fig. 4-15. We note that X > 
13 corresponds to Z > 1.5. Therefore, from Appendix Table III, 


P(X > 13) = P(Z > 15) =1— P(Z<1.5) =1— 0.93319 
= 0.06681 


Rather than using Fig. 4-15, the probability can be found from 
the inequality X > 13. That is, 


P(X > 13) = (5 5 a 5 ) 


= 0.06681 


) = P(Z> 155) 


Practical Interpretation: Probabilities for any normal random 
variable can be computed with a simple transform to a stan- 
dard normal random variable. 


In the preceding example, the value 13 is transformed to 1.5 by standardizing, and 1.5 is 
often referred to as the z-value associated with a probability. The following summarizes the 
calculation of probabilities derived from normal random variables. 


X-u 
oO 


Distribution of Z = 


Distribution of X 


10 13 
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Figure 4-15 Standardizing a normal random variable. 


Standardizing 
to Calculate 
a Probability 


with z = (x — p)/o. 


Suppose X is a normal random variable with mean w and variance o”. Then, 


Pisa) = P( < 


where Z is a standard normal random variable, and z = 
obtained by standardizing X. The probability is obtained by using Appendix Table III 


A= fh ae = {il 
=e (4-11) 


)=rz<o 
Aces) 


is the z-value 


EXAMPLE 4-14 Normally Distributed Current 
Continuing the previous example, what is the probability that 
a current measurement is between 9 and 11 milliamperes? 
From Fig. 4-15, or by proceeding algebraically, we have 


PQ <X < 11) = P((9 — 10)/2 < (X — 10)/2 
< (11 — 10)/2) 
= P(-0.5 < Z < 0.5) = P(Z < 0.5) 
— P(Z < —0.5) 
= 0.69146 — 0.30854 = 0.38292 


Determine the value for which the probability that a cur- 
rent measurement is below this value is 0.98. The requested 
value is shown graphically in Fig. 4-16. We need the value of 
x such that P(X < x) = 0.98. By standardizing, this probabil- 
ity expression can be written as 


Figure 4-16 Deter- 
mining the value of x 
to meet a specified 
probability. 


P(X <x) = P((X — 10)/2 < (x — 10)/2) 
= P(Z < (x — 10)/2) 
= 0.98 


Appendix Table III is used to find the z-value such that P(Z < z) = 
0.98. The nearest probability from Table III results in 


P(Z < 2.05) = 0.97982 


Therefore, (x — 10)/2 = 2.05, and the standardizing transfor- 
mation is used in reverse to solve for x. The result is 


x = 2(2.05) + 10 = 14.1 milliamperes 
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, ee N 
Figure 4-17 Deter- Standardized distribution of O45 


mining the value of x 
to meet a specified 
probability. -z 


EXAMPLE 4-15 Signal Detection 
Assume that in the detection of a digital signal the background 
noise follows a normal distribution with a mean of 0 volt and 
standard deviation of 0.45 volt. The system assumes a digital 
1 has been transmitted when the voltage exceeds 0.9. What is 
the probability of detecting a digital 1 when none was sent? 
Let the random variable N denote the voltage of noise. 
The requested probability is 


Ns O2.) = Piz > 2) 
0.45 0.45 
= | — 0.97725 = 0.02275 


P(N > 0.9) = ( 


This probability can be described as the probability of a false 
detection. 

Determine symmetric bounds about 0 that include 99% of 
all noise readings. The question requires us to find x such that 
P(-x < N < x) = 0.99. A graph is shown in Fig. 4-17. Now, 


P(—x <.N <x) = P(—x/0.45 < N/0.45 < x/0.45) 
= P(—x/0.45 < Z < x/0.45) = 0.99 


From Appendix Table II, 
P(-2.58 < Z < 2.58) = 0.99 


EXAMPLE 4-16 Shaft Diameter 
The diameter of a shaft in an optical storage drive is normally 
distributed with mean 0.2508 inch and standard deviation 
0.0005 inch. The specifications on the shaft are 0.2500 + 
0.0015 inch. What proportion of shafts conforms to specifi- 
cations? 

Let X denote the shaft diameter in inches. The requested 
probability is shown in Fig. 4-18 and 


P(0.2485 < X < 0.2515) 
_ (o28° — 0.2508 


0.0005 


= P(-4.6 < Z< 14) = P(Z< 14) 
— P(Z < —4.6) 
= 0.91924 — 0.0000 = 0.91924 


0.2515 — 72508) 
0.0005 


Distribution of N 


Therefore, 
x/0.45 = 2.58 
and 
x = 2.58(0.45) = 1.16 


Suppose that when a digital 1 signal is transmitted, the 
mean of the noise distribution shifts to 1.8 volts. What is the 
probability that a digital 1 is not detected? Let the random vari- 
able S denote the voltage when a digital 1 is transmitted. Then, 


S—18 09-18 
a aed ( 045 ~ 045 ) 
= P(Z < —2) = 0.02275 


This probability can be interpreted as the probability of a 
missed signal. 

Practical Interpretation: Probability calculations such as 
these can be used to quantify the rates of missed signals or 
false signals and to select a threshold to distinguish a zero 
and a one bit. 


Most of the nonconforming shafts are too large, because the 
process mean is located very near to the upper specification 
limit. If the process is centered so that the process mean is 
equal to the target value of 0.2500, 


P(0.2485 < X < 0.2515) 
0.2485 — 0.2500 
~ ( 0.0005 
= P(-3 <Z< 3) 
= P(Z < 3) — P(Z < —3) 
= 0.99865 — 0.00135 
= 0.9973 


0.2515 — 02500) 
0.0005 


Practical Interpretation: By recentering the process, the yield 
is increased to approximately 99.73%. 


f(x) Specifications ————>] 


Figure 4-18 
Distribution for 0.2485 ij 0.2508 
Example 4-16. 0.25 


EXERCISES FOR SECTION 4-6 


4-49. Use Appendix Table III to determine the following 
probabilities for the standard normal random variable Z: 

(a) P(Z < 1.32) (b) P(Z < 3.0) 

(c) P(Z > 1.45) (d) P(Z > —2.15) 

(e) P(—2.34 < Z < 1.76) 

4-50. Use Appendix Table II to determine the following 
probabilities for the standard normal random variable Z: 

(a) P(-1<Z<1) (b) P(-2<Z<2) 

(c) P((-3 <Z<3) (d) P(Z>3) 

(e) PO<Z<1) 

4-51. Assume Z has a standard normal distribution. Use 
Appendix Table III to determine the value for z that solves 
each of the following: 

(a) P?0Z<z) =0.9 

(c) PCZ>z) =0.1 

(e) P(-1.24<Z<z7=0.8 
4-52. Assume Z has a standard normal distribution. Use 
Appendix Table III to determine the value for z that solves 
each of the following: 

(a) P(-z<Z<z) =0.95 


(b) P(Z<z) =0.5 
(d) P(Z>z) = 0.9 


(b) P(-z< Z<z) = 0.99 

(c) P(-z <Z<z) =0.68 (d) P(-z < Z< zg) = 0.9973 
4-53. Assume X is normally distributed with a mean of 10 
and a standard deviation of 2. Determine the following: 

(a) P(X < 13) (b) P(X > 9) 

(c) P6<X<14) (d) P2<X<4) 

(e) P(-2<X <8) 

4-54. Assume X is normally distributed with a mean of 10 
and a standard deviation of 2. Determine the value for x that 
solves each of the following: 

(a) P(X > x) = 0.5 

(b) P(X > x) = 0.95 

(c) Pox < X < 10) = 0.2 

(d) P(-x < X — 10 <x) = 0.95 

(e) P(-x < X — 10 < x) = 0.99 

4-55. Assume X is normally distributed with a mean of 5 
and a standard deviation of 4. Determine the following: 

(a) P(X < 11) (b) P(X > 0) 

(c) PB<X<7) (d) P(-2<X<9) 

(e) P(2<X <8) 

4-56. Assume X is normally distributed with a mean of 5 
and a standard deviation of 4. Determine the value for x that 
solves each of the following: 


0.2515 
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x 


(a) P(X > x) =0.5 (b) P(X > x) = 0.95 

(c) Px<X <9) =0.2 (d) P3B<X<x)=0.95 

(e) P(-x < X —5 <x) = 0.99 

4-57. The compressive strength of samples of cement can 

be modeled by a normal distribution with a mean of 6000 

kilograms per square centimeter and a standard deviation of 

100 kilograms per square centimeter. 

(a) What is the probability that a sample’s strength is less than 
6250 Kg/cm*? 

(b) What is the probability that a sample’s strength is between 
5800 and 5900 Kg/cm*? 

(c) What strength is exceeded by 95% of the samples? 


4-58. The time until recharge for a battery in a laptop com- 

puter under common conditions is normally distributed with a 

mean of 260 minutes and a standard deviation of 50 minutes. 

(a) What is the probability that a battery lasts more than four 
hours? 

(b) What are the quartiles (the 25% and 75% values) of bat- 
tery life? 

(c) What value of life in minutes is exceeded with 95% 
probability? 


4-59. An article in Knee Surgery Sports Traumatol Arthrosc 
[Effect of Provider Volume on Resource Utilization for 
Surgical Procedures” (2005, Vol. 13, pp. 273-279)] showed a 
mean time of 129 minutes and a standard deviation of 14 min- 
utes for ACL reconstruction surgery at high-volume hospitals 

(with more than 300 such surgeries per year). 

(a) What is the probability that your ACL surgery at a high- 
volume hospital requires a time more than two standard 
deviations above the mean? 

(b) What is the probability that your ACL surgery at a high- 
volume hospital is completed in less than 100 minutes? 

(c) The probability of a completed ACL surgery at a high- 
volume hospital is equal to 95% at what time? 

(d) If your surgery requires 199 minutes, what do you conclude 
about the volume of such surgeries at your hospital? Explain. 


4-60. Cholesterol is a fatty substance that is an important part 
of the outer lining (membrane) of cells in the body of animals. 
Its normal range for an adult is 120-240 mg/dl. The Food and 
Nutrition Institute of the Philippines found that the total choles- 
terol level for Filipino adults has a mean of 159.2 mg/dl and 
84.1% of adults have a cholesterol level below 200 mg/dl 
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(http://www.fnri.dost.gov.ph/). Suppose that the total choles- 

terol level is normally distributed. 

(a) Determine the standard deviation of this distribution. 

(b) What are the quartiles (the 25% and 75% values) of this 
distribution? 

(c) What is the value of the cholesterol level that exceeds 90% 
of the population? 

(d) An adult is at moderate risk if cholesterol level is more 
than one but less than two standard deviations above the 
mean. What percentage of the population is at moderate 
risk according to this criterion? 

(e) An adult is thought to be at high risk if his cholesterol 
level is more than two standard deviations above the mean. 
What percentage of the population is at high risk? 

(f) An adult has low risk if cholesterol level is one standard 
deviation or more below the mean. What percentage of the 
population is at low risk? 

4-61. The line width for semiconductor manufacturing is 

assumed to be normally distributed with a mean of 0.5 micro- 

meter and a standard deviation of 0.05 micrometer. 

(a) What is the probability that a line width is greater than 
0.62 micrometer? 

(b) What is the probability that a line width is between 0.47 
and 0.63 micrometer? 

(c) The line width of 90% of samples is below what value? 

4-62. The fill volume of an automated filling machine used 

for filling cans of carbonated beverage is normally distributed 

with a mean of 12.4 fluid ounces and a standard deviation of 

0.1 fluid ounce. 

(a) What is the probability that a fill volume is less than 12 fluid 
ounces? 

(b) If all cans less than 12.1 or greater than 12.6 ounces are 
scrapped, what proportion of cans is scrapped? 

(c) Determine specifications that are symmetric about the 
mean that include 99% of all cans. 

4-63. In the previous exercise, suppose that the mean of the 

filling operation can be adjusted easily, but the standard devi- 

ation remains at 0.1 ounce. 

(a) At what value should the mean be set so that 99.9% of all 
cans exceed 12 ounces? 

(b) At what value should the mean be set so that 99.9% of all 
cans exceed 12 ounces if the standard deviation can be re- 
duced to 0.05 fluid ounce? 


4-64. The reaction time of a driver to visual stimulus is nor- 

mally distributed with a mean of 0.4 seconds and a standard 

deviation of 0.05 seconds. 

(a) What is the probability that a reaction requires more than 
0.5 seconds? 

(b) What is the probability that a reaction requires between 
0.4 and 0.5 seconds? 

(c) What is the reaction time that is exceeded 90% of the time? 

4-65. The speed of a file transfer from a server on campus 

to a personal computer at a student’s home on a weekday 

evening is normally distributed with a mean of 60 kilobits per 

second and a standard deviation of 4 kilobits per second. 


(a) What is the probability that the file will transfer at a speed 
of 70 kilobits per second or more? 

(b) What is the probability that the file will transfer at a speed 
of less than 58 kilobits per second? 

(c) If the file is 1 megabyte, what is the average time it will 
take to transfer the file? (Assume eight bits per byte.) 
4-66. The average height of a woman aged 20-74 years is 64 
inches in 2002, with an increase of approximately one inch from 
1960 (http://usgovinfo.about.com/od/healthcare). Suppose the 
height of a woman is normally distributed with a standard de- 

viation of 2 inches. 

(a) What is the probability that a randomly selected woman in 
this population is between 58 inches and 70 inches? 

(b) What are the quartiles of this distribution? 

(c) Determine the height that is symmetric about the mean 
that includes 90% of this population. 

(d) What is the probability that five women selected at ran- 
dom from this population all exceed 68 inches? 


4-67. Inanaccelerator center, an experiment needs a 1.41-cm- 

thick aluminum cylinder (http://puhep1.princeton.edu/mumu/ 

target/Solenoid_Coil.pdf). Suppose that the thickness of a 

cylinder has a normal distribution with a mean of 1.41 cm and 

a standard deviation of 0.01 cm. 

(a) What is the probability that a thickness is greater than 
1.42 cm? 

(b) What thickness is exceeded by 95% of the samples? 

(c) If the specifications require that the thickness is between 
1.39 cm and 1.43 cm, what proportion of the samples meet 
specifications? 

4-68. The demand for water use in Phoenix in 2003 hit a 

high of about 442 million gallons per day on June 27, 2003 

(http://phoenix.gov/WATER/wtrfacts.html). Water use in the 

summer is normally distributed with a mean of 310 million 

gallons per day and a standard deviation of 45 million gallons 
per day. City reservoirs have a combined storage capacity of 
nearly 350 million gallons. 

(a) What is the probability that a day requires more water than 
is stored in city reservoirs? 

(b) What reservoir capacity is needed so that the probability 
that it is exceeded is 1%? 

(c) What amount of water use is exceeded with 95% proba- 
bility? 

(d) Water is provided to approximately 1.4 million people. 
What is the mean daily consumption per person at which 
the probability that the demand exceeds the current reser- 
voir capacity is 1%? Assume that the standard deviation of 
demand remains the same. 

4-69. The life of a semiconductor laser at a constant power 

is normally distributed with a mean of 7000 hours and a stan- 

dard deviation of 600 hours. 

(a) What is the probability that a laser fails before 5000 hours? 

(b) What is the life in hours that 95% of the lasers exceed? 

(c) If three lasers are used in a product and they are assumed 
to fail independently, what is the probability that all three 
are still operating after 7000 hours? 
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4-70. The diameter of the dot produced by a printer is nor- 

mally distributed with a mean diameter of 0.002 inch and a 

standard deviation of 0.0004 inch. 

(a) What is the probability that the diameter of a dot exceeds 
0.0026 inch? 

(b) What is the probability that a diameter is between 0.0014 
and 0.0026 inch? 

(c) What standard deviation of diameters is needed so that the 
probability in part (b) is 0.995? 

4-71. The weight of a sophisticated running shoe is nor- 

mally distributed with a mean of 12 ounces and a standard 

deviation of 0.5 ounce. 

(a) What is the probability that a shoe weighs more than 13 
ounces? 

(b) What must the standard deviation of weight be in order for 
the company to state that 99.9% of its shoes are less than 
13 ounces? 

(c) If the standard deviation remains at 0.5 ounce, what must 
the mean weight be in order for the company to state that 
99.9% of its shoes are less than 13 ounces? 


4-72. Measurement error that is normally distributed with a 
mean of zero and a standard deviation of 0.5 gram is added to 
the true weight of a sample. Then the measurement is rounded 
to the nearest gram. Suppose that the true weight of a sample 
is 165.5 grams. 

(a) What is the probability that the rounded result is 167 grams? 
(b) What is the probability that the rounded result is 167 

grams or greater? 

4-73. Assume that a random variable is normally distributed 
with a mean of 24 and a standard deviation of 2. Consider an 
interval of length one unit that starts at the value a so that the 
interval is [a, a + 1]. For what value of a is the probability of 
the interval greatest? Does the standard deviation affect that 
choice of interval? 


4-74. A study by Bechtel, et al., 2009, in the Archives of 
Environmental & Occupational Health considered polycyclic 
aromatic hydrocarbons and immune system function in beef 
cattle. Some cattle were near major oil- and gas-producing areas 


of western Canada. The mean monthly exposure to PM1.0 (par- 
ticulate matter that is < 1 jm in diameter) was approximately 
7.1 yg/m? with standard deviation 1.5. Assume the monthly 
exposure is normally distributed. 


(a) What is the probability of a monthly exposure greater than 
9 pg/m*? 

(b) What is the probability of a monthly exposure between 3 
and 8 g/m*? 

(c) What is the monthly exposure level that is exceeded with 
probability 0.05? 

(d) What value of mean monthly exposure is needed so that 
the probability of a monthly exposure greater than 9 
wg/m? is 0.01? 

4-75. Anarticle under review for Air Quality, Atmosphere & 
Health titled “Linking Particulate Matter (PM10) and 
Childhood Asthma in Central Phoenix” used PM 10 (particu- 
late matter < 10 wm in diameter) air quality data measured 
hourly from sensors in Phoenix, Arizona. The 24-hour (daily) 
mean PM10 for a centrally located sensor was 50.9 yg/m? 
with a standard deviation of 25.0. Assume that the daily mean 
of PM10 is normally distributed. 

(a) What is the probability of a daily mean of PM10 greater 
than 100 g/m?? 

(b) What is the probability of a daily mean of PM10 less than 
25 wg/m*? 

(c) What daily mean of PM10 value is exceeded with proba- 
bility 5%? 

4-76. The length of stay at a specific emergency department 

in Phoenix, Arizona, in 2009 had a mean of 4.6 hours with a 

standard deviation of 2.9. Assume that the length of stay is 

normally distributed. 

(a) What is the probability of a length of stay greater than 
10 hours? 

(b) What length of stay is exceeded by 25% of the visits? 

(c) From the normally distributed model, what is the proba- 
bility of a length of stay less than zero hours? Comment on 
the normally distributed assumption in this example. 


4-7 NORMAL APPROXIMATION TO THE BINOMIAL 


AND POISSON DISTRIBUTIONS 


We began our section on the normal distribution with the central limit theorem and the normal 
distribution as an approximation to a random variable with a large number of trials. 
Consequently, it should not be a surprise to learn that the normal distribution can be used to 
approximate binomial probabilities for cases in which n is large. The following example 
illustrates that for many physical systems the binomial model is appropriate with an extremely 
large value for n. In these cases, it is difficult to calculate probabilities by using the binomial 
distribution. Fortunately, the normal approximation is most effective in these cases. An 
illustration is provided in Fig. 4-19. The area of each bar equals the binomial probability of x. 
Notice that the area of bars can be approximated by areas under the normal density function. 

From Fig. 4-19 it can be seen that a probability such as P(3 = X = 7) is better approxi- 
mated by the area under the normal curve from 2.5 to 7.5. This observation provides a method 
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binomial distribution. 


to approximate binomial probabilities. Because a continuous normal distribution is used to 
approximate a discrete binomial distribution, the modification is referred to as a continuity 


correction. 


EXAMPLE 4-17 
In a digital communication channel, assume that the number 
of bits received in error can be modeled by a binomial random 
variable, and assume that the probability that a bit is received 
in error is 1 X 107°. If 16 million bits are transmitted, what is 
the probability that 150 or fewer errors occur? 

Let the random variable X denote the number of errors. 
Then X is a binomial random variable and 


— (Gas (107->)*(1 a 10~>)16,000,000-x 
x 


P(X =150) = >} 
x=0 
Practical Interpretation: Clearly this probability is 
difficult to compute. Fortunately, the normal distribution 
can be used to provide an excellent approximation in this 
example. 


Normal 
Approximation If X is a binomial random variable with parameters n and p, 
to the Binomial 
Distribution Pelee Ue (4-12) 
np(1 — p) 


and 


is approximately a standard normal random variable. To approximate a binomial 
probability with a normal distribution, a continuity correction is applied as follows: 


P(X Sx) = P(X S$x+ 0.5) = (z 


= 05 — 
Ps xX) = Pe 05 =x) = (3 bz) 


The approximation is good for np > 5 and n(1 — p) > 5. 


#405 — P| 
Vip(1 — p) 


V np(1 — p) 
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Recall that for a binomial variable X, E(X) = np and V(X) = np(1 — p). Consequently, the 
expression in Equation 4-12 is nothing more than the formula for standardizing the random 
variable X. Probabilities involving X can be approximated by using a standard normal 
distribution. The approximation is good when zn is large relative to p. 

A way to remember the approximation is to write the probability in terms of = or = and 
then add or subtract the 0.5 correction factor to make the probability greater. 


EXAMPLE 4-18 
The digital communication problem in the previous example Because np = (16 X 10°)(1 X 107°) = 160 and n(1 — p) is 
is solved as follows: much larger, the approximation is expected to work well in 
this case. 

P(X = 150) = P(X = 150.5) Practical Interpretation: Binomial probabilities that are 
X — 160 150.5 — 160 difficult to compute exactly can be approximated with easy to 

/ (1 — 1075) = V1 60(1 — 107) compute probabilities based on the normal distribution 

160(1 — = . 


= P(Z = —0.75) = 0.227 


EXAMPLE 4-19 Normal Approximation to Binomial 
Again consider the transmission of bits in Example 4-18. To 
judge how well the normal approximation works, assume only 
n = 50 bits are to be transmitted and that the probability of an 
error is p = 0.1. The exact probability that 2 or less errors 


8:5°— 5 
PO =X) = PBS =x) ~P( rep <2) 


= P(1.65 < Z) = 0.05 


occur is 
We can even approximate P(X = 5) = P(5 = X = 5)as 
50 50 50 49 
P(X = 2)= (os +(P)o109 ) P(5 =X <5) =P(45 5X <555) 
(|, oe ne ae 2 (Sof <728 5) 
+ (Pos (0.9*°) = 0.112 a2 ~~ 21D 
= P(—0.24 = Z S$ 0.24) = 0.19 
Based on the normal approximation, = P(4.5 <X <5.5) 
P(X<2)=P ( x—5 < 2.5 — 5 ) and this compares well with the exact answer of 0.1849. 
V50(0.1)(0.9) — V50(0.1)(0.9) Practical Interpretation: Even for a sample as small as 


= P(Z < —1.18) = 0.119 50 bits, the normal approximation is reasonable, when p = 0.1. 


As another example, P(8 < X) = P(9 S X) and this is better 
approximated as 


The correction factor is used to improve the approximation. However, if np or n(1 — p) is 
small, the binomial distribution is quite skewed and the symmetric normal distribution is not 
a good approximation. Two cases are illustrated in Fig. 4-20. 

Recall that the binomial distribution is a satisfactory approximation to the hypergeomet- 
ric distribution when n, the sample size, is small relative to N, the size of the population from 
which the sample is selected. A rule of thumb is that the binomial approximation is effective 
if n/N < 0.1. Recall that for a hypergeometric distribution, p is defined as p = K/N. That 
is, p is interpreted as the number of successes in the population. Therefore, the normal 
distribution can provide an effective approximation of hypergeometric probabilities when n/N < 
0.1, np > 5, and n(1 — p) > 5. Figure 4-21 provides a summary of these guidelines. 
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Recall that the Poisson distribution was developed as the limit of a binomial distribution as 


the number of trials increased to infinity. Consequently, it should not be surprising to find that the 
normal distribution can also be used to approximate probabilities of a Poisson random variable. 


Normal 


Approximation If X is a Poisson random variable with E(X) = A and V(X) = A, 
to the Poisson 


Distribution poe (4-13) 
Vi 


is approximately a standard normal random variable. The same continuity correction 
used for the binomial distribution can also be applied. The approximation is good for 


A>5 
EXAMPLE 4-20 Normal Approximation to Poisson 
Assume that the number of asbestos particles in a squared 950.5 — 1000 
meter of dust on a surface follows a Poisson distribution with P(X = 950) = P(X = 950.5) ~ P| Z= 1000 
a mean of 1000. If a squared meter of dust is analyzed, what is = P(Z < 1.57) = 0.058 
the probability that 950 or fewer particles are found? 
This probability can be expressed exactly as Practical Interpretation: Poisson probabilities that are difficult 


to compute exactly can be approximated with easy to compute 


350 e~ 1009 1. 900* probabilities based on the normal distribution. 


P(X = 950) = > 


x=0 x! 


The computational difficulty is clear. The probability can be 
approximated as 


hypergometric od binomial = normal 
distribution ae 01 distribution np > 5 distribution 
N n(l — p)>5 
Figure 4-21 Conditions for approximating hypergeometric and binomial probabilities. 
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EXERCISES FOR SECTION 4-7 


4-77. Suppose that X is a binomial random variable with 

n = 200 and p = 0.4. 

(a) Approximate the probability that X is less than or equal 
to 70. 

(b) Approximate the probability that X is greater than 70 and 
less than 90. 

(c) Approximate the probability that X = 80. 


4-78. 

N= 6. 

(a) Compute the exact probability that X is less than 4. 

(b) Approximate the probability that X is less than 4 and com- 
pare to the result in part (a). 

(c) Approximate the probability that 8 < X < 12. 


4-79. Suppose that X has a Poisson distribution with a mean 
of 64. Approximate the following probabilities: 

(a) P(X >72) 

(b) P(X < 64) 

(c) P(60 < X = 68) 


Suppose that X is a Poisson random variable with 


4-80. The manufacturing of semiconductor chips produces 

2% defective chips. Assume the chips are independent and 

that a lot contains 1000 chips. 

(a) Approximate the probability that more than 25 chips are 
defective. 

(b) Approximate the probability that between 20 and 30 chips 
are defective. 


4-81. There were 49.7 million people with some type of 

long-lasting condition or disability living in the United 

States in 2000. This represented 19.3 percent of the majority 

of civilians aged five and over (http://factfinder. census.gov). 

A sample of 1000 persons is selected at random. 

(a) Approximate the probability that more than 200 persons 
in the sample have a disability. 

(b) Approximate the probability that between 180 and 300 
people in the sample have a disability. 


4-82. Phoenix water is provided to approximately 1.4 million 

people, who are served through more than 362,000 accounts 

(http://phoenix.gov/WATER/wtrfacts.html). All accounts are 

metered and billed monthly. The probability that an account 

has an error in a month is 0.001, and accounts can be assumed 

to be independent. 

(a) What is the mean and standard deviation of the number of 
account errors each month? 

(b) Approximate the probability of fewer than 350 errors in a 
month. 

(c) Approximate a value so that the probability that the num- 
ber of errors exceeds this value is 0.05. 

(d) Approximate the probability of more than 400 errors per 
month in the next two months. Assume that results be- 
tween months are independent. 


4-83. Anelectronic office product contains 5000 electronic 
components. Assume that the probability that each 
component operates without failure during the useful life of 


the product is 0.999, and assume that the components fail 
independently. Approximate the probability that 10 or more 
of the original 5000 components fail during the useful life of 
the product. 


4-84. A corporate Web site contains errors on 50 of 1000 
pages. If 100 pages are sampled randomly, without replace- 
ment, approximate the probability that at least 1 of the pages 
in error is in the sample. 


4-85. Suppose that the number of asbestos particles in a 
sample of 1 squared centimeter of dust is a Poisson random 
variable with a mean of 1000. What is the probability that 10 
squared centimeters of dust contains more than 10,000 
particles? 


4-86. A high-volume printer produces minor print-quality 

errors on a test pattern of 1000 pages of text according to a 

Poisson distribution with a mean of 0.4 per page. 

(a) Why are the numbers of errors on each page independent 
random variables? 

(b) What is the mean number of pages with errors (one or more)? 

(c) Approximate the probability that more than 350 pages 
contain errors (one or more). 


4-87. Hits to a high-volume Web site are assumed to follow 

a Poisson distribution with a mean of 10,000 per day. 

Approximate each of the following: 

(a) The probability of more than 20,000 hits in a day 

(b) The probability of less than 9900 hits in a day 

(c) The value such that the probability that the number of hits 
in a day exceeds the value is 0.01 

(d) Approximate the expected number of days in a year (365 
days) that exceed 10,200 hits. 

(e) Approximate the probability that over a year (365 days) 
more than 15 days each have more than 10,200 hits. 


4-88. An acticle in Biometrics [Integrative Analysis 
of Transcriptomic and Proteomic Data of Desulfovibrio 
vulgaris: A Nonlinear Model to Predict Abundance of 
Undetected Proteins” (2009)] found that protein abundance 
from an operon (a set of biologically related genes) was less 
dispersed than from randomly selected genes. In the research, 
1000 sets of genes were randomly constructed and 75% of 
these sets were more disperse than a specific opteron. If the 
probability that a random set is more disperse than this 
opteron is truly 0.5, approximate the probability that 750 or 
more random sets exceed the opteron. From this result, what 
do you conclude about the dispersion in the opteron versus 
random genes? 


4-89. An article under review for Air Quality, Atmosphere 
& Health titled “Linking Particulate Matter (PM10) and 
Childhood Asthma in Central Phoenix” linked air quality to 
childhood asthma incidents. The study region in central 
Phoenix, Arizona, recorded 10,500 asthma incidents in chil- 
dren in a 21-month period. Assume that the number of asthma 
incidents follows a Poisson distribution. 
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(a) Approximate the probability of more than 550 asthma (d) If the number of asthma incidents were greater during the 


incidents in a month. 


winter than the summer, what would this imply about the 


(b) Approximate the probability of 450 to 550 asthma inci- Poisson distribution assumption?. 


dents in a month. 


(c) Approximate the number of asthma incidents exceeded 


with probability 5%. 


4-8 EXPONENTIAL DISTRIBUTION 


Exponential 
Distribution 


The discussion of the Poisson distribution defined a random variable to be the number of 
flaws along a length of copper wire. The distance between flaws is another random variable 
that is often of interest. Let the random variable X denote the length from any starting point on 
the wire until a flaw is detected. As you might expect, the distribution of X can be obtained 
from knowledge of the distribution of the number of flaws. The key to the relationship is the 
following concept. The distance to the first flaw exceeds 3 millimeters if and only if there are 
no flaws within a length of 3 millimeters—simple, but sufficient for an analysis of the distri- 
bution of X. 

In general, let the random variable N denote the number of flaws in x millimeters of wire. 
If the mean number of flaws is A per millimeter, NV has a Poisson distribution with mean Ax. 
We assume that the wire is longer than the value of x. Now, 


e (Ax)? 7 
P(X > x) = P(N = 0) = 01 =e 


Therefore, 


is the cumulative distribution function of X. By differentiating F(x), the probability density 
function of X is calculated to be 


f(x) =Ae™*, x=0 


The derivation of the distribution of X depends only on the assumption that the flaws in 
the wire follow a Poisson process. Also, the starting point for measuring X doesn’t matter 
because the probability of the number of flaws in an interval of a Poisson process depends 
only on the length of the interval, not on the location. For any Poisson process, the following 
general result applies. 


The random variable X that equals the distance between successive events of a Poisson 
process with mean number of events A > 0 per unit interval is an exponential 
random variable with parameter A. The probability density function of X is 


ola Ne ton Ol== ge (4-14) 
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Figure 4-22 

Probability density 

function of exponential 

random variables for 0 D) 4 6 8 10 12 
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The exponential distribution obtains its name from the exponential function in the proba- 
bility density function. Plots of the exponential distribution for selected values of A are shown 
in Fig. 4-22. For any value of 4, the exponential distribution is quite skewed. The following 
results are easily obtained and are left as an exercise. 


Mean and 
Variance If the random variable X has an exponential distribution with parameter A, 
1 2 1 
p = E(X) = ry and of = V(X) = 2 (4-15) 


It is important to use consistent units in the calculation of probabilities, means, and variances 
involving exponential random variables. The following example illustrates unit conversions. 


EXAMPLE 4-21 Computer Usage 

In a large corporate computer network, user log-ons to the | 

system can be modeled as a Poisson process with a mean of P(X > 0.1) = | 2567 2* dy = 0725-1) = 9,082 
25 log-ons per hour. What is the probability that there are no Fi 

log-ons in an interval of 6 minutes? 


00 


.1 


Let X denote the time in hours from the start of the interval Also, the cumulative distribution function can be used to obtain 
until the first log-on. Then, X has an exponential distribution the same result as follows: 
with A = 25 log-ons per hour. We are interested in the proba- 
bility that X exceeds 6 minutes. Because A is given in log-ons P(X > 0.1) = 1 — F(O.1) = 7 O00 
per hour, we express all time units in hours. That is, 6 minutes = 
0.1 hour. The probability requested is shown as the shaded area An identical answer is obtained by expressing the mean 


under the probability density function in Fig. 4-23. Therefore, number of log-ons as 0.417 log-ons per minute and computing 
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Figure 4-23 
Probability for the 
exponential distribu- 
tion in Example 4-21. 0.1 x 
the probability that the time until the next log-on exceeds P(X > x) =e? = 0.90 


6 minutes. Try it. 
What is the probability that the time until the next log-on Take the (natural) log of both sides to obtain —25x = In(0.90) = 


is between 2 and 3 minutes? Upon converting all units to —0.1054. Therefore 
hours, : 
0.05 x = 0.00421 hour = 0.25 minute 
P(0.033 < X < 0.05) = | 25e>* dx 
Furthermore, the mean time until the next log-on is 


0.033 


= 0.152 p = 1/25 = 0.04 hour = 2.4 minutes 


: ee The standard deviation of the time until the next log-on is 
An alternative solution is 


= 1/25h = 2.4 minut 
P(0.033 < X < 0.05) = F(0.05) — F(0.033) = 0.152 a= a aes pene 
Practical Interpretation: Probabilities for exponential 
random variables are widely used by organization to evaluate 
resources and staffing levels to meet customer service needs. 


Determine the interval of time such that the probability 
that no log-on occurs in the interval is 0.90. The question asks 
for the length of time x such that P(X > x) = 0.90. Now, 


In the previous example, the probability that there are no log-ons in a 6-minute interval is 
0.082 regardless of the starting time of the interval. A Poisson process assumes that events oc- 
cur uniformly throughout the interval of observation; that is, there is no clustering of events. 
If the log-ons are well modeled by a Poisson process, the probability that the first log-on after 
noon occurs after 12:06 P.M. is the same as the probability that the first log-on after 3:00 PM. 
occurs after 3:06 PM. And if someone logs on at 2:22 P.M., the probability that the next log-on 
occurs after 2:28 P.M. is still 0.082. 

Our starting point for observing the system does not matter. However, if there are high- 
use periods during the day, such as right after 8:00 A.M., followed by a period of low use, a 
Poisson process is not an appropriate model for log-ons and the distribution is not appropriate 
for computing probabilities. It might be reasonable to model each of the high- and low-use 
periods by a separate Poisson process, employing a larger value for A during the high-use peri- 
ods and a smaller value otherwise. Then, an exponential distribution with the corresponding 
value of A can be used to calculate log-on probabilities for the high- and low-use periods. 


Lack of Memory Property 
An even more interesting property of an exponential random variable is concerned with con- 
ditional probabilities. 


EXAMPLE 4-22 Lack of Memory Property 
Let X denote the time between detections of a particle with a 
Geiger counter and assume that X has an exponential distribu- 
tion with E(X ) = 1.4 minutes. The probability that we detect 
a particle within 30 seconds of starting the counter is 


P(X < 0.5 minute) = F(0.5) = 1 — e°°/!4 = 0.30 


In this calculation, all units are converted to minutes. Now, sup- 
pose we turn on the Geiger counter and wait 3 minutes without 
detecting a particle. What is the probability that a particle is de- 
tected in the next 30 seconds? 

Because we have already been waiting for 3 minutes, we 
feel that we are “due.” That is, the probability of a detection 
in the next 30 seconds should be greater than 0.3. However, 
for an exponential distribution, this is not true. The requested 
probability can be expressed as the conditional probability 
that P(X < 3.5|X > 3). From the definition of conditional 
probability, 
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P(X < 3.5|X > 3) = P(3 < X < 3.5)/P(X > 3) 


where 
P(3 < X < 3.5) = F(3.5) — F(3) 
= [1 — e354) - [1 — e 3/4] = 0.035 
and 
P(X > 3) =1- F(3)=e3%!4=0.117 
Therefore, 


P(X < 3.5|X > 3) = 0.035/0.117 = 0.30 


Practical Interpretation: After waiting for 3 minutes without a 
detection, the probability of a detection in the next 30 seconds 
is the same as the probability of a detection in the 30 seconds 
immediately after starting the counter. The fact that you have 
waited 3 minutes without a detection does not change the 
probability of a detection in the next 30 seconds. 


Example 4-22 illustrates the lack of memory property of an exponential random vari- 
able, and a general statement of the property follows. In fact, the exponential distribution is the 
only continuous distribution with this property. 


Lack of 
Memory 
Property 


For an exponential random variable X, 


Px bk = 4 — 


P(X < ty) (4-16) 


Figure 4-24 graphically illustrates the lack of memory property. The area of region A divided 
by the total area under the probability density function (A + B + C + D = 1) equals 
P(X < ty). The area of region C divided by the areaC + Dequals P(X < t, + t)|X > t,). The 
lack of memory property implies that the proportion of the total area that is in A equals the 
proportion of the area in C and D that is in C. The mathematical verification of the lack of 
memory property is left as a mind-expanding exercise. 

The lack of memory property is not that surprising when you consider the development 
of a Poisson process. In that development, we assumed that an interval could be partitioned 
into small intervals that were independent. These subintervals are similar to independent 
Bernoulli trials that comprise a binomial experiment; knowledge of previous results does not 


f(x) 


Figure 4-24 Lack of A 

memory property of B 
an exponential c D 
distribution. t ty 


x 
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affect the probabilities of events in future subintervals. An exponential random variable is the 
continuous analog of a geometric random variable, and they share a similar lack of memory 
property. 

The exponential distribution is often used in reliability studies as the model for the time 
until failure of a device. For example, the lifetime of a semiconductor chip might be modeled as 
an exponential random variable with a mean of 40,000 hours. The lack of memory property of 
the exponential distribution implies that the device does not wear out. That is, regardless of how 
long the device has been operating, the probability of a failure in the next 1000 hours is the 
same as the probability of a failure in the first 1000 hours of operation. The lifetime L of a de- 
vice with failures caused by random shocks might be appropriately modeled as an exponential 
random variable. However, the lifetime L of a device that suffers slow mechanical wear, such as 
bearing wear, is better modeled by a distribution such that P(L < t + Ar|L > t) increases with 
t. Distributions such as the Weibull distribution are often used, in practice, to model the failure 


time of this type of device. The Weibull distribution is presented in a later section. 


EXERCISES FOR SECTION 4-8 


4-90. Suppose X has an exponential distribution with \ = 2. 
Determine the following: 

(a) P(X <0) (b) P(X = 2) 

(c) (X=1) (d) PL <X<2) 

(e) Find the value of x such that P(X < x) = 0.05. 


4-91. Suppose X has an exponential distribution with mean 
equal to 10. Determine the following: 

(a) P(X > 10) (b) P(X > 20) 

(c) P(X < 30) 

(d) Find the value of x such that P(X < x) = 0.95. 


4-92. Suppose X has an exponential distribution with a 

mean of 10. Determine the following: 

(a) P(X <5) 

(b) P(X < 15|X > 10) 

(c) Compare the results in parts (a) and (b) and comment on 
the role of the lack of memory property. 


4-93. Suppose the counts recorded by a Geiger counter follow 

a Poisson process with an average of two counts per minute. 

(a) What is the probability that there are no counts in a 
30-second interval? 

(b) What is the probability that the first count occurs in less 
than 10 seconds? 

(c) What is the probability that the first count occurs between 
1 and 2 minutes after start-up? 


4-94. Suppose that the log-ons to a computer network fol- 

low a Poisson process with an average of 3 counts per minute. 

(a) What is the mean time between counts? 

(b) What is the standard deviation of the time between counts? 

(c) Determine x such that the probability that at least one 
count occurs before time x minutes is 0.95. 


4-95. The time between calls to a plumbing supply business 
is exponentially distributed with a mean time between calls of 
15 minutes. 


(a) What is the probability that there are no calls within a 
30-minute interval? 

(b) What is the probability that at least one call arrives within 
a 10-minute interval? 

(c) What is the probability that the first call arrives within 5 
and 10 minutes after opening? 

(d) Determine the length of an interval of time such that the 
probability of at least one call in the interval is 0.90. 


4-96. The life of automobile voltage regulators has an expo- 

nential distribution with a mean life of six years. You purchase 

an automobile that is six years old, with a working voltage 

regulator, and plan to own it for six years. 

(a) What is the probability that the voltage regulator fails 
during your ownership? 

(b) If your regulator fails after you own the automobile three 
years and it is replaced, what is the mean time until the 
next failure? 


4-97. Suppose that the time to failure (in hours) of fans in a 
personal computer can be modeled by an exponential distribu- 
tion with A = 0.0003. 

(a) What proportion of the fans will last at least 10,000 hours? 
(b) What proportion of the fans will last at most 7000 hours? 


4-98. The time between the arrival of electronic messages 
at your computer is exponentially distributed with a mean of 
two hours. 

(a) What is the probability that you do not receive a message 
during a two-hour period? 

(b) If you have not had a message in the last four hours, what 
is the probability that you do not receive a message in the 
next two hours? 

(c) What is the expected time between your fifth and sixth 
messages? 


4-99. The time between arrivals of taxis at a busy intersec- 
tion is exponentially distributed with a mean of 10 minutes. 


(a) What is the probability that you wait longer than one hour 
for a taxi? 

(b) Suppose you have already been waiting for one hour for a 
taxi. What is the probability that one arrives within the 
next 10 minutes? 

(c) Determine x such that the probability that you wait more 
than x minutes is 0.10. 

(d) Determine x such that the probability that you wait less 
than x minutes is 0.90. 

(e) Determine x such that the probability that you wait less 
than x minutes is 0.50. 


4-100. The number of stork sightings on a route in South 

Carolina follows a Poisson process with a mean of 2.3 per 

year. 

(a) What is the mean time between sightings? 

(b) What is the probability that there are no sightings within 
three months (0.25 years)? 

(c) What is the probability that the time until the first sighting 
exceeds six months? 

(d) What is the probability of no sighting within three years? 


4-101. According to results from the analysis of chocolate 

bars in Chapter 3, the mean number of insect fragments was 

14.4 in 225 grams. Assume the number of fragments follows a 

Poisson distribution. 

(a) What is the mean number of grams of chocolate until a 
fragment is detected? 

(b) What is the probability that there are no fragments in a 
28.35-gram (one-ounce) chocolate bar? 

(c) Suppose you consume seven one-ounce (28.35-gram) bars 
this week. What is the probability of no insect fragments? 


4-102. The distance between major cracks in a highway fol- 

lows an exponential distribution with a mean of 5 miles. 

(a) What is the probability that there are no major cracks in a 
10-mile stretch of the highway? 

(b) What is the probability that there are two major cracks in 
a 10-mile stretch of the highway? 

(c) What is the standard deviation of the distance between 
major cracks? 

(d) What is the probability that the first major crack occurs 
between 12 and 15 miles of the start of inspection? 

(e) What is the probability that there are no major cracks in 
two separate 5-mile stretches of the highway? 

(f) Given that there are no cracks in the first 5 miles in- 
spected, what is the probability that there are no major 
cracks in the next 10 miles inspected? 


4-103. The lifetime of a mechanical assembly in a vibration 

test is exponentially distributed with a mean of 400 hours. 

(a) What is the probability that an assembly on test fails in 
less than 100 hours? 

(b) What is the probability that an assembly operates for more 
than 500 hours before failure? 

(c) If an assembly has been on test for 400 hours without a fail- 
ure, what is the probability of a failure in the next 100 hours? 
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(d) If 10 assemblies are tested, what is the probability that 
at least one fails in less than 100 hours? Assume that the 
assemblies fail independently. 

(e) If 10 assemblies are tested, what is the probability that all 
have failed by 800 hours? Assume the assemblies fail 
independently. 


4-104. The time between arrivals of small aircraft at a 
county airport is exponentially distributed with a mean of one 
hour. 

(a) What is the probability that more than three aircraft arrive 
within an hour? 

(b) If 30 separate one-hour intervals are chosen, what is the 
probability that no interval contains more than three 
arrivals? 

(c) Determine the length of an interval of time (in hours) such 
that the probability that no arrivals occur during the inter- 
val is 0.10. 


4-105. The time between calls to a corporate office is expo- 

nentially distributed with a mean of 10 minutes. 

(a) What is the probability that there are more than three calls 
in one-half hour? 

(b) What is the probability that there are no calls within one- 
half hour? 

(c) Determine x such that the probability that there are no 
calls within x hours is 0.01. 

(d) What is the probability that there are no calls within a two- 
hour interval? 

(e) If four nonoverlapping one-half-hour intervals are 
selected, what is the probability that none of these intervals 
contains any call? 

(f ) Explain the relationship between the results in part (a) and (b). 


4-106. Assume that the flaws along a magnetic tape follow 
a Poisson distribution with a mean of 0.2 flaw per meter. Let 
X denote the distance between two successive flaws. 

(a) What is the mean of X? 

(b) What is the probability that there are no flaws in 10 con- 
secutive meters of tape? 

(c) Does your answer to part (b) change if the 10 meters are 
not consecutive? 

(d) How many meters of tape need to be inspected so that the 
probability that at least one flaw is found is 90%? 

(e) What is the probability that the first time the distance 
between two flaws exceeds 8 meters is at the fifth 
flaw? 

(f) What is the mean number of flaws before a distance 
between two flaws exceeds 8 meters? 


4-107. Ifthe random variable X has an exponential distribu- 
tion with mean 6, determine the following: 

(a) P(X > 8) (b) P(X > 26) 

(c) P(X > 36) 

(d) How do the results depend on 6? 

4-108. Derive the formula for the mean and variance of an 
exponential random variable. 
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4-109. Web crawlers need to estimate the frequency of 

changes to Web sites to maintain a current index for Web 

searches. Assume that the changes to a Web site follow a Poisson 

process with a mean of 3.5 days. 

(a) What is the probability that the next change occurs in less 
than two days? 

(b) What is the probability that the next change occurs in 
greater than seven days? 

(c) What is the time of the next change that is exceeded with 
probability 90%? 


(d) What is the probability that the next change occurs in less 
than 10 days, given that it has not yet occurred after three 
days? 


4-110. The length of stay at a specific emergency depart- 

ment in Phoenix, Arizona, had a mean of 4.6 hours. Assume 

that the length of stay is exponentially distributed. 

(a) What is the standard deviation of the length of stay? 

(b) What is the probability of a length of stay greater than 
10 hours? 

(c) What length of stay is exceeded by 25% of the visits? 


4-9 ERLANG AND GAMMA DISTRIBUTIONS 


An exponential random variable describes the length until the first count is obtained in a 
Poisson process. A generalization of the exponential distribution is the length until 7 counts 
occur in a Poisson process. Consider the following example. 


EXAMPLE 4-23 Processor Failure 
The failures of the central processor units of large computer 
systems are often modeled as a Poisson process. Typically, 
failures are not caused by components wearing out, but by 
more random failures of the large number of semiconductor 
circuits in the units. Assume that the units that fail are imme- 
diately repaired, and assume that the mean number of failures 
per hour is 0.0001. Let X denote the time until four failures 
occur in a system. Determine the probability that X exceeds 
40,000 hours. 

Let the random variable N denote the number of failures 
in 40,000 hours of operation. The time until four failures occur 


exceeds 40,000 hours if and only if the number of failures in 
40,000 hours is three or less. Therefore, 


P(X > 40,000) = P(N = 3) 


The assumption that the failures follow a Poisson process im- 
plies that N has a Poisson distribution with 


E(N) = 40,000(0.0001) = 4 failures per 40,000 hours 


Therefore, 


P(X > 40,000) = P(N <3) = 


The previous example can be generalized to show that if X is the time until the rth event in a 


Poisson process, then 


Because P(X > x) = 1 — F(x), the probability density function of X equals the negative of 
the derivative of the right-hand side of the previous equation. After extensive algebraic sim- 
plification, the probability density function of X can be shown to equal 


f(x) 


Nx" 


1,-Ax 


forx > Oandr=1,2,.... 


This probability density function defines an Erlang distribution. Clearly, an Erlang random 
variable with r = 1 is an exponential random variable. 

It is convenient to generalize the Erlang distribution to allow 7 to assume any nonnegative 
value. Then the Erlang and some other common distributions become special cases of this 
generalized distribution. To accomplish this step, the factorial function (r — 1)! has to be gen- 
eralized to apply to any nonnegative value of 7, but the generalized function should still equal 


Q! when r is a positive integer. 
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The gamma function is 


ah= [et dx, forr>0 (4-17) 
0 


It can be shown that the integral in the definition of I(r) is finite. Furthermore, by using integra- 
tion by parts it can be shown that 


T(r) 


(r — I(r - 1) 

This result is left as an exercise. Therefore, if r is a positive integer (as in the Erlang distribution), 
T(r) 

1 and it can be shown that I'(1/2) = a”. The gamma function can be 


interpreted as a generalization to noninteger values of r of the term that is used in the Erlang 
probability density function. Now the Erlang distribution can be generalized. 


=(r— 1)! 


Also, [(1) = 0! = 


Gamma 
Distribution 


The random variable X with probability density function 


Nxt be Ax 
ae > = 
f(x) To for x > 0 (4-18) 


has a gamma random variable with parameters A > 0 andr > 0. If ris an integer, 


X has an Erlang distribution. 


The parameters A and r are often called the scale and shape parameters, respectively. 

However, one should check the definitions used in software packages. For example, Minitab 

defines the scale parameter as 1/A. Sketches of the gamma distribution for several values of A 

and r are shown in Fig. 4-25. Many different shapes can be generated from changes to the 
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parameters. Also, the change of variable u = Ax and the definition of the gamma function can 
be used to show that the density integrates to one. 

Recall that for an exponential distribution with parameter A, the mean and variance are 
1/A and 1/A2, respectively. An Erlang random variable is the time until the rth event in a 
Poisson process and the time between events are independent. Therefore, it is plausible that the 
mean and variance of a gamma random variable multiply the exponential results by r. This 
motivates the following conclusions. Repeated integration by parts can be used to derive these, 


but the details are lengthy and omitted. 


Mean and 
Variance If X is a gamma random variable with parameters J and r, 
ee i Neeandiee —r he (4-19) 
EXAMPLE 4-24 


The time to prepare a micro-array slide for high-throughput ge- 
nomics is a Poisson process with a mean of two hours per slide. 
What is the probability that 10 slides require more than 25 hours 
to prepare? 

Let X denote the time to prepare 10 slides. Because of the as- 
sumption of a Poisson process, X has a gamma distribution with 
A = 1/2, r = 10, and the requested probability is P(X > 25). 
The probability can be obtained from software that provides 
cumulative Poisson probabilities or gamma probabilities. For 
the cumulative Poisson probabilities we use the method in 
Example 4-23 to obtain 


ae we te 
P(X > 25) a = c y 


In Minitab we set the mean = 12.5 and the input = 9 to obtain 
P(X > 25) = 0.2014. 

As a check, we use the gamma cumulative probability 
function in Minitab. Set the shape parameter to 10, the scale 
parameter to 2 because Minitab uses the inverse of our defini- 
tion, and the input to 25. The probability computed is 
P(X = 25) = 0.7986, and when this is subtracted from one 


we match with the previous result that P(X > 25) = 0.2014. 


What are the mean and standard deviation of the time to 
prepare 10 slides? The mean time is 


E(X) = r/A = 10/0.5 = 20 


The variance of time is 
V(X) = r/A? = 10/0.57 = 40 


so that the standard deviation is 40'/? = 6.32 hours. 
The slides will be completed by what length of time with 
probability equal to 0.95? The question asks for x such that 


P(X <x) = 0.95 


where X is gamma with A = 0.5 and r = 10. In Minitab, we 
use the gamma inverse cumulative probability function and set 
the shape parameter to 10, the scale parameter to 2 because 
Minitab uses the inverse of our definition, and the probability 
to 0.95. The solution computed is 


P(X < 31.41) = 0.95 


Practical Interpretation: Based on this result a schedule that al- 
lows 31.41 hours to prepare 10 slides should be met on 95% 
of occasions. 


Furthermore, the chi-squared distribution is a special case of the gamma distribution in 


which A = 1/2 and r equals one of the values 1/2, 1, 3/2, 2,... 


. This distribution is used 


extensively in interval estimation and tests of hypotheses that are discussed in subsequent 
chapters. The chi-squared distribution is discussed in Chapter 7. 


EXERCISES FOR SECTION 4-9 


4-111. Use the properties of the gamma function to evaluate 
the following: 
(a) T(6) 

(c) T(9/2) 
4-112. Given the probability density function f(x) = 
0.017x7e~°°'*/T(3), determine the mean and variance of the 
distribution. 


(b) P(5/2) 


4-113. Calls to a telephone system follow a Poisson distri- 

bution with a mean of five calls per minute. 

(a) What is the name applied to the distribution and parame- 
ter values of the time until the tenth call? 

(b) What is the mean time until the tenth call? 

(c) What is the mean time between the ninth and tenth 
calls? 


(d) What is the probability that exactly four calls occur within 
one minute? 

(e) If 10 separate 1-minute intervals are chosen, what is the 
probability that all intervals contain more than two calls? 


4-114. Raw materials are studied for contamination. 

Suppose that the number of particles of contamination per 

pound of material is a Poisson random variable with a mean of 

0.01 particle per pound. 

(a) What is the expected number of pounds of material 
required to obtain 15 particles of contamination? 

(b) What is the standard deviation of the pounds of materials 
required to obtain 15 particles of contamination? 


4-115. The time between failures of a laser in a cytogenics ma- 

chine is exponentially distributed with a mean of 25,000 hours. 

(a) What is the expected time until the second failure? 

(b) What is the probability that the time until the third failure 
exceeds 50,000 hours? 


4-116. Ina data communication system, several messages 

that arrive at a node are bundled into a packet before they are 

transmitted over the network. Assume the messages arrive at 

the node according to a Poisson process with tT = 30 mes- 

sages per minute. Five messages are used to form a packet. 

(a) What is the mean time until a packet is formed, that is, 
until five messages have arrived at the node? 

(b) What is the standard deviation of the time until a packet is 
formed? 

(c) What is the probability that a packet is formed in less than 
10 seconds? 

(d) What is the probability that a packet is formed in less than 
5 seconds? 


4-117. Errors caused by contamination on optical disks 

occur at the rate of one error every 10° bits. Assume the errors 

follow a Poisson distribution. 

(a) What is the mean number of bits until five errors occur? 

(b) What is the standard deviation of the number of bits until 
five errors occur? 
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(c) The error-correcting code might be ineffective if there are 
three or more errors within 10° bits. What is the probabil- 
ity of this event? 


4-118. Calls to the help line of a large computer distributor 

follow a Poisson distribution with a mean of 20 calls per minute. 

(a) What is the mean time until the one-hundredth call? 

(b) What is the mean time between call numbers 50 and 80? 

(c) What is the probability that three or more calls occur 
within 15 seconds? 


4-119. The time between arrivals of customers at an auto- 

matic teller machine is an exponential random variable with a 

mean of 5 minutes. 

(a) What is the probability that more than three customers 
arrive in 10 minutes? 

(b) What is the probability that the time until the fifth cus- 
tomer arrives is less than 15 minutes? 


4-120. Use integration by parts to show that '(r) = (r — 1) 
T(r — 1). 

4-121. Show that the gamma density function f(x, A, r) in- 
tegrates to 1. 


4-122. Use the result for the gamma distribution to determine 
the mean and variance of a chi-square distribution with r = 7/2. 


4-123. Patients arrive at an emergency department accord- 

ing to a Poisson process with a mean of 6.5 per hour. 

(a) What is the mean time until the tenth arrival? 

(b) What is the probability that more than 20 minutes is re- 
quired for the third arrival? 


4-124. The total service time of a multistep manufacturing 

operation has a gamma distribution with mean 18 minutes and 

standard deviation 6. 

(a) Determine the parameters X and r of the distribution. 

(b) Assume each step has the same distribution for service 
time. What distribution for each step and how many steps 
produce this gamma distribution of total service time? 


As mentioned previously, the Weibull distribution is often used to model the time until failure 
of many different physical systems. The parameters in the distribution provide a great deal of 
flexibility to model systems in which the number of failures increases with time (bearing 
wear), decreases with time (some semiconductors), or remains constant with time (failures 
caused by external shocks to the system). 


Weibull 


Distribution The random variable X with probability density function 


fx) = Bz) exp -(2} forx > 0 


is a Weibull random variable with scale parameter 6 > 0 and shape parameter B > 0. 


(4-20) 
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Figure 4-26 Weibull probability density functions 
for selected values of 6 and B. 


The flexibility of the Weibull distribution is illustrated by the graphs of selected probability 
density functions in Fig. 4-26. By inspecting the probability density function, it is seen that 
when £ = 1, the Weibull distribution is identical to the exponential distribution. Also, the 
Raleigh distribution is a special case when the shape parameter is 2. 

The cumulative distribution function is often used to compute probabilities. The follow- 
ing result can be obtained. 


If X has a Weibull distribution with parameters 5 and B, then the cumulative distri- 
bution function of X is 


F(x) =1- (Gs) (4-21) 


Also, the following result can be obtained. 


If X has a Weibull distribution with parameters 6 and B, 


4 = 1960) = ar(i + *) and o? = V(X) = er(1 ct ~) a e[r(1 i =] 


EXAMPLE 4-25 Bearing Wear 
The time to failure (in hours) of a bearing in a mechanical 
shaft is satisfactorily modeled as a Weibull random variable 
with B = 1/2 and 6 = 5000 hours. Determine the mean time 
until failure. 

From the expression for the mean, 


E(X) = 5000P[1 + (1/2)] = SOOOT[1.5] 
= 5000 X 0.5Vm! = 4431.1 hours 


EXERCISES FOR SECTION 4-10 
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Determine the probability that a bearing lasts at least 6000 
hours. Now, 


2 
P(X > 6000) = 1 — F(6000) = exp| - (or) 


=e !“4 = 0.237 


Practical Interpretation: Consequently, only 23.7% of all bear- 
ings last at least 6000 hours. 


4-125. Suppose that X has a Weibull distribution with 
8B = 0.2 and 6 = 100 hours. Determine the mean and vari- 
ance of X. 


4-126. Suppose that X has a Weibull distribution with 
6B = 0.2 and 8 = 100 hours. Determine the following: 

(a) P(X < 10,000) (b) P(X > 5000) 

4-127. If X is a Weibull random variable with 8B = 1 and 
5 = 1000, what is another name for the distribution of X and 
what is the mean of X? 


4-128. Assume that the life of a roller bearing follows a 

Weibull distribution with parameters B = 2 and 8 = 10,000 

hours. 

(a) Determine the probability that a bearing lasts at least 8000 
hours. 

(b) Determine the mean time until failure of a bearing. 

(c) If 10 bearings are in use and failures occur independently, 
what is the probability that all 10 bearings last at least 
8000 hours? 


4-129. The life (in hours) of a computer processing unit 
(CPU) is modeled by a Weibull distribution with parameters 
6B = 3 and 6 = 900 hours. 

(a) Determine the mean life of the CPU. 

(b) Determine the variance of the life of the CPU. 

(c) What is the probability that the CPU fails before 500 hours? 


4-130. Assume the life of a packaged magnetic disk exposed 

to corrosive gases has a Weibull distribution with B = 0.5 and 

the mean life is 600 hours. 

(a) Determine the probability that a packaged disk lasts at 
least 500 hours. 

(b) Determine the probability that a packaged disk fails be- 
fore 400 hours. 


4-131. The life (in hours) of a magnetic resonance imaging 
machine (MRI) is modeled by a Weibull distribution with pa- 
rameters B = 2 and 6 = 500 hours. 

(a) Determine the mean life of the MRI. 

(b) Determine the variance of the life of the MRI. 

(c) What is the probability that the MRI fails before 250 hours? 


4-132. An article in the Journal of the Indian Geophysical 
Union titled “Weibull and Gamma Distributions for Wave 


Parameter Predictions” (2005, Vol. 9, pp. 55-64) used the 
Weibull distribution to model ocean wave heights. Assume that 
the mean wave height at the observation station is 2.5 m and the 
shape parameter equals 2. Determine the standard deviation of 
wave height. 


4-133. An article in the Journal of Geophysical Research 
(“Spatial and Temporal Distributions of U.S. of Winds and 
Wind Power at 80 m Derived from Measurements,” (2003, 
vol. 108, pp. 10-1: 10—20)] considered wind speed at stations 
throughout the U.S. A Weibull distribution can be used to 
model the distribution of wind speeds at a given location. 
Every location is characterized by a particular shape and 
scale parameter. For a station at Amarillo, Texas, the mean 
wind speed at 80 m (the hub height of large wind turbines) in 
2000 was 10.3 m/s with a standard deviation of 4.9 m/s. 
Determine the shape and scale parameters of a Weibull distri- 
bution with these properties. 


4-134. Suppose that X has a Weibull distribution with B = 2 
and 6 = 8.6. Determine the following: 

(a) P(X < 10) (b) P(X > 9) 

(c) P(8&<X<11) 

(d) value for x such that P(X > x) = 0.9 


4-135. Suppose the lifetime of a component (in hours) is 

modeled with a Weibull distribution with B = 2 and 6 = 4000. 

Determine the following: 

(a) P(X > 3000) (b) P(X > 6000 | X > 3000) 

(c) Comment on the probabilities in the previous parts com- 
pared to the results for an exponential distribution. 


4-136. Suppose the lifetime of a component (in hours) is 

modeled with a Weibull distribution with 8 = 0.5 and 6 = 

4000. Determine the following: 

(a) P(X > 3000) (b) P(X > 6000 | X > 3000) 

(c) Comment on the probabilities in the previous parts com- 
pared to the results for an exponential distribution. 

(d) Comment on the role of the parameter B in a lifetime 
model with the Weibull distribution. 


4-137. Suppose X has a Weibull distribution with B = 2 and 
5 = 2000. 
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(a) Determine P(X > 3500). (c) Suppose X represent the lifetime of a component in hours. 
(b) Determine P(X > 3500) for an exponential random vari- Comment on the probability that the lifetime exceeds 3500 
able with the same mean as the Weibull distribution. hours under the Weibull and exponential distributions. 


4-11 LOGNORMAL DISTRIBUTION 


Lognormal 
Distribution 


Variables in a system sometimes follow an exponential relationship as x = exp(w). If the 
exponent is a random variable W, then X = exp(W) is a random variable with a distribution 
of interest. An important special case occurs when W has a normal distribution. In that 
case, the distribution of X is called a lognormal distribution. The name follows from the 
transformation In (X) = W. That is, the natural logarithm of X is normally distributed. 

Probabilities for X are obtained from the transformation to W, but the range of X is (0, ©). 
Suppose that Wis normally distributed with mean @ and variance w”; then the cumulative dis- 
tribution function for X is 


F(x) = P[X <x] = Plexp(W) <x] = P[W<In()] 
_ rz in(s) = ) 7 [ner J 


for x > 0, where Z is a standard normal random variable. Therefore, Appendix Table III can 
be used to determine the probability. Also, F(x) = 0 for x = 0. 

The probability density function of X can be obtained from the derivative of F(x). This 
derivative is applied to the last term in the expression for F(x), the integral of the standard 
normal density function. Furthermore, from the probability density function, the mean and 
variance of X can be derived. The details are omitted, but a summary of results follows. 


Let W have a normal distribution with mean @ and variance w”; then X = exp(W) is 
a lognormal random variable with probability density function 


_ 1 (Inx — 6)? 
Ax) = xw V2 exo|- 207 


The mean and variance of X are 


| O<x<a 


2 


DS CURE ol = CPE TES = i) (4-22) 


The parameters of a lognormal distribution are 6 and w”, but care is needed to interpret that 
these are the mean and variance of the normal random variable W. The mean and variance of 
X are the functions of these parameters shown in (4-22). Figure 4-27 illustrates lognormal dis- 
tributions for selected values of the parameters. 

The lifetime of a product that degrades over time is often modeled by a lognormal ran- 
dom variable. For example, this is a common distribution for the lifetime of a semiconductor 
laser. A Weibull distribution can also be used in this type of application, and with an appro- 
priate choice for parameters, it can approximate a selected lognormal distribution. However, 
a lognormal distribution is derived from a simple exponential function of a normal random 
variable, so it is easy to understand and easy to evaluate probabilities. 
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Figure 4-27 Lognormal probability density functions with 


6 = 0 for selected values of w. 


EXAMPLE 4-26 Semiconductor Laser 
The lifetime of a semiconductor laser has a lognormal distri- 
bution with 6 = 10 hours and w = 1.5 hours. What is the 
probability that the lifetime exceeds 10,000 hours? 

From the cumulative distribution function for X, 


P(X > 10,000) = 1 — P[exp(W) <= 10,000] 
= 1 — P[W < In(10,000)] 
; ae — 10 


1.5 
= 1 — 0.30 = 0.70 


)=1 (—0.52) 


What lifetime is exceeded by 99% of lasers? The question is to 
determine x such that P(X > x) = 0.99. Therefore, 


P(X > x) = Plexp(W) > x] = P[W > In(x)] 


0 = ~) 
1 15 0.99 


EXERCISES FOR SECTION 4-11 


From Appendix Table III, 1 — ®(z) = 0.99 when z = —2.33. 
Therefore, 


In(x) — 10 


15 = —2.33 and x = exp(6.505) = 668.48 hours 


Determine the mean and standard deviation of lifetime. Now, 


E(X) = e+? = exp(10 + 1.125) = 67,846.3 
V(X) = ete — 1) = exp(20 + 2.25)[exp(2.25) — 1] 
39,070,059,886.6 


ll 


so the standard deviation of X is 197,661.5 hours. 
Practical Interpretation: The standard deviation of a log- 
normal random variable can be large relative to the mean. 


4-138. Suppose that X has a lognormal distribution with 
parameters @ = 5 and w” = 9. Determine the following: 

(a) P(X < 13,300) 

(b) The value for x such that P(X = x) = 0.95 

(c) The mean and variance of X 

4-139. Suppose that X has a lognormal distribution with 
parameters @ = —2 and w” = 9. Determine the following: 
(a) P(500 < X < 1000) 

(b) The value for x such that P(X < x) = 0.1 

(c) The mean and variance of X 


4-140. Suppose that X has a lognormal distribution with 
parameters @ = 2 and w* = 4. Determine the following: 


(a) P(X < 500) 

(b) The conditional probability that X < 1500 given that 
X > 1000 

(c) What does the difference between the probabilities in 
parts (a) and (b) imply about lifetimes of lognormal ran- 
dom variables? 


4-141. The length of time (in seconds) that a user views a 

page on a Web site before moving to another page is a lognor- 

mal random variable with parameters @ = 0.5 and w” = 1. 

(a) What is the probability that a page is viewed for more than 
10 seconds? 
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(b) By what length of time have 50% of the users moved to 
another page? 

(c) What is the mean and standard deviation of the time until 
a user moves from the page? 


4-142. Suppose that X has a lognormal distribution and that 
the mean and variance of X are 100 and 85,000, respectively. 
Determine the parameters @ and w” of the lognormal distribu- 
tion. (Hint: define x = exp(@) and y = exp(w”) and write two 
equations in terms of x and y.) 

4-143. The lifetime of a semiconductor laser has a lognor- 
mal distribution, and it is known that the mean and standard 
deviation of lifetime are 10,000 and 20,000, respectively. 

(a) Calculate the parameters of the lognormal distribution. 
(b) Determine the probability that a lifetime exceeds 10,000 

hours. 
(c) Determine the lifetime that is exceeded by 90% of lasers. 


4-144. Anarticle in Health and Population: Perspectives and 
Issues (2000, Vol. 23, pp. 28-36) used the lognormal distribution 
to model blood pressure in humans. The mean systolic blood 
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pressure (SBP) in males age 17 was 120.87 mm Hg. If the co- 
efficient of variation (100% X standard deviation/mean) is 9%, 
what are the parameter values of the lognormal distribution? 
4-145. Derive the probability density function of a lognor- 
mal random variable from the derivative of the cumulative 
distribution function. 


4-146. Suppose X has a lognormal distribution with param- 

eters 0 = 10 and w” = 16. Determine the following: 

(a) P(X < 2000) 

(b) P(X > 1500) 

(c) value exceeded with probability 0.7 

4-147. Suppose the length of stay (in hours) at an emergency 

department is modeled with a lognormal random variable X 

with 6 = 1.5 and w = 0.4. Determine the following: 

(a) mean and variance 

(b) P(X <8) 

(c) Comment on the difference between the probability P(X < 0) 
calculated from this lognormal distribution and a normal 
distribution with the same mean and variance. 


A continuous distribution that is flexble, but bounded over a finite range, is useful for prob- 
ability models. The proportion of solar radiation absorbed by a material or the proportion 
(of the maximum time) required to complete a task in a project are examples of continuous 
random variables over the interval [0, 1]. 


F(x) = 


The random variable X with probability density function 


T(a + B) 
Pa) © (8) 


is a beta random variable with parameters a > 0 and B > 0. 


(Wwe eee for x e[0, 1] 


The shap parameters a and £ allow the probability density function to assume many dif- 
ferent shapes. Figure 4-28 provides some examples. If a = 8 the distribution is symmetric 
about x = 0.5, and if a = B = | the beta distribution equals a continuous uniform distri- 
bution. The figure illustrates that other parameter choices generate nonsymmetric distribu- 


tions. 


In general, there is a not a closed-form expression for the cumulative distribution func- 
tion, and probabilities for beta random variables need to be computed numerically. The 
exercises provide some special cases where the probability density function is more easily 


handled. 


Figure 4-28 Beta 
probability density 
functions for selected 
values of the parame- 
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EXAMPLE 4-27 
Consider the completion time of a large commercial develop- Tia + ma a 
ment. The proportion of the maximum allowed time to com- P(X > 0.7) Ta W(x) 
plete a task is modeled as a beta random variable with a = 2.5 os 
and 8 = 1. What is the probability that the proportion of the es xis 
maximum time exceeds 0.7? ie 
Suppose X denotes the proportion of the maximum time 2.5(1 ao — I 1 

required to complete the task. The probability is = t 5(05)Va 2 57 x2 

A TT = 0.7 

— 0.7°° = 0.59 
If X has a beta distribution with parameters a and B, 
Qa ap 
E(X 7 = VX) = 
ye atp Cs (a + BY(a + B + 1) 
EXAMPLE 4-28 
Consider the proportion of time required to complete the task From the expression for the mean and variance, 
described in the previous example. Calculate the mean and 
variance of this random variable. = aad =(.71 c= saa = 0.045 
2541 3.5°(4.5) 


If « > 1 and B > 1, the mode (peak of the density) is in the interior of [0, 1] and equals 


a-— 1 


mode = —————— 
at+Bp-2 


This expression is useful to relate the peak of the density to the parameters. For the distribu- 
tion used previously for the proportion of time required to complete a task, a = 2.5 and B = 1 
and the mode of this distribution is (2.5 — 1)/(3.5 —2) = 1. Also, although a beta random vari- 
able X is defined over the interval [0, 1], a random variable W defined over the finite interval 
[a, b] can be constructed from W = a+ (b — a)X. 
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EXERCISES FOR SECTION 4-12 


4-148. Suppose X has a beta distribution with parameters 
a = 2.5 and B® = 2.5. Sketch an approximate graph of the 
probability density function. Is the density symmetric? 


4-149. Suppose x has a beta distribution with parameters 
a = 2.5 and B = 1. Determine the following: 

(a) P(X <0.25) = (b) P(0.25 < X < 0.75) 

(c) mean and variance 


4-150. Suppose X has a beta distribution with parameters 
a = 1 and B = 4.2. Determine the following: 

(a) P(X <0.25)  (b) P(O.5<X) 

(c) mean and variance 


4-151. A European standard value for a low-emission 

window glazing uses 0.59 as the proportion of solar energy 

that enters a room. Suppose that the distribution of the pro- 

portion of solar energy that enters a room is a beta random 

variable. 

(a) Calculate the mode, mean, and variance of the distribution 
fora = 3 and B = 1.4. 

(b) Calculate the mode, mean, and variance of the distribution 
for « = 10 and B = 6.25. 

(c) Comment on the difference in dispersion in the distribu- 
tions from the previous parts. 


4-152. The length of stay at an emergency department is the 
sum of the waiting and service times. Let X denote the propor- 
tion of time spent waiting and assume a beta distribution with 
a = 10 and B = 1. Determine the following: 

(a) P(X > 0.9) 

(b) P(X < 0.5) 

(c) mean and variance 


4-153. The maximum time to complete a task in a project is 
2.5 days. Suppose that the completion time as a proportion of 
this maximum is a beta random variable with a = 2 and B = 
3. What is the probability that the task requires more than two 
days to complete? 


Supplemental Exercises 


4-154. The probability density function of the time it takes 

a hematology cell counter to complete a test on a blood sample 

is f(x) = 0.04 for 50 < x < 75 seconds. 

(a) What percentage of tests require more than 70 seconds to 
complete? 

(b) What percentage of tests require less than one minute to 
complete? 

(c) Determine the mean and variance of the time to complete 
a test on a sample. 


4-155. The tensile strength of paper is modeled by a normal 

distribution with a mean of 35 pounds per square inch and a 

standard deviation of 2 pounds per square inch. 

(a) What is the probability that the strength of a sample is less 
than 40 Ib/in?? 


(b) If the specifications require the tensile strength to exceed 

30 Ib/in?, what proportion of the samples is scrapped? 
4-156. The time it takes a cell to divide (called mitosis) is 
normally distributed with an average time of one hour and a 
standard deviation of 5 minutes. 

(a) What is the probability that a cell divides in less than 
45 minutes? 

(b) What is the probability that it takes a cell more than 
65 minutes to divide? 

(c) By what time have approximately 99% of all cells com- 
pleted mitosis? 

4-157. The length of an injection-molded plastic case 

that holds magnetic tape is normally distributed with a length of 

90.2 millimeters and a standard deviation of 0.1 millimeter. 

(a) What is the probability that a part is longer than 90.3 mil- 
limeters or shorter than 89.7 millimeters? 

(b) What should the process mean be set at to obtain the great- 
est number of parts between 89.7 and 90.3 millimeters? 

(c) If parts that are not between 89.7 and 90.3 millimeters are 
scrapped, what is the yield for the process mean that you 
selected in part (b)? 

Assume that the process is centered so that the mean is 90 mil- 

limeters and the standard deviation is 0.1 millimeter. Suppose that 

10 cases are measured, and they are assumed to be independent. 

(d) What is the probability that all 10 cases are between 89.7 
and 90.3 millimeters? 

(e) What is the expected number of the 10 cases that are be- 
tween 89.7 and 90.3 millimeters? 

4-158. The sick-leave time of employees in a firm in a 

month is normally distributed with a mean of 100 hours and a 

standard deviation of 20 hours. 

(a) What is the probability that the sick-leave time for next 
month will be between 50 and 80 hours? 

(b) How much time should be budgeted for sick leave if the 
budgeted amount should be exceeded with a probability 
of only 10%? 

4-159. The percentage of people exposed to a bacteria who 

become ill is 20%. Assume that people are independent. Assume 

that 1000 people are exposed to the bacteria. Approximate each 
of the following: 

(a) The probability that more than 225 become ill 

(b) The probability that between 175 and 225 become ill 

(c) The value such that the probability that the number of 
people who become ill exceeds the value is 0.01 

4-160. The time to failure (in hours) for a laser in a cytom- 

etry machine is modeled by an exponential distribution with 

A = 0.00004. 

(a) What is the probability that the laser will last at least 
20,000 hours? 

(b) What is the probability that the laser will last at most 
30,000 hours? 

(c) What is the probability that the laser will last between 
20,000 and 30,000 hours? 


4-161. When a bus service reduces fares, a particular trip 
from New York City to Albany, New York, is very popular. 
A small bus can carry four passengers. The time between calls 
for tickets is exponentially distributed with a mean of 30 min- 
utes. Assume that each call orders one ticket. What is the prob- 
ability that the bus is filled in less than three hours from the 
time of the fare reduction? 


4-162. The time between process problems in a manufac- 

turing line is exponentially distributed with a mean of 30 days. 

(a) What is the expected time until the fourth problem? 

(b) What is the probability that the time until the fourth prob- 
lem exceeds 120 days? 


4-163. The life of a recirculating pump follows a Weibull 

distribution with parameters B = 2 and 6 = 700 hours. 

(a) Determine the mean life of a pump. 

(b) Determine the variance of the life of a pump. 

(c) What is the probability that a pump will last longer than its 
mean? 


4-164. The size of silver particles in a photographic emul- 

sion is known to have a log normal distribution with a mean of 

0.001 mm and a standard deviation of 0.002 mm. 

(a) Determine the parameter values for the lognormal distri- 
bution. 

(b) What is the probability of a particle size greater than 
0.005 mm? 


4-165. Suppose that f(x) =05x—-1 for 2<x<4. 

Determine the following: 

(a) P(X < 2.5) 

(b) P(X > 3) 

(c) P(2.5 < X < 3.5) 

(d) Determine the cumulative distribution function of the 
random variable. 

(e) Determine the mean and variance of the random variable. 


4-166. The time between calls is exponentially distributed 

with a mean time between calls of 10 minutes. 

(a) What is the probability that the time until the first call is 
less than 5 minutes? 

(b) What is the probability that the time until the first call is 
between 5 and 15 minutes? 

(c) Determine the length of an interval of time such that the 
probability of at least one call in the interval is 0.90. 

(d) If there has not been a call in 10 minutes, what is the 
probability that the time until the next call is less than 
5 minutes? 

(e) What is the probability that there are no calls in the inter- 
vals from 10:00 to 10:05, from 11:30 to 11:35, and from 
2:00 to 2:05? 

(f) What is the probability that the time until the third call is 
greater than 30 minutes? 

(g) What is the mean time until the fifth call? 


4-167. The CPU ofa personal computer has a lifetime that 
is exponentially distributed with a mean lifetime of six years. 
You have owned this CPU for three years. 
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(a) What is the probability that the CPU fails in the next three 
years? 

(b) Assume that your corporation has owned 10 CPUs for 
three years, and assume that the CPUs fail independently. 
What is the probability that at least one fails within the 
next three years? 


4-168. Suppose that X has a lognormal distribution with 
parameters @ = 0 and w” = 4. Determine the following: 

(a) P10 < X < 50) 

(b) The value for x such that P(X < x) = 0.05 

(c) The mean and variance of X 


4-169. Suppose that X has a lognormal distribution and that 
the mean and variance of X are 50 and 4000, respectively. 
Determine the following: 

(a) The parameters @ and w” of the lognormal distribution 
(b) The probability that X is less than 150 


4-170. Asbestos fibers in a dust sample are identified by an 

electron microscope after sample preparation. Suppose that 

the number of fibers is a Poisson random variable and the 

mean number of fibers per square centimeter of surface dust is 

100. A sample of 800 square centimeters of dust is analyzed. 

Assume a particular grid cell under the microscope represents 

1/160,000 of the sample. 

(a) What is the probability that at least one fiber is visible in 
the grid cell? 

(b) What is the mean of the number of grid cells that need to 
be viewed to observe 10 that contain fibers? 

(c) What is the standard deviation of the number of grid cells 
that need to be viewed to observe 10 that contain fibers? 


4-171. Without an automated irrigation system, the height 

of plants two weeks after germination is normally distributed 

with a mean of 2.5 centimeters and a standard deviation of 

0.5 centimeter. 

(a) What is the probability that a plant’s height is greater than 
2.25 centimeters? 

(b) What is the probability that a plant’s height is between 2.0 
and 3.0 centimeters? 

(c) What height is exceeded by 90% of the plants? 


4-172. Continuation of Exercise 4-171. With an automated 
irrigation system, a plant grows to a height of 3.5 centimeters 
two weeks after germination. 

(a) What is the probability of obtaining a plant of this 
height or greater from the distribution of heights in 
Exercise 4-165. 

(b) Do you think the automated irrigation system increases 
the plant height at two weeks after germination? 


4-173. The thickness of a laminated covering for a wood 

surface is normally distributed with a mean of 5 millimeters 

and a standard deviation of 0.2 millimeter. 

(a) What is the probability that a covering thickness is greater 
than 5.5 millimeters? 

(b) If the specifications require the thickness to be between 
4.5 and 5.5 millimeters, what proportion of coverings do 
not meet specifications? 
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(c) The covering thickness of 95% of samples is below what 
value? 


4-174. The diameter of the dot produced by a printer is 
normally distributed with a mean diameter of 0.002 inch. 
Suppose that the specifications require the dot diameter to be 
between 0.0014 and 0.0026 inch. If the probability that a dot 
meets specifications is to be 0.9973, what standard deviation 
is needed? 


4-175. Continuation of Exercise 4-174. Assume that the 
standard deviation of the size of a dot is 0.0004 inch. If the 
probability that a dot meets specifications is to be 0.9973, 
what specifications are needed? Assume that the specifications 
are to be chosen symmetrically around the mean of 0.002. 


4-176. The life of a semiconductor laser at a constant power 
is normally distributed with a mean of 7000 hours and a stan- 
dard deviation of 600 hours. 

(a) What is the probability that a laser fails before 5800 hours? 

(b) What is the life in hours that 90% of the lasers exceed? 

(c) What should the mean life equal in order for 99% of the 
lasers to exceed 10,000 hours before failure? 

(d) A product contains three lasers, and the product fails if 
any of the lasers fails. Assume the lasers fail indepen- 
dently. What should the mean life equal in order for 99% 
of the products to exceed 10,000 hours before failure? 
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4-177. Continuation of Exercise 4-176. Rework parts (a) 

and (b). Assume that the lifetime is an exponential random 

variable with the same mean. 

4-178. Continuation of Exercise 4-176. Rework parts (a) 

and (b). Assume that the lifetime is a lognormal random vari- 

able with the same mean and standard deviation. 

4-179. A square inch of carpeting contains 50 carpet fibers. 

The probability of a damaged fiber is 0.0001. Assume the 

damaged fibers occur independently. 

(a) Approximate the probability of one or more damaged 
fibers in 1 square yard of carpeting. 

(b) Approximate the probability of four or more damaged 
fibers in 1 square yard of carpeting. 


4-180. An airline makes 200 reservations for a flight who 

holds 185 passengers. The probability that a passenger arrives 

for the flight is 0.9 and the passengers are assumed to be inde- 

pendent. 

(a) Approximate the probability that all the passengers who 
arrive can be seated. 

(b) Approximate the probability that there are empty seats. 

(c) Approximate the number of reservations that the airline 
should make so that the probability that everyone who 
arrives can be seated is 0.95. [Hint: Successively try 
values for the number of reservations. ] 


MIND-EXPANDING EXERCISES 


4-181. The steps in this exercise lead to the probabil- 

ity density function of an Erlang random variable X with 

parameters A and r, f(x) = A’x” le */(r — 1)!, x > 0, 

andr = 1, 2,.... 

(a) Use the Poisson distribution to express P(X > x). 

(b) Use the result from part (a) to determine the cumu- 
lative distribution function of X. 

(c) Differentiate the cumulative distribution function in 
part (b) and simplify to obtain the probability den- 
sity function of X. 

4-182. A bearing assembly contains 10 bearings. The 

bearing diameters are assumed to be independent and 

normally distributed with a mean of 1.5 millimeters and 

a standard deviation of 0.025 millimeter. What is the 

probability that the maximum diameter bearing in the 

assembly exceeds 1.6 millimeters? 


4-183. Let the random variable X denote a measure- 
ment from a manufactured product. Suppose the target 
value for the measurement is m. For example, X could 
denote a dimensional length, and the target might be 10 
millimeters. The quality loss of the process producing 


the product is defined to be the expected value of 

$k(X — m)’, where k is a constant that relates a devia- 

tion from target to a loss measured in dollars. 

(a) Suppose X is a continuous random variable with 
E(X) = mand V(X) = o”. What is the quality loss 
of the process? 

(b) Suppose X is a continuous random variable with 
E(X) = wand V(X) = 0”. What is the quality loss 
of the process? 

4-184. The lifetime of an electronic amplifier is 

modeled as an exponential random variable. If 10% of 

the amplifiers have a mean of 20,000 hours and the re- 
maining amplifiers have a mean of 50,000 hours, what 
proportion of the amplifiers fail before 60,000 hours? 


4-185. Lack of Memory Property. Show that for 
an exponential random variable X, P(X < t+ tf | 
X >t) = P(X <b). 

4-186. A process is said to be of six-sigma quality if 
the process mean is at least six standard deviations from 
the nearest specification. Assume a normally distributed 
measurement. 


(a) 


(b) 
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MIND-EXPANDING EXERCISES 


If a process mean is centered between upper and 
lower specifications at a distance of six standard 
deviations from each, what is the probability that a 
product does not meet specifications? Using the 
result that 0.000001 equals one part per million, 
express the answer in parts per million. 

Because it is difficult to maintain a process mean 
centered between the specifications, the probability 
of a product not meeting specifications is often cal- 
culated after assuming the process shifts. If the 


process mean positioned as in part (a) shifts upward 
by 1.5 standard deviations, what is the probability 
that a product does not meet specifications? Express 
the answer in parts per million. 

(c) Rework part (a). Assume that the process mean is 
at a distance of three standard deviations. 

(d) Rework part (b). Assume that the process mean is at 
a distance of three standard deviations and then 
shifts upward by 1.5 standard deviations. 

(e) Compare the results in parts (b) and (d) and comment. 


IMPORTANT TERMS AND CONCEPTS 


Beta distribution 

Chi-squared 
distribution 

Continuity correction 

Continuous uniform 
distribution 

Cumulative probability 
distribution function- 
continuous random 
variable 

Erlang distribution 


Exponential distribution 

Gamma distribution 

Lack of memory 
property-continuous 
random variable 

Lognormal distribution 

Mean-continuous 
random variable 

Mean-function of a 
continuous random 
variable 


Normal approximation to 
binomial and Poisson 
probabilities 

Normal distribution 

Probability density 
function 

Probability distribution- 
continuous random 
variable 


Standard deviation- 
continuous random 
variable 

Standardizing 

Standard normal 
distribution 

Variance-continuous 
random variable 

Weibull distribution 
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Joint Probability Distributions 


Air quality monitoring stations are maintained throughout Maricopa County, Arizona and 
the Phoenix metropolitan area. Measurements for particulate matter and ozone are meas- 
ured hourly. Particulate matter (known as PM10) is a measure (in g/m? ) of solid and liq- 
uid particles in the air with diameters less than 10 micrometers. Ozone is a colorless gas 
with molecules comprised of three oxygen atoms that make it very reactive. Ozone is 
formed in a complex reaction from heat, sunlight, and other pollutants, especially volatile 
organic compounds. The U.S. Environmental Protection Agency sets limits for both PM10 
and ozone. For example, the limit for ozone is 0.075 ppm. The probability a day in Phoenix 
exceeds the limits for PM10 and ozone is important for compliance and remedial actions 
with the county and city. But this might be more involved that the product of the probabili- 
ties for each pollutant separately. It might be that days with high PM10 measurements also 
tend to have ozone values. That is, the measurements might not be independent and it is the 
joint relationship between these measurements that becomes important. The study of prob- 
ability distributions for more than one random variable is the focus of this chapter and the 
air quality data is just one illustration of the ubiquitous need to study variables jointly. 


CHAPTER OUTLINE 
5-1 TWO OR MORE RANDOM 5-2 COVARIANCE AND 
VARIABLES CORRELATION 
5-1.1 Joint Probability Distributions 5-3. COMMON JOINT DISTRIBUTIONS 
5-1.2 Marginal Probability Distributions 5-3.1 Multinomial Probability Distribution 
5-1.3 Conditional Probability 5-3.2 Bivariate Normal Distribution 
Pisealutions 5-4 LINEAR FUNCTIONS OF 
5-1.4 Independence RANDOM VARIABLES 
5-1.5 More Than Two Random 5-5 GENERAL FUNCTIONS OF 
Variables RANDOM VARIABLES 
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LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 

. Use joint probability mass functions and joint probability density functions to calculate probabilities 
. Calculate marginal and conditional probability distributions from joint probability distributions 
. Interpret and calculate covariances and correlations between random variables 


. Use the multinomial distribution to determine probabilities 


aA Bp WN 


. Understand properties of a bivariate normal distribution and be able to draw contour plots for the 
probability density function 


6. Calculate means and variances for linear combinations of random variables and calculate proba- 
bilities for linear combinations of normally distributed random variables 


7. Determine the distribution of a general function of a random variable 


In Chapters 3 and 4 we studied probability distributions for a single random variable. However, 
it is often useful to have more than one random variable defined in a random experiment. For 
example, in the classification of transmitted and received signals, each signal can be classified 
as high, medium, or low quality. We might define the random variable X to be the number of 
high-quality signals received and the random variable Y to be the number of low-quality signals 
received. In another example, the continuous random variable X can denote the length of one 
dimension of an injection-molded part, and the continuous random variable Y might denote the 
length of another dimension. We might be interested in probabilities that can be expressed in 
terms of both X and Y. For example, if the specifications for X and Y are (2.95 to 3.05) and (7.60 
to 7.80) millimeters, respectively, we might be interested in the probability that a part satisfies 
both specifications; that is, P(2.95 < X < 3.05 and 7.60 < Y < 7.80). 

Because the two random variables are measurements from the same part, small disturbances 
in the injection-molding process, such as pressure and temperature variations, might be more 
likely to generate values for X and Y in specific regions of two-dimensional space. For example, 
a small pressure increase might generate parts such that both X and Y are greater than their 
respective targets, and a small pressure decrease might generate parts such that X and Y are both 
less than their respective targets. Therefore, based on pressure variations, we expect that the 
probability of a part with X much greater than its target and Y much less than its target is small. 

In general, if X and Y are two random variables, the probability distribution that defines 
their simultaneous behavior is called a joint probability distribution. In this chapter, we 
investigate some important properties of these joint distributions. 


5-1 TWO OR MORE RANDOM VARIABLES 
5-1.1 Joint Probability Distributions 
For simplicity, we begin by considering random experiments in which only two random vari- 


ables are studied. In later sections, we generalize the presentation to the joint probability 
distribution of more than two random variables. 


EXAMPLE 5-1 Signal Bars 


Calls are made to check the airline schedule at your departure In the first four bits transmitted, let 

city. You monitor the number of bars of signal strength on your X denote the number of bars of signal strength 
cell phone and the number of times you have to state the name on your cell phone 

of your departure city before the voice system recognizes the 


Y denote the number of times you need to state 


name. : 
your departure city 
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Figure 5-1 Joint 
probability distribution 
of X and Y in Example 
5-1. 


x = number of bars of signal strength 
y = number of times 
city name is stated | 1 2 3 
4|0.15 0.1 0.05 
3 | 0.02 0.1 0.05 
2 | 0.02 0.03 0.2 
1/0.01 0.02 0.25 


By specifying the probability of each of the points in Fig. 5-1, range of the random variables (X, Y) to be the set of points 
we specify the joint probability distribution of X and Y. (x, y) in two-dimensional space for which the probability that 
Similarly to an individual random variable, we define the X = xand Y = yis positive. 


Joint 
Probability 
Mass Function 


If X and Y are discrete random variables, the joint probability distribution of X and Y is a 
description of the set of points (x, y) in the range of (X, Y) along with the probability of each 
point. The joint probability distribution of two random variables is sometimes referred to as 
the bivariate probability distribution or bivariate distribution of the random variables. 
One way to describe the joint probability distribution of two discrete random variables is 
through a joint probability mass function. Also, P(X = x and Y = y) is usually written as 
P(X =x,Y=y). 


The joint probability mass function of the discrete random variables X and Y, 
denoted as fyy(x, y), satisfies 


(fetes) = 0 
2) SD fly) =1 
x oy 


(3) fxy(x, y) = P(X = x, Y = y) (5-1) 


Just as the probability mass function of a single random variable X is assumed to be zero at all 
values outside the range of X, so the joint probability mass function of X and Y is assumed to 
be zero at values for which a probability is not specified. 

The joint probability distribution of two continuous random variables X and Y can be 
specified by providing a method for calculating the probability that X and Y assume a value in 
any region R of two-dimensional space. Analogous to the probability density function of a 
single continuous random variable, a joint probability density function can be defined over 
two-dimensional space. The double integral of fyy(x, y) over a region R provides the proba- 
bility that (X, Y) assumes a value in R. This integral can be interpreted as the volume under the 
surface fyy(x, y) over the region R. 

A joint probability density function for X and Y is shown in Fig. 5-2. The probability 
that (X,Y) assumes a value in the region R equals the volume of the shaded region in 
Fig. 5-2. In this manner, a joint probability density function is used to determine probabil- 
ities for X and Y. 
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fry (x, y) 


fry y) 


Figure 5-2 Joint probability density function for Figure 5-3 Joint probability density function for the lengths 
random variables X and Y. Probability that (X, Y) is of different dimensions of an injection-molded part. 

in the region R is determined by the volume of 

Fay, y) over the region R. 


Joint 
Probability A joint probability density function for the continuous random variables X and Y, 
Density denoted as fxy(x, y), satisfies the following properties: 
Function 


(1) fxy(% y) = 0 for all x, y 


(oo) 


@ | | fortes) ) dx dy = 1 


—0 —0o0 


(3) For any region R of two-dimensional space, 


P(x, ¥) € R) = || fur(ssy) ae dy 65-2) 


R 


Typically, fxy(x, y) is defined over all of two-dimensional space by assuming that 
fxy(x, y) = 0 for all points for which fyy(x, y) is not specified. 

At the start of this chapter, the lengths of different dimensions of an injection-molded part 
were presented as an example of two random variables. Each length might be modeled by a 
normal distribution. However, because the measurements are from the same part, the random 
variables are typically not independent. A probability distribution for two normal random vari- 
ables that are not independent is important in many applications and it is presented later in this 
chapter. If the specifications for X and Y are 2.95 to 3.05 and 7.60 to 7.80 millimeters, respec- 
tively, we might be interested in the probability that a part satisfies both specifications; that is, 
P(2.95 < X < 3.05, 7.60 < Y < 7.80). Suppose that fyy(x, y) is shown in Fig. 5-3. The re- 
quired probability is the volume of fxy(x, y) within the specifications. Often a probability such 
as this must be determined from a numerical integration. 


EXAMPLE 5-2 Server Access Time 
Let the random variable X denote the time until a computer the wait from a common starting time and X < Y. Assume that 
server connects to your machine (in milliseconds), and let Y the joint probability density function for X and Y is 

denote the time until the server authorizes you as a valid user = = 

(in milliseconds). Each of these random variables measures Sxv(%s ¥) = 6 X 10™° exp(—0.001x — 0.002y) forx<y 
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0 x 


Figure 5-4 The joint probabil- 
ity density function of X and Y is 
nonzero over the shaded region. 


Reasonable assumptions can be used to develop such a distri- 
bution, but for now, our focus is only on the joint probability 
density function. 

The region with nonzero probability is shaded in 
Fig. 5-4. The property that this joint probability density function 
integrates to 1 can be verified by the integral of fyy(x, y) over 
this region as follows: 


oo 


[oe v4 10~%e7 9.0014 — 0.002y dy dx 


x 


om 8 


| fiur(x, y) dy dx = 


~ —00 


{—— 8 


_ 6 x 10°] [e-om dy e 0.001 dx 
0 x 
: 7 e 0:002x ‘tel 
= Ese 10" =0.001x 
| 0.002 7 


1 
= 0. =1 
nee (<a0s) 


5-1.2 Marginal Probability Distributions 


2000 


0) 
QO 1000 x 


Figure 5-5 Region of integration 
for the probability that X < 1000 
and Y < 2000 is darkly shaded. 


The probability that X < 1000 and Y < 2000 is deter- 
mined as the integral over the darkly shaded region in 
Fig. 5-5. 


1000 2000 


P(X < 1000, Y = 2000) = | | fay(x y) dy dx 


O x 
1000 


=6x 10° | | e 9.0027 dy | 0.001 gy 


2000 


e 0:002x = e+ 
=6x 10° | (— a Jenm dx 


1000 


| e 0-003 _ e+ e 0-001 dx 


0 


= Uz e 
0.003 0.001 


= 0.003 (316.738 — 11.578) = 0.915 


Practical Interpretation: A joint probability density func- 
tion enables probabilities for two (or more) random variables 
to be calculated as in these examples. 


If more than one random variable is defined in a random experiment, it is important to distin- 
guish between the joint probability distribution of X and Y and the probability distribution of 
each variable individually. The individual probability distribution of a random variable is 
referred to as its marginal probability distribution. 

In general, the marginal probability distribution of X can be determined from the joint 
probability distribution of X and other random variables. For example, consider discrete 
random variables X and Y. To determine P(X = x), we sum P(X = x, Y = y) over all points in 
the range of (X, Y) for which X = x. Subscripts on the probability mass functions distinguish 


between the random variables. 
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x = number of bars of signal strength 
Marginal 
y = number of times probability 
city name is stated | 1 2 3 distribution of Y 

4| 0.15 0.1 0.05 0.3 

3 | 0.02 0.1 0.05 0.17 

2 | 0.02 0.03 0.2 0.25 
Figure 5-6 Marginal 1} 0.01 0.02 0.25 0.28 
probability distribu- 
tions of X and Y from 0.2 0.25 0.55 
Fig. 5-1 Marginal probability distribution of X 


EXAMPLE 5-3 Marginal Distribution 

The joint probability distribution of X and Y in Fig. 5-1 can be 
used to find the marginal probability distribution of X. For 
example, 


fx(3) = P(X = 3) = P(X = 3, Y= 1) + P(X = 3, Y = 2) 
+ P(X = 3, Y= 3) + P(X =3,¥ = 4) 
= 0.25 + 0.2 + 0.05 + 0.05 = 0.55 


The marginal probability distribution for X is found by 
summing the probabilities in each column, whereas the 
marginal probability distribution for Y is found by sum- 
ming the probabilities in each row. The results are shown in 
Fig. 5-6. 


For continuous random variables, an analogous approach is used to determine marginal 
probability distributions. In the continuous case, an integral replaces the sum. 


Marginal 


Probability 
Density 
Function 


y 


If the joint probability density function of random variables X and Y is fyy(x, y), the 
marginal probability density functions of X and Y are 


fro) = [fovless)dr and flo) = | fnls ya 


where the first integral is over all points in the range of (X, Y) for which X = x and 
the second integral is over all points in the range of (X, Y) for which Y = y. 


(-3) 


x 


A probability for only one random variable, say, for example, P(a < X < b), can be 
found from the marginal probability distribution of X or from the integral of the joint prob- 


ability distribution of X and Y as 


b 


ma <x<s)= | fuer = | | [ reser |ac= | , [res s)ayae 


a 


EXAMPLE 5-4 Server Access Time 

For the random variables that denote times in Example 5-2, 

calculate the probability that Y exceeds 2000 milliseconds. 
This probability is determined as the integral of fyy(x, y) 

over the darkly shaded region in Fig. 5-7. The region is parti- 

tioned into two parts and different limits of integration are 

determined for each part. 


a -—7 in) 


2000, ca 
P(Y¥ > 2000) = | | 6 X 10 Se .001x~ 0.002 +) dx 
0 200 


op | a x 10 Se 0-001x— 0.002y Gy, dx 


158 CHAPTER 5 JOINT PROBABILITY DISTRIBUTIONS 


y 
Figure 5-7 Region 
of integration for the 
probability that 
Y <= 2000 is darkly aie 
shaded and it is parti- 
tioned into two regions 
with x < 2000 and 0 
x > 2000. 0 2000 x 
The first integral is 

2 
6 xX 1076 | aan i 0.001 fy 
—0.002 | 900 
2000 
6 x 10°° 
= et | 70.00 ry 
0 
6 X 10°6 “(2 - —) 
0.002 © \ 0.001 
= 0.0475 


The second integral is 


6 goes | 0.001 
6 xX 107 70.001 
? | =0.002], } © . 
2000 
6x 10-5 [ en 
= et JIX dx 
0.002 | . - 
2000 
6X 10°°/ e% 
( : ) 0.0025 
0.002 \0.003 
Therefore, 


P(Y > 2000) = 0.0475 + 0.0025 = 0.05 


Alternatively, the probability can be calculated from the mar- 
ginal probability distribution of Y as follows. For y > 0, 


y 


fr(y) = [o x 10 %e~ 0.0014 — 0.002y dx 


0 


y 
=6xX 10~%e~ 9.002 [e-0omax 
0 
eo 0.00Lx 


¥ 
—0.001 ‘ 


70.001 
=6% 10-te-00m( 1 —e *) 
( 


=6x 19-00 ( 


0.001 


=6~x 1073 e 0.002y 1- e” 0.001y) for y >0 


We have obtained the marginal probability density function 
of Y. Now, 


oo 


P(Y > 2000) = 6 x 1073 | e 9.0027(1 — @ 0.00ly) dy 


2000 
Lee (<< oo ) es 
2000 —0.003 


) 


=0.002 
=6x 10] rae x = 0.05 
0.002 0.003 


Also, E(X) and V(X) can be obtained by first calculating the marginal probability distri- 
bution of X and then determining E(X ) and V(X) by the usual method. In Fig. 5-6, the marginal 
probability distribution of X is used to obtain the mean as 


E(X) = 


5-1.3 Conditional Probability Distributions 


1(0.2) + 2(0.25) + 3(0.55) = 2.35 


When two random variables are defined in a random experiment, knowledge of one can change 
the probabilities that we associate with the values of the other. Recall that in Example 5-1, X 
denotes the number of bars of service and Y denotes the number of times you need to state 
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the name of the departure city. One expects the probability Y = 1 to be greater at X = 3 bars 
than at X = 1 bar. From the notation for conditional probability in Chapter 2, we can write 
such conditional probabilities as P(Y = 1|X = 3) and P(Y = 1|X = 1). Consequently, the ran- 
dom variables X and Y are expected to be dependent. Knowledge of the value obtained for X 
changes the probabilities associated with the values of Y. 

Recall that the definition of conditional probability for events A and B is P(B|A) = 
P(A ( B)/P(A). This definition can be applied with the event A defined to be X = x and event 
B defined to be Y = y. 


EXAMPLE 5-5 Signal Bars 
For Example 5-1, X and Y denote the number of bars of signal P(Y = 2|X = 3) = P(X = 3, Y = 2)/P(X = 3) 


strength and times you need to state your departure city re- = fry(3, 2)/fx(3) = 0.2/0.55 = 0.364 
ceived, respectively. Then, 


Additional Conclusion: Further work shows that P(Y = 3| X = 


PY = 1s = 3) = RS 3 Y= 1) Pe = 3) 3) = 0.091 and P(Y = 4| X = 3) = 0.091. Note that P(Y = 1|X 
= fxy(3, 1)/fx(3) = 0.25/0.55 = 0.454 = 3)+ P(Y = 2|X = 3) + P(Y = 3|X =3)+ P(Y=4|X= 


The probability that Y = 


Conditional 
Probability 
Density 
Function 


3) = 1. This set of probabilities defines the conditional proba- 
2 given that X = 3 is bility distribution of Y given that X = 3. 


Example 5-5 illustrates that the conditional probabilities for Y given that X = x can be 
thought of as a new probability distribution. The following definition applies these concepts to 
continuous random variables. 


Given continuous random variables X and Y with joint probability density function 
Fxy@, y), the conditional probability density function of Y given X = x is 


A ie eee (5-4) 
fx(x) 


The conditional probability density function provides the conditional probabilities for the 
values of Y given that X = x. 


Because the conditional probability density function fy),(y) is a probability density 
function for all y in R,, the following properties are satisfied: 


GQ) fry) = 0 
(2) [more =i 


(3) P(YEB|X =x) = | foray for any set B in the range of Y 


B 


(5-5) 
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Figure 5-8 The y 
conditional probability 
density function for Y, 
given that x = 1500, is 
nonzero over the solid 


line. 
1500 


0 
0 1500 x 


It is important to state the region in which a joint, marginal, or conditional probability 
density function is not zero. The following example illustrates this. 


EXAMPLE 5-6 Conditional Probability 
For the random variables that denote times in Example 5-2, 
determine the conditional probability density function for Y 
given that X = x. 

First the marginal density function of x is determined. 
For x > 0, 


e-} 


fe x 107 Se 0.001x= 0.0029 oy, 


x 


ll 


Fx (x) 


—0.002y 


= 6 X 10% 0.001x ( © 
Reale —0.002 


= 6 X 10 ~8e~0.001x (<a) 
0.002 
= 0.003e793* for 


This is an exponential distribution with \ = 0.003. Now, for 
0 <xand x < ythe conditional probability density function is 


x>0 


6x 10 ~ Se 0.001 — 0.002y 


0.00327 9% 
forO <xandx<y 


frc(¥) = fay (% yV/fe(™) 


=0 002e9-002— 0.002y 


EXAMPLE 5-7 

For the joint probability distribution in Fig. 5-1, Frixl y) is 
found by dividing each fyy(x, y) by fx(x). Here, f(x) is simply 
the sum of the probabilities in each column of Fig. 5-1. The 
function frixl(y) is shown in Fig. 5-9. In Fig. 5-9, each column 
sums to one because it is a probability distribution. 


The conditional probability density function of Y, given that 
X = 1500, is nonzero on the solid line in Fig. 5-8. 

Determine the probability that Y exceeds 2000, given 
that x = 1500. That is, determine P(Y > 2000| X = 1500). 
The conditional probability density function is integrated as 
follows: 


oo 


1500) = | Fy iisoo(y) ay 


2000 


ll 


P(Y > 2000|X 


oo 


ll 


| 0.002¢2-002(1500) — 0.002y dy 


2000 
(0.0029 | 
= 0.002 
: (S003 sn) 
3(¢* 
= 0.002e° = 0.368 
0.002e (<5) 0.36 


Properties of random variables can be extended to a 
conditional probability distribution of Y given X = x. The 
usual formulas for mean and variance can be applied to a 
conditional probability density or mass function. 


x = number of bars of signal strength 
, y = number of times 
Figure 5-9 city name is stated | 1 2 3 
Conditional probabil- 4 | 0.750 0.400 0.091 
ity distributions of Y 3 | 0.100 0.400 0.091 
given X = x, fy(y) 2| 0.100 0.120 0.364 
in Example 5-7. 1| 0.050 0.080 0.454 
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Conditional 
Mean and 
Variance 


y 


The conditional mean of Y given X = x, denoted as E(Y|x) or py|,, is 
B13) = | yfielo) 
and the conditional variance of Y given X = x, denoted as V(Y | x) or oY , is 


VY |x) = Jo — py) fyie(y) = [Ano — Wye 


(5-6) 


y 


y 


EXAMPLE 5-8 Conditional Mean and Variance 
For the random variables that denote times in Example 5-2, 
determine the conditional mean for Y given that x = 1500. 

The conditional probability density function for Y was 
determined in Example 5-6. Because fyj1599(y) is nonzero for 
y > 1500, 


E(¥|X = 1500) = | F(O00LeP AO) ay 


1500 


oo 


= 0.002e7 | ye 9002y ay 


1500 


EXAMPLE 5-9 
For the discrete random variables in Example 5-1, the condi- 
tional mean of Y given X = 1 is obtained from the conditional 
distribution in Fig. 5-9: 
E(Y|1) = pyi1 
= 1(0.05) + 2(0.1) + 3(0.1) + 4(0.75) 
= 3:55 


5-1.4 Independence 


Integrate by parts as follows: 


io} 


aes oe 0:002y co 7 e 0.002y 
~0.002 gy = d 
| oe ee? O02 ica | (sam) - 
1500 1500 
1500 _, ( Bee ig ) 
- ys 
0.002 (—0.002)(—0.002) | 1500 
1500 _, e? e? 
= = 2000 
0.002° * (0.002)(0.002) ~ 0.002 (2°) 


With the constant 0.002e? reapplied, 
E(Y|X = 1500) = 2000 


Practical Interpretation: If the connect time is 1500 ms then 
the expected time to be authorized is 2000 ms. 


The conditional mean is interpreted as the expected number of 
times the city name is stated given that one bar of signal is 
present. The conditional variance of Y given X = 1 is 


V(Y|1) = (1 — 3.55)? 0.05 + (2 — 3.55)" 0.1 
+ (3 — 3.55)? 0.1 + (4 — 3.55)? 0.75 = 0.748 


In some random experiments, knowledge of the values of X does not change any of the prob- 
abilities associated with the values for Y. 


EXAMPLE 5-10 Independent Random Variables 


In a plastic molding operation, each part is classified as to 
whether it conforms to color and length specifications. Define 
the random variable X and Y as 


1 if the part conforms to color specifications 
{ 0 otherwise 

_ Ji ifthe part conforms to length specifications 
~ to otherwise 


Assume the joint probability distribution of X and Y is 
defined by fyy(x, y) in Fig. 5-10(a). The marginal probabil- 
ity distributions of X and Y are also shown in Fig. 5-10(a). 
Note that fyy(x, y) = f(x) fy(y). The conditional probabil- 
ity mass function fy|,(y) is shown in Fig. 5-10(b). Notice 
that for any x, Frix( y) = fy(y). That is, knowledge of whether 
or not the part meets color specifications does not change the 
probability that it meets length specifications. 
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Figure 5-10 (a) Joint 
and marginal probabil- 
ity distributions of X 
and Y in Example 
5-10. (b) Conditional 
probability distribution 
of Y given X = xin 
Example 5-10. 


Independence 


JOINT PROBABILITY DISTRIBUTIONS 


y y 
fy) = 
0.0098 0.9702 0.98 0.98 
0.98 le e le e 
0.0002 0.0198 0.02 0.02 
0.02 O08 e Oe e 
0 1 x 0 1 x 
fx@= 0.01 0.99 
(a) (b) 


By analogy with independent events, we define two random variables to be independent 
whenever fxy(x, y) = fx(x) fy(y) for all x and y. Notice that independence implies that 
Svs, y) = f(x) fy(y) for all x and y. If we find one pair of x and y in which the equality fails, 
X and Y are not independent. If two random variables are independent, then for fy (x) > 0, 


- fxy(, y) = Sx(x) fr(y) 
fx() fx() 


With similar calculations, the following equivalent statements can be shown. 


fyx(y) = fry) 


For random variables X and Y, if any one of the following properties is true, the 
others are also true, and X and Y are independent. 


1) fxy(x, y) = f(x) fy(y) for all x and y 
(2) frix(y) =fr(y) for all x and y with f(x) > 0 
(3) fxly(x) = f(x) for all x and y with fy(y) > 0 


(4) P(X €A,Y € B) = P(X € A)P(Y € B) for any sets A and B in the range 
of X and Y, respectively. (5-7) 


Rectangular Range for (X, Y) 
If the set of points in two-dimensional space that receive positive probability under 
fxy(x, y) does not form a rectangle, X and Y are not independent because knowledge of X 
can restrict the range of values of Y that receive positive probability. If the set of points in 
two-dimensional space that receives positive probability under fyy(x, y) forms a rectangle, 
independence is possible but not demonstrated. One of the conditions in Equation 5-6 must 
still be verified. 

The variables in Example 5-2 are not independent. This can be quickly determined be- 
cause the range of (X, Y) shown in Fig. 5-4 is not rectangular. Consequently, knowledge of X 
changes the interval of values for Y with non-zero probability. 


EXAMPLE 5-11 Independent Random Variables 

Suppose that Example 5-2 is modified so that the joint prob- i 

ability density function of X and Y is fxy (x, y) = 2 X fx (x) = | 2 XK 10 Se 0.001 = 0.002» Gry 
10 %exp(—0.001x—0.002y) for x = 0 and y = 0. Show that X A 


and Y are independent and determine P(X > 1000, Y < 1000). 
The marginal probability density function of X is 


= 0.001e 0! forx >0 


The marginal probability density function of Y is 
fr y) = | 2 & 1076 e7 0.001 — 0.002y Gy. 
0 


= 0.002e°°” for y > 0 


Therefore, fxy(x, y) = fx) fy(y) for all x and y, and X and Y 
are independent. 
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To determine the probability requested, property (4) of 
Equation 5-7 and the fact that each random variable has an 
exponential distribution can be applied. 


P(X > 1000, Y < 1000) = P(X > 1000)P(Y < 1000) 
=e !(1 —e 7) = 0.318 


Often, based on knowledge of the system under study, random variables are assumed to 
be independent. Then, probabilities involving both variables can be determined from the mar- 
ginal probability distributions. For example, the time to complete a computer search should be 


independent of an adult’s height. 


EXAMPLE 5-12 Machined Dimensions 

Let the random variables X and Y denote the lengths of two di- 
mensions of a machined part, respectively. Assume that X and 
Y are independent random variables, and further assume that 
the distribution of X is normal with mean 10.5 millimeters and 
variance 0.0025 (millimeter)? and that the distribution of 
Y is normal with mean 3.2 millimeters and variance 0.0036 
(millimeter)”. Determine the probability that 10.4 < X < 
10.6 and 3.15 < Y < 3.25. 

Because X and Y are independent, 


P(10.4 < X < 10.6, 3.15 < Y < 3.25) 


5-1.5 More Than Two Random Variables 


= P(10.4 < X < 10.6)P(3.15 < Y < 3.25) 


10.4 — 10.5 10.6 — 10.5 
=p <Z< 
0.05 0.05 
3.15 — 3.2 3.25 — 3.2 
x P 
0.06 0.06 


= P(-2 < Z < 2)P(—0.833 < Z < 0.833) = 0.568 


where Z denotes a standard normal random variable. 

Practical Interpretation: If random variables are inde- 
pendent probabilities for multiple variables are often much 
easier to compute. 


More than two random variables can be defined in a random experiment. 


EXAMPLE 5-13 Machined Dimensions 
Many dimensions of a machined part are routinely measured 
during production. Let the random variables, X,, X>, X;, and X, 


The joint probability distribution of random variables X,, X>, X3, .. 
by providing a method of calculating the probability that X1, X>, X3, .. 


denote the lengths of four dimensions of a part. Then, at least 
four random variables are of interest in this study. 


., X, can be specified 
., Xp assume a value in 


a region R of p-dimensional space. A joint probability density function fy, x,... x, (Rig May ang he) 
is used to determine the probability that (X), X», X3, ..., X,) assume a value in a region R by 


the multiple integral of fy, y,.. x,(X15 Xa oe 


, Xp) over the region R. 


Joint 
Probability 


Deusi 

eneny es 
Function 
P 


co 00 


—c1o— 20 


A joint probability density function for the continuous random variables X,, X>, 
-, Xp, denoted as fx, x,...x,(%1, X25 ++ 5%, 


(0 fre ia Xp) = 0 
(2) | | ae | Fick. xe 0) dx dx... dXp =1 


(3) For any region B of p-dimensional space, 


P(X), Xo,-.., X,) & B] = [|| Hee | Bernernee oe) dx, dx... AXp 


p)> Satisfies the following properties: 


B 


(5-8) 
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Typically, fy, X>...X, ity Mapu 
Fx, X>...X, ("1 XQ, +++ 


EXAMPLE 5-14 Component Lifetimes 

In an electronic assembly, let the random variables 
X1, X, X3, X4 denote the lifetimes of four components in 
hours. Suppose that the joint probability density function of 
these variables is 


FreixyX4x4(* 1s X25 X3, X4) 
= 9 X 1Q7!2¢70.001x) = 0.002x) — 0.0015x3 — 0.0031, 


for x; = 0, x» = 0, x3 = 0, x4 = 0 


What is the probability that the device operates for more than 
1000 hours without any failures? The requested probability is 


, X,) is defined over all of p-dimensional space by assuming that 
, Xp) = 0 for all points for which fy, x,...x, (1, Xa +++ 


, Xp) is not specified. 


P(X, > 1000, X, > 1000, X; > 1000, X, > 1000), which 
equals the multiple integral of fy, x, x, x,(41, X25 %3, X4) over the 
region x, > 1000, x, > 1000, x; > 1000, x4 > 1000. The joint 
probability density function can be written as a product of 
exponential functions, and each integral is the simple integral 
of an exponential function. Therefore, 


P(X, > 1000, X, > 1000, X; > 1000, X, > 1000) 
=e !2-!5-3 = 0.00055 


Suppose that the joint probability density function of several continuous random 
variables is a constant, say, c over a region R (and zero elsewhere). In this special case, 


ioe} ive} ine} 
| | = | Fu, Xp... X(%1> X25 cary 
—0oo —o0 —00 


Xp) dx, dxz...dx, = c X (volume of region R) = 1 


by property (2) of Equation 5-8. Therefore, (R). Furthermore, by property (3) of Equation 5-8, 


P[(X,, Xo, +. X,) € B 


= [| re [tana (ein 93) dx, dx)... dx, = c X volume (BM R) 


B 


volume (BM R) 


volume (R) 


When the joint probability density function is constant, the probability that the random vari- 
ables assume a value in the region B is just the ratio of the volume of the region BM R to the 
volume of the region R for which the probability is positive. 


EXAMPLE 5-15 Probability as a Ratio of Volumes 
Suppose the joint probability density function of the con- 
tinuous random variables X and Y is constant over the re- 
gion x°+y?=4. Determine the probability that 


The region that receives positive probability is a circle of 
radius 2. Therefore, the area of this region is 47. The area of 
the region x7 + y’ <1 is m. Consequently, the requested 


e+Y<1, probability is 7/(4m) = 1/4. 
Marginal 
Probability If the joint probability density function of continuous random variables X,, X>,... , X, 
Reva is fx,x,... x,(X15 X2, .-., Xp), the marginal probability density function of X; is 


Teal [| oo [ea celeees cea hp) Oe ee 20 iat, 


where the integral is over all points in the range of Xj, X9, .. 


(S-9) 


., X, for which X; = x;. 
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As for two random variables, a probability involving only one random variable, say, for 
example, P(a < X; < b), can be determined from the marginal probability distribution of X; 
or from the joint probability distribution of Xj, X, ... , X,. That is, 


Pia < X; <b) = P(-® < X, < %,..., -© << X;_, << % a <X, <b, 
—% < Xj4,<%,..., -0 < X, < &) 


Furthermore, E(X;) and V(X;), for i = 1,2,...,p, can be determined from the marginal 
probability distribution of X; or from the joint probability distribution of X,, X2,...,X, as 


follows. 
Mean and 
Variance from eee oe 
Joint Distribution 
E(X;) = | | Ben eee: XIy 02+ 5 =) dx, dx» nes dXxp 
and (5-10) 
V(X;) = | | eee | (x; = bx) fx, x)... x,(%1 XQ, +++ 5 Xp) dx, dx aoe dx, 
EXAMPLE 5-16 
Points that have positive probability in the joint probability P(X = 1) = fx,x,x,(2, 1,0) + fx,x,x,(0, 1, 2) 


distribution of three random variables X,, X,, X3 are shown deg (1, 1,1) 
in Fig. 5-11. The range is the nonnegative integers with x, + XH” 
X, + x; = 3. The marginal probability distribution of X, is P(X_ = 2) = fx,x,x,(1, 2, 0) + fx,x,x,(0, 2, 1) 
found as follows. P(X, = 3) = fx,x,x,(0, 3, 0) 


P(X, = 0) = fx,x,x,(3, 0,0) + fx,x,x,(0, 0, 3) 
+ fx,xx,C1, 0,2) + fx,x,x,(2; 0, 1) 


With several random variables, we might be interested in the probability distribution of some 
subset of the collection of variables. The probability distribution of X,, X5,..., X;, k < pcan 
be obtained from the joint probability distribution of X, Xz, ..., X, as follows. 


Distribution 
ofa Subset of If the joint probability density function of continuous random variables X;, X>,..., X, 
Random Variables is fx, x,...x, (1 X2 ---» Xp), the probability density function of X,, X,,...,X;, k <p, is 
Fx, x,...X, By cop) 
= ie [ical Te Oe ak (=I) 
where the integral is over all points in the range of Xj, X>,...,X, for which 


X = x1, X4 = Wry ooo 9 MG = Xk 


166 CHAPTER 5 JOINT PROBABILITY DISTRIBUTIONS 


*3 
36 4 x9 
2 e it 36 
2 

Figure 5-11 Joint 1 1 ras 
probability distribution 0 oi 
of X,, X>, and X3. 0 1 2 3 x1 

Conditional Probability Distribution 

Conditional probability distributions can be developed for multiple random variables by an 

extension of the ideas used for two random variables. 

f ( ) Px, Xy XX) x<(15 Xp, X35 X4, X5) 
XX) Xq|xyxs\F1> 42,13) = 
pee Fayxs(Xa, Xs) 
for Ik, x; (x4, Xs) > 0. 
The concept of independence can be extended to multiple random variables. 
Independence 
Random variables X), X>,... , X, are independent if and only if 
Fx, X... X,(%1 X2 stesn 5%p) = fx fx, 0%) - Sx,(%) for all X1, XQ, +++ Xp (5-12) 

Similar to the result for only two random variables, independence implies that Equation 5-12 

holds for all x), x2, ... , Xp. If we find one point for which the equality fails, X), X2,..., X, are 

not independent. It is left as an exercise to show that if X), X5,..., X, are independent, 

P(X, E Ay, X> E Ad, oles Xp E A,) — P(X, E A,)P(Xy E A>) ses P(X, Ee Ap) 

for any regions Aj, Ap, ...,A, in the range of Xj, X2,..., X,, respectively. 

EXAMPLE 5-17 


In Chapter 3, we showed that a negative binomial random vari- 
able with parameters p and r can be represented as a sum of r 
geometric random variables X,, X>,...,X,. Each geometric 
random variable represents the additional trials required to 


EXAMPLE 5-18 Layer Thickness 

Suppose X,, X5, and X3 represent the thickness in micrometers 
of a substrate, an active layer, and a coating layer of a chemical 
product. Assume that X,, X>, and X3 are independent and nor- 
mally distributed with p,; = 10000, np» = 1000, p3 = 80, 
0; = 250, o2 = 20, and o3 = 4, respectively. The specifica- 
tions for the thickness of the substrate, active layer, and coat- 
ing layer are 9200 < x, < 10,800, 950 < x. < 1050, and 
75 < x3 < 85, respectively. What proportion of chemical prod- 
ucts meets all thickness specifications? Which one of the three 
thicknesses has the least probability of meeting specifications? 


obtain the next success. Because the trials in a binomial 
experiment are independent, X, X>,..., X, are independent 
random variables. 


The requested probability is P(9200 < X; < 10,800, 
950 < X, < 1050, 75 < X3 < 85. Because the random vari- 
ables are independent, 
P(9200 < X, < 10,800, 950 < X, < 1050, 75 < X3 < 85) 
= P(9200 < X; < 10,800)P(950 < X, 
< 1050) P(75 < X3 < 85) 


After standardizing, the above equals 


P(-3.2 < Z < 3.2)P(—2.5 < Z < 2.5)P(—1.25 < Z < 1.25) 


where Z is a standard normal random variable. From the table 
of the standard normal distribution, the above equals 


(0.99862)(0.98758)(0.78870) = 0.7778 


EXERCISES FOR SECTION 5-1 
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The thickness of the coating layer has the least probability of 
meeting specifications. Consequently, a priority should be to 
reduce variability in this part of the process. 


5-1. Show that the following function satisfies the proper- 
ties of a joint probability mass function. 


x y fay y) 
1 1 1/4 
1.5 2 1/8 
1.5 3 1/4 
2.5 4 1/4 
3} 5 1/8 


Determine the following: 

(a) P(X < 2.5, Y < 3) 

(b) P(X < 2.5) 

(c) P(Y < 3) 

(d) P(X > 1.8, Y > 4.7) 

(e) E(X), E(Y), V(X), and V(Y). 

(f) Marginal probability distribution of the random vari- 
able X 

(g) Conditional probability distribution of Y given that X = 1.5 

(h) Conditional probability distribution of X given that Y = 2 

(i) E(Y|X = 1.5) 

(j) Are X and Y independent? 

5-2. Determine the value of c that makes the function 

f(x, y) = c(x + y) a joint probability mass function over the 

nine points with x = 1, 2,3 and y = 1, 2, 3. 

Determine the following: 

(a) P(X =1,Y<4) (b) P(X = 1) 

(c) P(Y = 2) (d) P(X <2,¥ <2) 

(e) E(X), E(Y), V(X), and V(Y) 

(f) Marginal probability distribution of the random variable X 

(g) Conditional probability distribution of Y given that X = 1 

(h) Conditional probability distribution of X given that Y = 2 

(i) E(Y|X = 1) 

(j) Are X and Y independent? 

5-3. Show that the following function satisfies the proper- 

ties of a joint probability mass function. 


x ay, fry y) 
=i =?) 1/8 
—0.5 —1 1/4 
0.5 1 1/2 
1 2 1/8 


Determine the following: 

(a) P(X < 0.5, Y < 1.5) 

(b) P(X < 0.5) 

(c) P(Y < 1.5) 

(d) P(X > 0.25, Y < 4.5) 

(e) E(X), E(Y), V(X), and V(Y) 

(f) Marginal probability distribution of the random vari- 
able X 

(g) Conditional probability distribution of Y given that 
xX=1 

(h) Conditional probability distribution of X given that 
Y=1 

(i) E(Xly = 1) 

(j) Are X and Y independent? 

5-4. Four electronic printers are selected from a large lot 

of damaged printers. Each printer is inspected and classified 

as containing either a major or a minor defect. Let the random 

variables X and Y denote the number of printers with major 

and minor defects, respectively. Determine the range of the 

joint probability distribution of X and Y. 


5-5. Inthe transmission of digital information, the probability 
that a bit has high, moderate, and low distortion is 0.01, 0.04, and 
0.95, respectively. Suppose that three bits are transmitted and 
that the amount of distortion of each bit is assumed to be 
independent. Let X and Y denote the number of bits with high 
and moderate distortion out of the three, respectively. Determine: 


(a) fxr y) (b) fx(x) 
(c) E(X) ) fry) 
(e) E(Y|X =1) (f) Are X and Y independent? 


5-6. A small-business Web site contains 100 pages and 
60%, 30%, and 10% of the pages contain low, moderate, and 
high graphic content, respectively. A sample of four pages is 
selected without replacement, and X and Y denote the number 
of pages with moderate and high graphics output in the 
sample. Determine: 


(a) fxy(x, y) (b) f(x) 
(c) E(X) (d) fs (y) 
(e) E(Y|X = 3) (f) W(Y|X = 3) 


(g) Are X and Y independent? 


5-7. A manufacturing company employs two devices to 
inspect output for quality control purposes. The first device 
is able to accurately detect 99.3% of the defective items it 
receives, whereas the second is able to do so in 99.7% of the 
cases. Assume that four defective items are produced and 
sent out for inspection. Let X and Y denote the number of 
items that will be identified as defective by inspecting 
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devices | and 2, respectively. Assume the devices are inde- 
pendent. Determine: 


(a) fer y) (b) fx(x) 

(c) E(X) d) fri2(y) 

(e) E(Y|X = 2) (f) V(Y|X = 2) 

(g) Are X and Y independent? 

5-8. Suppose the random variables X, Y, and Z have the 


following joint probability distribution. 


NNN NY FPR FIR 
YN wee KS NS — ie 
NF NY RP NY KF WY KH TA 

So 

N 

Oo 


Determine the following: 


(a) P(X = 2) (b) P(X = 1, Y = 2) 
(c) P(Z < 155) (d) P(X=1 or Z=2) 
(e) E(X) (f) P(X =1|¥= 1) 


(g) P(X =1,¥Y=1|Z=2) (hy) P(X =1|Y=1,Z=2) 
(i) Conditional probability distribution of X given that Y = 1 
and Z = 2 

5-9. An engineering statistics class has 40 students and 
60% are electrical engineering majors, 10% are industrial 
engineering majors, and 30% are mechanical engineering 
majors. A sample of four students is selected randomly, with- 
out replacement, for a project team. Let X and Y denote the 
number of industrial engineering and mechanical engineering 
majors, respectively. Determine the following: 


(a) fxy(% y) (b) f(x) 
(c) E(X) ) fyis () 
(e) E(Y|X = 3) (f) V(YIX = 3) 


(g) Are X and Y independent? 


Average Frequencies and Operations in TPC-C 


5-10. An article in the Journal of Database Manage- 

ment [“Experimental Study of a Self-Tuning Algorithm for 

DBMS Buffer Pools” (2005, Vol. 16, pp. 1-20)] provided 

the workload used in the TPC-C OLTP (Transaction 

Processing Performance Council’s Version C On-Line 

Transaction Processing) benchmark, which simulates a typ- 

ical order entry application. See the table below. The fre- 

quency of each type of transaction (in the second column) 

can be used as the percentage of each type of transaction. 

Let X and Y denote the average number of selects and 

updates operations, respectively, required for each type 

transaction. Determine the following: 

(a) P(X <5) 

(b) E(X) 

(c) Conditional probability mass function of X given Y = 0 

(d) P(X < 6|Y = 0) 

(e) E(X|¥ = 0) 

5-11. For the Transaction Processing Performance 

Council’s benchmark in Exercise 5-10, let X, Y, and Z denote 

the average number of selects, updates, and inserts opera- 

tions required for each type of transaction, respectively. 

Calculate the following: 

(a) fxyz(x, y,z) 

(b) Conditional probability mass function for X and Y given 
Z=0 

(c) P(X <6, ¥ < 6|Z=0) 

(d) E(X|Y = 0, Z = 0) 

5-12. In the transmission of digital information, the 

probability that a bit has high, moderate, or low distortion is 

0.01, 0.04, and 0.95, respectively. Suppose that three bits are 

transmitted and that the amount of distortion of each bit is as- 

sumed to be independent. Let X and Y denote the number of 

bits with high and moderate distortion out of the three trans- 

mitted, respectively. Determine the following: 

(a) The probability that two bits have high distortion and one 
has moderate distortion 

(b) The probability that all three bits have low distortion 

(c) The probability distribution, mean, and variance of X 

(d) The conditional probability distribution, conditional mean, 
and conditional variance of X given that Y = 2 


Non-Unique 
Transaction Frequency Selects Updates Inserts Deletes Selects Joins 
New Order 43 M3) 11 12 0 0 0 
Payment 44 4.2 3 1 0 0.6 0 
Order Status 4 11.4 0 0 0 0.6 0 
Delivery 5 130 120 0 10 0 0 
Stock Level 4 0 0 0 0 0 1 


5-13. Determine the value of c such that the function 
t(, y) = cxy for 0 <x <3 and 0 < y < 3 satisfies the 
properties of a joint probability density function. 
Determine the following: 

(a) P(X <2,¥<3) (b) P(X <2.5) 

(c) Pl < Y < 2.5) (d) P(X > 1.8,1<¥Y< 2.5) 

(e) E(X) (f) P(X <0, Y < 4) 

(g) Marginal probability distribution of the random variable X 
(h) Conditional probability distribution of Y given that X = 1.5 
(i) E(Y|X) = 1.5) (j) P(Y < 2|X = 1.5) 

(k) Conditional probability distribution of X given that Y 5 2 


5-14. Determine the value of c that makes the function 
F(x, y) = cx + y) a joint probability density function over the 
rangeOQ <x<3andx<y<x+2. 

Determine the following: 

(a) P(X<1,¥Y<2) (b) P(1<X <2) 

(c) P(Y > 1) (d) P(X < 2, ¥Y < 2) 

(e) E(X) (f) V(X) 

(g) Marginal probability distribution of X 

(h) Conditional probability distribution of Y given that X = 1 
(i) E(Y|X = 1) 

(j) P(Y > 21X = 1) 

(k) Conditional probability distribution of X given that Y 5 2 


5-15. Determine the value of c that makes the function 
f(x, ») = cxy a joint probability density function over the range 
O<x<3and0<y<x. 

Determine the following: 

(a) P(X <1,¥<2) (b) P(1<X<2) 

(c) P(Y > 1) (d) P(X < 2, Y < 2) 

(e) E(X) (f) E(Y) 

(g) Marginal probability distribution of X 

(h) Conditional probability distribution of Y given X = 1 

(i) E(Y|X = 1) 

(j) P(Y > 2|X = 1) 

(k) Conditional probability distribution of X given Y 5 2 


5-16. Determine the value of c that makes the function 
f(x, y) = ce~**~* a joint probability density function over the 
range0 <xandO<y<~x. 

Determine the following: 

(a) (X<1,Y<2) (b) PL<X<2) 

(c) P(Y > 3) (d) P(X < 2, Y < 2) 

(e) E(X) (f) E(Y) 

(g) Marginal probability distribution of X 

(h) Conditional probability distribution of Y given X = 1 

(i) E(Y|X = 1) 

(j) Conditional probability distribution of X given Y = 2 
5-17. Determine the value of c that makes the function 
f(x,y) = ce~**~»” a joint probability density function over 
the range 0 < x and x < y. 

Determine the following: 

(a) (X<1,Y<2) (b) Pl <X <2) 
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(c) P(Y > 3) (d) P(X < 2, Y < 2) 

(e) E(X) (f) E(Y) 

(g) Marginal probability distribution of X 

(h) Conditional probability distribution of Y given X = 1 

(i) E(YLX = 1) 

(j) P(Y < 2|X =1) 

(k) Conditional probability distribution of X given Y = 2 
5-18. The conditional probability distribution of Y given 
X =x is fy,(y)=xe ” for y > 0, and the marginal 
probability distribution of X is a continuous uniform distribu- 
tion over 0 to 10. 

(a) Graph fyy(y) = xe~*” for y > 0 for several values of x. 


Determine: 
(b) P(¥<2|X=2) (c) E(Y|X = 2) 
(d) E(Y|X = x) (e) fxr(x, y) 
f) fr(y) 
5-19. Two methods of measuring surface smoothness are 


used to evaluate a paper product. The measurements are 
recorded as deviations from the nominal surface smoothness 
in coded units. The joint probability distribution of the 
two measurements is a uniform distribution over the re- 
gion 0 <x<4,0<y, andx-1<y<x+1. That is, 
xy, y) = c for x and y in the region. Determine the value for 
c such that fxy (x, y) is a joint probability density function. 


Determine the following: 

(a) P(X <0.5,Y<0.5)  (b) P(X < 0.5) 

(c) E(X) (d) E(Y) 

(e) Marginal probability distribution of X 

(f) Conditional probability distribution of Y given X = 1 
(g) E(Y|X = 1) (h) P(Y < 0.5|X = 1) 


5-20. The time between surface finish problems in a galva- 
nizing process is exponentially distributed with a mean of 
40 hours. A single plant operates three galvanizing lines that 
are assumed to operate independently. 

(a) What is the probability that none of the lines experiences 
a surface finish problem in 40 hours of operation? 

(b) What is the probability that all three lines experience a 
surface finish problem between 20 and 40 hours of operation? 

(c) Why is the joint probability density function not needed to 
answer the previous questions? 

5-21. A popular clothing manufacturer receives Internet 

orders via two different routing systems. The time between 

orders for each routing system in a typical day is known to be 
exponentially distributed with a mean of 3.2 minutes. Both 
systems operate independently. 

(a) What is the probability that no orders will be received in a 
5-minute period? In a 10-minute period? 

(b) What is the probability that both systems receive two 
orders between 10 and 15 minutes after the site is 
officially open for business? 

(c) Why is the joint probability distribution not needed to 
answer the previous questions? 
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5-22. The blade and the bearings are important parts of a 

lathe. The lathe can operate only when both of them work 

properly. The lifetime of the blade is exponentially distributed 

with the mean three years; the lifetime of the bearings is also 

exponentially distributed with the mean four years. Assume 

each lifetime is independent. 

(a) What is the probability that the lathe will operate at least 
five years? 

(b) The lifetime of the lathe exceeds what time with 95% 
probability? 


5-23. Suppose the random variables X, Y, and Z have the joint 
probability density function f(x, y, z) = 8xyz for0 <x < 1, 
0<y<1,and0<z< 1. Determine the following: 


(a) P(X <0.5) (b) P(X < 0.5, ¥< 0.5) 
(c) P(Z <2) (d) P(X < 0.5 or Z < 2) 
(e) E(X) (f) P(X < 0.5|¥ = 0.5) 


(g) P(X < 0.5, ¥Y < 0.5|Z = 0.8) 

(h) Conditional probability distribution of X given that Y = 
0.5 and Z = 0.8 

(i) P(X <0.5|Y = 0.5, Z = 0.8) 


5-24. Suppose the random variables X, Y, and Z have 
the joint probability density function fyy7(x, y, z) = c over 
the cylinder x* + y? <4 and 0 <z<4. Determine the 
constant c so that fyyz(x, y, z) is a probability density function. 


Determine the following: 

(a) P(X? + Y? <2) 

(c) E(X) 

(e) P(X? + Y? <1|Z=1) 

(f) Conditional probability distribution of Z given that 
X=1landY=1. 


(b) P(Z < 2) 
(d) P(X <1|Y=1) 


5-25. Determine the value of c that makes fyy7(x, y, Z) = c 
a joint probability density function over the region x > 0, 
y>0,z>O0,andx+y+z<1. 

Determine the following: 

(a) P(X < 0.5, Y < 0.5, Z < 0.5) 

(b) P(X < 0.5, Y < 0.5) 

(c) P(X < 0.5) 


5-2 COVARIANCE AND CORRELATION 


(d) E(X) 

(e) Marginal distribution of X 

(f) Joint distribution of X and Y 

(g) Conditional probability distribution of X given that Y = 
0.5 and Z = 0.5 

(h) Conditional probability distribution of X given that 
Y=0.5 


5-26. The yield in pounds from a day’s production is 

normally distributed with a mean of 1500 pounds and standard 

deviation of 100 pounds. Assume that the yields on different 

days are independent random variables. 

(a) What is the probability that the production yield exceeds 
1400 pounds on each of five days next week? 

(b) What is the probability that the production yield exceeds 
1400 pounds on at least four of the five days next week? 


5-27. The weights of adobe bricks used for construction are 

normally distributed with a mean of 3 pounds and a standard 

deviation of 0.25 pound. Assume that the weights of the bricks are 

independent and that a random sample of 20 bricks is selected. 

(a) What is the probability that all the bricks in the sample 
exceed 2.75 pounds? 

(b) What is the probability that the heaviest brick in the 
sample exceeds 3.75 pounds? 


5-28. A manufacturer of electroluminescent lamps knows 

that the amount of luminescent ink deposited on one of 

its products is normally distributed with a mean of 1.2 grams 

and a standard deviation of 0.03 gram. Any lamp with less than 

1.14 grams of luminescent ink will fail to meet customers’ 

specifications. A random sample of 25 lamps is collected and 

the mass of luminescent ink on each is measured. 

(a) What is the probability that at least one lamp fails to meet 
specifications? 

(b) What is the probability that five lamps or fewer fail to 
meet specifications? 

(c) What is the probability that all lamps conform to 
specifications? 

(d) Why is the joint probability distribution of the 25 lamps 
not needed to answer the previous questions? 


When two or more random variables are defined on a probability space, it is useful to 
describe how they vary together; that is, it is useful to measure the relationship between the 
variables. A common measure of the relationship between two random variables is the 
covariance. To define the covariance, we need to describe the expected value of a function of 
two random variables h(X, Y). The definition simply extends the one for a function of a single 


random variable. 


Expected Value of 
a Function of Two 
Random Variables 
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X, Y discrete 


> » h(x, y) fxy(x, y) 
E[h(X, Y)] = (5-13) 


|| h(x, y) fxy(x, y)dxdy X, Y continuous 


That is, E[h(X, Y)] can be thought of as the weighted average of h(x, y) for each point in the 
range of (X, Y). The value of E[h(X, Y)] represents the average value of h(X, Y) that is expected 
in a long sequence of repeated trials of the random experiment. 


EXAMPLE 5-19 Expected Value of a Function of Two Random Variables 


For the joint probability distribution of the two random 


Therefore, 


variables in Fig. 5-12, calculate E[(X — jrx)(Y — py)]. 


The result is obtained by multiplying x — py times y — py, E[(X — px)(¥ — py)] = (1 — 2.4) — 2.0) x 0.1 
times fyy(x, y) for each point in the range of (X, Y). First, wy and + (1 — 2.4)(2 — 2.0) x 0.2 
Jy are determined from the marginal distributions for X and Y: + (3 — 2.4)(1 — 2.0) x 0.2 
py = 1X034+3X0.7 = 24 and + (3 — 2.4)(2 — 2.0) x 0.2 
py =1X03 42 + (3 — 2.4)(3 — 2.0) X 0.3 = 0.2 


x 0.4 + 3 X 0.3 = 2.0 


Covariance 


The covariance is defined for both continuous and discrete random variables by the same 
formula. 


The covariance between the random variables X and Y, denoted as cov(X, Y) or Oxy, is 


Oxy = E|(X — px)(¥ — py)] 


E(XY) — pxby G14) 


If the points in the joint probability distribution of X and Y that receive positive 
probability tend to fall along a line of positive (or negative) slope, oxy is positive (or negative). 
If the points tend to fall along a line of positive slope, X tends to be greater than wy when Y is 
greater than ry. Therefore, the product of the two terms x — jx and y — py tends to be positive. 
However, if the points tend to fall along a line of negative slope, x — jx tends to be positive 
when y — py is negative, and vice versa. Therefore, the product of x — wx and y — py tends 
to be negative. In this sense, the covariance between X and Y describes the variation between 
the two random variables. Figure 5-13 shows examples of pairs of random variables with 
positive, negative, and zero covariance. 

Covariance is a measure of linear relationship between the random variables. If the re- 
lationship between the random variables is nonlinear, the covariance might not be sensitive to 
the relationship. This is illustrated in Fig. 5-13(d). The only points with nonzero probability 
are the points on the circle. There is an identifiable relationship between the variables. Still, 
the covariance is zero. 
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Figure 5-12 Joint distribution Figure 5-13 Joint probability distributions and the sign of covariance between X and Y. 
of X and Y for Example 5-19. 


The equality of the two expressions for covariance in Equation 5-14 is shown for 
continuous random variables as follows. By writing the expectations as integrals, 


E|(Y — py)(X — px)] = | ji (x — Bx)( by) fay(x, y) dxdy 
= | [i — Bxy — Xpy + pyby] fay y) dxdy 
Now 
| | Lxy fxy(x, y) dxdy = wy | ) Yfxy(%, y) dxdy | = pxpy 
Therefore, 


E|(X — wyx)(¥ — py)] = xyfyy(x, y) dxdy — wypby — yby + bxyby 


xyfxy(x, y) dxdy — pxpy = E(XY) — pxpy 


EXAMPLE 5-20 

In Example 5-1, the random variables X and Y are the number 
of signal bars and the number of times you need to state your 
departure city, respectively. Is the covariance between X and Y 
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As the signal bars increase, the number of times you 
need to state your name decreases. Therefore, X and Y have a 
negative covariance. This can be verified from the joint prob- 


positive or negative? 


Correlation 


ability distribution in Fig. 5-1. 


There is another measure of the relationship between two random variables that is often 
easier to interpret than the covariance. 


The correlation between random variables X and Y, denoted as pyy, is 


coviAG Y) 
V(X) V(Y) 


Oxy 
OxOy 


Pxy — (5-15) 


Because oy > 0 and oy > 0, if the covariance between X and Y is positive, negative, or zero, 
the correlation between X and Y is positive, negative, or zero, respectively. The following 
result can be shown. 


For any two random variables X and Y, 


=| == (hey = orl (5-16) 


The correlation just scales the covariance by the product of the standard deviation of each vari- 
able. Consequently, the correlation is a dimensionless quantity that can be used to compare the 
linear relationships between pairs of variables in different units. 

If the points in the joint probability distribution of X and Y that receive positive 
probability tend to fall along a line of positive (or negative) slope, pyy is near +1 (or —1). If 
Pxy equals +1 or —1, it can be shown that the points in the joint probability distribution that 
receive positive probability fall exactly along a straight line. Two random variables with 
nonzero correlation are said to be correlated. Similar to covariance, the correlation is a meas- 
ure of the linear relationship between random variables. 


EXAMPLE 5-21 Covariance 
For the discrete random variables X and Y with the joint 
distribution shown in Fig. 5-14, determine oyy and pyy. 

The calculations for E(XY), E(X), and V(X) are as 
follows. 


E(XY) =0X0X02+1X1X01+1X2X01 
+2X1X014+2X2X01+3X3X04 =45 
E(X) =0X0.2+1X02+2xX02+3X04=18 
V(X) = (0 — 1.8) x 0.2 + (1 — 1.87 x 0.2 
+ (2 — 1.8)? x 0.2 + (3 — 1.8) x 0.4 = 1.36 


Because the marginal probability distribution of Y is the same 
as for X, E(Y) = 1.8 and V(Y) = 1.36. Consequently, 


Oxy = E(XY) — E(X)E(Y) = 4.5 — (1.8)(1.8) = 1.26 


Furthermore, 
Oxy 1.26 
= = 0.926 
Pxy ~ oxoy  (/1.36)(V1.36) 
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Figure 5-14 Joint distribution 
for Example 5-20. 


EXAMPLE 5-22 Correlation 

Suppose that the random variable X has the following distribu- 
tion: P(X = 1) = 0.2, P(X = 2) = 0.6, P(X = 3) = 0.2. Let 
Y = 2X + 5. Thatis, P(Y = 7) = 0.2, P(Y = 9) = 0.6, P(Y = 11) 
= 0.2. Determine the correlation between X and Y. Refer to 
Fig. 5-15. 
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Figure 5-15 Joint distribution 
for Example 5-21. 


Because X and Y are linearly related, p = 1. This can be 
verified by direct calculations: Try it. 


For independent random variables, we do not expect any relationship in their joint prob- 
ability distribution. The following result is left as an exercise. 


If X and Y are independent random variables, 


Oxy = Pxy = 0 


(5-17) 


EXAMPLE 5-23 Independence Implies Zero Covariance 


For the two random variables in Fig. 5-16, show that oxy = 0. 

The two random variables in this example are continuous 
random variables. In this case E(XY) is defined as the double 
integral over the range of (X, Y). That is, 


42 2 
1 

= | | vires neray = 5 | | 

0 0 
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Thus, 


E(XY) — E(X)E(Y) = 32/9 — (4/3)(8/3) = 0 
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It can be shown that these two random variables are in- 
dependent. You can check that fyy(x, y) = fx) f;(9) for all 
xand y. 


However, if the correlation between two random variables is zero, we cannot immediately con- 
clude that the random variables are independent. Figure 5-13(d) provides an example. 


EXERCISES FOR SECTION 5-2 


5-29. Determine the covariance and correlation for the 
following joint probability distribution: 


x 1 1 2 4 
y 3 4 5 6 
Fay, y) 1/8 1/4 1/2 1/8 


5-30. Determine the covariance and correlation for the 
following joint probability distribution: 


x -1 -05 05 1 
y =} -1 1 2 
Sv y) 1/8 1/4 1/2 1/8 


5-31. Determine the value for c and the covariance and 
correlation for the joint probability mass function fyy(x, y) = 
c(x + y) forx = 1, 2,3 andy = 1, 2, 3. 

5-32. Determine the covariance and correlation for the joint 
probability distribution shown in Fig. 5-10(a) and described in 
Example 5-10. 

5-33. Determine the covariance and correlation for X; and 
X, in the joint distribution of the multinomial random vari- 
ables X,, X,, and X; with p, = p) = p; = 4 and n = 3. What 
can you conclude about the sign of the correlation between 
two random variables in a multinomial distribution? 


5-34. For the Transaction Processing Performance Council’s 
benchmark in Exercise 5-10, let X, Y, and Z denote the average 
number of selects, updates, and inserts operations required for 
each type transaction, respectively. Calculate the following: 
(a) Covariance between X and Y 


(b) Correlation between X and Y 
(c) Covariance between X and Z 
(d) Correlation between X and Z 


5-35. Determine the value for c and the covariance and cor- 
relation for the joint probability density function fyy(x, y) = 
cxy over the rangeO <x<3and0<y<x. 


5-36. Determine the value for c and the covariance and cor- 
relation for the joint probability density function fxy(x, y) = c 
over the range O <x <5,0<y, andx-1l<y<x+1. 


5-37. Determine the covariance and correlation for the joint 
probability density function fyy(x, y) = e * ” over the range 
O0<xand0<y. 

5-38. Determine the covariance and correlation for the joint 
probability density function fyy(x, y) = 6 X 107% 0.001 0.09 
over the range 0 < x and x < y from Example 5-2. 


5-39. The joint probability distribution is 


x —l1 0 0 1 
y 0 =1 1 0 
Sxyv@ y) 1/4 1/4 1/4 1/4 


Show that the correlation between X and Y is zero, but X and Y 
are not independent. 


5-40. Suppose X and Y are independent continuous random 
variables. Show that oyy = 0. 


5-41. Suppose that the correlation between X and Y is p. For 
constants a, b, c, and d, what is the correlation between the 
random variables U = aX + band V = cY + d? 
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5-3 COMMON JOINT DISTRIBUTIONS 


5-3.1 Multinomial Probability Distribution 


A joint probability distribution for multiple discrete random variables that is quite useful is an 
extension of the binomial. The random experiment that generates the probability distribution 
consists of a series of independent trials. However, the results from each trial can be catego- 


rized into one of k classes. 


EXAMPLE 5-24 Digital Channel 

We might be interested in a probability such as the following. 
Of the 20 bits received, what is the probability that 14 are ex- 
cellent, 3 are good, 2 are fair, and 1 is poor? Assume that the 
classifications of individual bits are independent events and 
that the probabilities of E, G, F, and P are 0.6, 0.3, 0.08, and 
0.02, respectively. One sequence of 20 bits that produces the 
specified numbers of bits in each class can be represented as 


EEEEEEEEEEEEEEGGGFFP 


Using independence, we find that the probability of this se- 
quence is 


P(EEEEEEEEEEEEEEGGGFFP) = 0.6'40.3°0.0870.02! 
= 2.708 x 107° 


Clearly, all sequences that consist of the same numbers of 
E’s, G’s, F’s, and P’s have the same probability. Consequently, 
the requested probability can be found by multiplying 2.708 x 
10° by the number of sequences with 14 E’s, three G’s, two 
F’s, and one P. The number of sequences is found from 
Chapter 2 to be 
20! 


——__ = 932 
i4isignt 


Therefore, the requested probability is 


P(14E’s, three G’s, two F’s, and one P) 
= 2325600(2.708 x 107°) = 0.0063 


Example 5-24 leads to the following generalization of a binomial experiment and a bino- 


mial distribution. 


Multinomial 
Distribution 


Suppose a random experiment consists of a series of n trials. Assume that 


(1) The result of each trial is classified into one of k classes. 


(2) The probability of a trial generating a result in class 1, class 2, ..., class k 
is constant over the trials and equal to pj, Do, ... 


(3) The trials are independent. 


The random variables X,, X>,..., X, that denote the number of trials that result in 
class 1, class 2,..., class k, respectively, have a multinomial distribution and the 


joint probability mass function is 
P(X, = x1, X> = XQ5)- 1 » Xz = Xp) = 


LOL x Xp es xy, — Nand py + ps > + py — I 


» Pw Tespectively. 


n! 


pepe 5-18 
igen Pi! Pr’. Pk (5-18) 


The multinomial distribution is considered a multivariable extension of the binomial distribution. 


EXAMPLE 5-25 Digital Channel 

In Example 5-9, let the random variables X,, X, X;, and X, de- 
note the number of bits that are E, G, F, and P, respectively, in 
a transmission of 20 bits. The probability that 12 of the bits 
received are E, 6 are G, 2 are F, and 0 are P is 


P(X, = 12, Xp = 6, Xz = 2, X, = 0) 
20! 


= ———__ 0.6!70.3°0.0870.02° = 0. 
a6 17101 0-8" 0-3°0.0870.02° = 0.0358 
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Each trial in a multinomial random experiment can be regarded as either generating or not 
generating a result in class i, for each i = 1, 2,...,k. Because the random variable X; is the 
number of trials that result in class i, X; has a binomial distribution. 


Mean and 


Variance feXG eXG 


tion of X; is binomial with 


. , X, have a multinomial distribution, the marginal probability distribu- 


E(X;) = np; 


and V(X;) = np;(1 — pi) (5-19) 


EXAMPLE 5-26 Marginal Probability Distributions 

In Example 5-25 the marginal probability distribution of X, is 
binomial with n = 20 and p = 0.3. Furthermore, the joint mar- 
ginal probability distribution of X, and X;3 is found as follows. 
The P(X, = x), X; = x3) is the probability that exactly x, trials re- 
sult in G and that x; result in F. The remaining n — x, — x; trials 
must result in either E or P. Consequently, we can consider each 
trial in the experiment to result in one of three classes, {G}, {F'}, 
or {E, P}, with probabilities 0.3, 0.08, and 0.6 + 0.02 = 0.62, 
respectively. With these new classes, we can consider the trials 


5-3.2 Bivariate Normal Distribution 


to comprise a new multinomial experiment. Therefore, 


Frix,(%2, X43) = P(Xp = X2, X3 = x3) 


! 
te (0.3)*(0.08)""(0.62)"27 3 
X!x3! (n = xy x3)! 


The joint probability distribution of other sets of variables can 
be found similarly. 


An extension of a normal distribution to two random variables is an important bivariate prob- 


ability distribution. 


EXAMPLE 5-27 Bivariate Normal Distribution 

At the start of this chapter, the length of different dimensions 
of an injection-molded part was presented as an example of 
two random variables. Each length might be modeled by a nor- 
mal distribution. However, because the measurements are 
from the same part, the random variables are typically not 
independent. A probability distribution for two normal ran- 


dom variables that are not independent is important in many 
applications. As stated at the start of the chapter, if the specifi- 
cations for X and Y are 2.95 to 3.05 and 7.60 to 7.80 millime- 
ters, respectively, we might be interested in the probability that 
a part satisfies both specifications; that is, P(2.95<xX< 
3.05, 7.60 < Y < 7.80). 


Bivariate Normal 
Probability 
Density Function 


The probability density function of a bivariate normal distribution is 


fry(%, Yi Ox, Ty, Wy, ys P) = 


Lore aloe 
QnayayV1 — p? 3 al p’) o% 
_ BE Ss ty) = zal (5-20) 


for —%2 <x < and —% < y < ©, with parameters oy > 0, oy > 0, -~ < py < ™, 
—o<py<%, and -l<p<l. 


2 


OxOy Oy 
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Figure 5-17 Examples of bivariate normal distributions. 


The result that fyy(% y; Ox, Oy, Wy, Wy, P) integrates to 1 is left as an exercise. Also, the bivari- 
ate normal probability density function is positive over the entire plane of real numbers. 

Two examples of bivariate normal distributions are illustrated in Fig. 5-17 along with 
corresponding contour plots. Each curve on the contour plots is a set of points for which the 
probability density function is constant. As seen in the contour plots, the bivariate normal 
probability density function is constant on ellipses in the (x, y) plane. (We can consider a circle 
to be a special case of an ellipse.) The center of each ellipse is at the point (ux, wy). If p > 0 
(p < 0), the major axis of each ellipse has positive (negative) slope, respectively. If p = 0, the 
major axis of the ellipse is aligned with either the x or y coordinate axis. 


EXAMPLE 5-28 

The joint probability density function fyy(x%, y) = This probability density function is illustrated in Fig. 5-18. 
Losey?) = : —— Notice that the contour plot consists of concentric circles 

ae e is a special case of a bivariate normal about the origin. 


distribution with oy = 1, oy = 1, py = 0, py = 0, and p = 0. 


The following results can be shown for a bivariate normal distribution. The details are left 
as an exercise. 


Marginal 
Distributions of If X and Y have a bivariate normal distribution with joint probability density fyy(x, y; 
Devarinee Hormel Ox, Oy, Wx, My, p), the marginal probability distributions of X and Y are normal 
analy nee with means py and pryand standard deviations oy and oy, respectively. (5-21) 
Figure 5-19 illustrates that the marginal probability distributions of X and Y are normal. 
fxy(% y) % 
z 
y 
0) 


x 
Figure 5-18 Bivariate normal probability density Figure 5-19 Marginal probability 


function with oy = 1, oy = 1, p = 0, py = 0, and density functions of a bivariate 
My = 0. normal distribution. 


Conditional 
Distribution of 
Bivariate Normal 
Random Variables 


Correlation of 
Bivariate Normal 
Random Variables 


For Bivariate 
Normal Random 
Variables Zero 
Correlation Implies 
Independence 
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If X and Y have a bivariate normal distribution with joint probability density 
fuy(%, y; Ox, Ty, Ly, by, P), the conditional probability distribution of Y given 
X = xis normal with mean 


Ow OF 
by, = By — BxPgy tT Gy 


px 
and variance 


OF |x mm oF(1 ic p’) 


Furthermore, as the notation suggests, p represents the correlation between X and Y. The 


following result is left as an exercise. 


If X and Y have a bivariate normal distribution with joint probability density function 
Fury, y; Ox, Ty, Ly, by, Pp), the correlation between X and Y is p. (5-22) 


The contour plots in Fig. 5-17 illustrate that as p moves from zero (left graph) to 0.9 (right 
graph), the ellipses narrow around the major axis. The probability is more concentrated about 
a line in the (x, y) plane and graphically displays greater correlation between the variables. If 
p = —1 or +1, all the probability is concentrated on a line in the (x, y) plane. That is, the 
probability that X and Y assume a value that is not on the line is zero. In this case, the bivari- 
ate normal probability density is not defined. 

In general, zero correlation does not imply independence. But in the special case that X 
and Y have a bivariate normal distribution, if p = 0, X and Y are independent. The details are 
left as an exercise. 


If X and Y have a bivariate normal distribution with p = 0, X and Y are independent. 
(5-23) 


An important use of the bivariate normal distribution is to calculate probabilities involving 
two correlated normal random variables. 


EXAMPLE 5-29  Injection-Molded Part 


Suppose that the X and Y dimensions of an injection-molded This probability can be obtained by integrating fy, y; Ox, Oy, 
part have a bivariate normal distribution with oy = 0.04, [tx Py, p) over the region 2.95 < x < 3.05 and 7.0 <y< 


oy = 0.08, py = 3.00, 
probability that a part satisfies both specifications is closed-form solution to probabilities involving bivariate nor- 


ly = 7.70, and p = 0.8. Then, the 7.80, as shown in Fig. 5-3. Unfortunately, there is often no 


mal distributions. In this case, the integration must be done 


P(2.95 < X < 3.05, 7.60 < Y < 7.80) numerically. 
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EXERCISES FOR SECTION 5-3 


5-42. Test results from an electronic circuit board indicate 
that 50% of board failures are caused by assembly defects, 
30% are due to electrical components, and 20% are due to me- 
chanical defects. Suppose that 10 boards fail independently. 
Let the random variables X, Y, and Z denote the number of 
assembly, electrical, and mechanical defects among the 
10 boards. 


Calculate the following: 

(a) P(X =5, Y =3,Z = 2) 

(b) P(X = 8) 

(c) P(X =8|Y=1) 

(d) P(X =8|Y=1) 

(e) P(X =7,Y=1|Z=2) 

5-43. Based on the number of voids, a ferrite slab is classi- 

fied as either high, medium, or low. Historically, 5% of the 

slabs are classified as high, 85% as medium, and 10% as low. 

A sample of 20 slabs is selected for testing. Let X, Y, and Z 

denote the number of slabs that are independently classified as 

high, medium, and low, respectively. 

(a) What are the name and the values of the parameters of the 
joint probability distribution of X, Y, and Z? 

(b) What is the range of the joint probability distribution of 
X,Y,Z? 

(c) What are the name and the values of the parameters of the 
marginal probability distribution of X? 

(d) Determine E(X) and V(X). 

Determine the following: 

(e) P(X = 1, Y = 17, Z = 3) 


(f) P(X <1, Y =17,Z =3) 

(g) P(X = 1) (h) E(Y) 

(i) P(X =2,Z=3|Y¥=17) (j) P(X =2|Y =17) 
( 


(k) E(X|¥ = 17) 


5-44. A Web site uses ads to route visitors to one of four land- 
ing pages. The probabilities for each landing page are equal. 
Consider 20 independent visitors and let the random variables W, 
X, Y, and Z denote the number of visitors routed to each page. 


Calculate the following: 

(a) PW =5,X =5,Y=5,Z=5) 
(b) PW =5,X =5, Y=5) 

(c) (QW =7,X = 7, Y =6|Z=3) 


(d) QW = 7,X =7, Y =3|Z=3) 
(e) PW = 2) 
(f) ECW) 


(g) (W = 5,X =5) 
(h) P(W = 5|X =5) 


5-45. Four electronic ovens that were dropped during ship- 

ment are inspected and classified as containing either a major, a 

minor, or no defect. In the past, 60% of dropped ovens had a ma- 

jor defect, 30% had a minor defect, and 10% had no defect. 

Assume that the defects on the four ovens occur independently. 

(a) Is the probability distribution of the count of ovens in each 
category multinomial? Why or why not? 


(b) What is the probability that, of the four dropped ovens, two 
have a major defect and two have a minor defect? 
(c) What is the probability that no oven has a defect? 


Determine the following: 

(d) The joint probability mass function of the number of ovens 
with a major defect and the number with a minor defect 

(e) The expected number of ovens with a major defect 

(f) The expected number of ovens with a minor defect 

(g) The conditional probability that two ovens have major 
defects given that two ovens have minor defects 

(h) The conditional probability that three ovens have major 
defects given that two ovens have minor defects 

(i) The conditional probability distribution of the number of 
ovens with major defects given that two ovens have minor 
defects 

(j) The conditional mean of the number of ovens with major 
defects given that two ovens have minor defects. 


5-46. Let X and Yrepresent concentration and viscosity of a 
chemical product. Suppose X and Y have a bivariate normal 
distribution with oy = 4, oy = 1, py = 2, and py = 1. Draw 
a rough contour plot of the joint probability density function 
for each of the following values of p: 


(a) p=0 (b) p=0.8 
(c) p= —0.8 
5-47. Suppose X and Y have a bivariate normal distribution 


with oy = 0.04, oy = 0.08, py = 3.00, py = 7.70, and p = 0. 


Determine the following: 
(a) P(2.95 < X < 3.05) 
(b) P(7.60 < Y < 7.80) 
(c) P(2.95 < X < 3.05, 7.60 < Y < 7.80) 


5-48. In an acid-base titration, a base or acid is gradually 
added to the other until they have completely neutralized 
each other. Let X and Y denote the milliliters of acid and base 
needed for equivalence, respectively. Assume X and Y have a 
bivariate normal distribution with oy = 5 mL, gy = 2 mL, 
py = 120 mL, py = 100 mL, and p = 0.6. 


Determine the following: 

(a) Covariance between X and Y 

(b) Marginal probability distribution of X 

(c) P(X < 116) 

(d) Conditional probability distribution of X given that 
Y = 102 

(e) P(X < 116|Y = 102) 


5-49. In the manufacture of electroluminescent lamps, 
several different layers of ink are deposited onto a plastic 
substrate. The thickness of these layers is critical if specifi- 
cations regarding the final color and intensity of light are to 
be met. Let X and Y denote the thickness of two different 
layers of ink. It is known that X is normally distributed with 
a mean of 0.1 millimeter and a standard deviation of 
0.00031 millimeter, and Y is normally distributed with a 


mean of 0.23 millimeter and a standard deviation of 0.00017 
millimeter. The value of p for these variables is equal to 
zero. Specifications call for a lamp to have a thickness of the 
ink corresponding to X in the range of 0.099535 to 0.100465 
millimeter and Y in the range of 0.22966 to 0.23034 mil- 
limeter. What is the probability that a randomly selected 
lamp will conform to specifications? 


5-50. Suppose that X and Y have a bivariate normal distri- 

bution with joint probability density function fyy(x, y; oy, Oy, 

Bex, Hy; P)- 

(a) Show that the conditional distribution of Y, given that 
X = x, is normal. 

(b) Determine E(Y|X = x). 

(c) Determine V(Y|X = x). 


5-4 LINEAR FUNCTIONS OF RANDOM VARIABLES 181 


5-51. If X and Y have a bivariate normal distribution with 
p = 0, show that X and Y are independent. 


5-52. Show that the probability density function fyy (x, y; 
Oy, Oy, bx, by, P) Of a bivariate normal distribution integrates 
to one. [Hint: Complete the square in the exponent and use the 
fact that the integral of a normal probability density function 
for a single variable is 1.] 


5-53. If X and Y have a bivariate normal distribution with 
joint probability density fyy (x, y; Ox, Ty, Lx, Py, Pp), Show that 
the marginal probability distribution of X is normal with mean 
uy and standard deviation oy. [Hint: Complete the square in 
the exponent and use the fact that the integral of a normal 
probability density function for a single variable is 1.] 


5-4 LINEAR FUNCTIONS OF RANDOM VARIABLES 


A random variable is sometimes defined as a function of one or more random variables. In 
this section, results for linear functions are highlighted because of their importance in the 
remainder of the book. For example, if the random variables X, and X, denote the length 
and width, respectively, of a manufactured part, Y = 2X, + 2X, is a random variable that 
represents the perimeter of the part. As another example, recall that the negative binomial 
random variable was represented as the sum of several geometric random variables. 

In this section, we develop results for random variables that are linear combinations of 


random variables. 


Linear 
Combination Given random variables X;, X>, ... , X, and constants Cc), €, ... , Cy, 
SoC AG tre hte een (5-24) 
is a linear combination of X,, X5,... 9 sep 


Now, E(Y) can be found from the joint probability distribution of X), X>,... , X, as follows. 


Assume X,, X), .. 
for discrete random variables. 


E(Y) = | | ee | (cx) + C2%X2 ae ote Cy Xp)Fx, x... X,(F1> XQ, +055 


. ,X, are continuous random variables. An analogous calculation can be used 


Xp) dx, dx... AX, 


co oe} 
| ni | 1 fix... X(Mb Xp +++» Xp) dy Ary... AX, 
co —co 


2Fx,X)... x,(X15 X2, ey Aig) Ay ON pec Ces A atte 


Xp Fx, X... x,(%, XQ, 000s ay) dx dX... dx, 


8 
s—8 {8 | 
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By using Equation 5-10 for each of the terms in this expression, we obtain the following. 


Mean of a 
Linear Function 


IfY= CX ar 2X4 ap eco ap eK 


Pp? 


EY) = c)E(X;) ae CE (Xp) me eer oe Cp E(X,) (5-25) 


Furthermore, it is left as an exercise to show the following. 


Variance of a 
Linear Function feXqep Xo 


general, 


Ke Os 


VY) = c{V(X1) + c3V(X>) + 


., X, are random variables, and Y = c,;X; + c)X) + ++: + c,X,, then in 


p“p 


., X, are independent, 


+ + VW(X,) +2) Dd cic; cov(X;, X)) (5-26) 
i<j 
V(X) = c{V(X1) + c3V(Xp) +--+ + GV(X,) (5-27) 


Note that the result for the variance in Equation 5-27 requires the random variables to be 
independent. To see why the independence is important, consider the following simple exam- 
ple. Let X, denote any random variable and define X, = —X;,. Clearly, X, and X, are not inde- 
pendent. In fact, pyy = —1. Now, Y = X, + X, is 0 with probability 1. Therefore, VY) = 0, 
regardless of the variances of X, and X>. 


EXAMPLE 5-30 Negative Binomial Distribution 

In Chapter 3, we found that if Yis a negative binomial random 
variable with parameters p and r, Y= X, + X, +-:- + X,, 
where each X; is a geometric random variable with parameter 


p, and they are independent. Therefore, E(X;) = 1/p and 


V(X,) = (1 — p)/p?. From Equation 5-25, E(Y) = r/p, and 
from Equation 5-27, V(Y) = r(1 — p)/p*. 


An approach similar to the one applied in the above example can be used to verify the 
formulas for the mean and variance of an Erlang random variable in Chapter 4. An important 
use of Equation 5-27 is in error propagation and this is presented in the following example. 


EXAMPLE 5-31 Error Propagation 
A semiconductor product consists of three layers. If the vari- 
ances in thickness of the first, second, and third layers are 25, 
40, and 30 nanometers squared, what is the variance of the 
thickness of the final product? 

Let X,, X>, X3, and X be random variables that denote the 
thickness of the respective layers, and the final product. Then, 


X=X,4+R%4+% 


The variance of X is obtained from Equaion 5-27: 


V(X) = WiX1) + V(X2) + V(X3) 
= 25 + 40 + 30 = 95 nm” 


Consequently, the standard deviation of thickness of the final 
product is 95" = 9.75 nm and this shows how the variation in 
each layer is propagated to the final product. 


The particular linear function that represents the average of p random variables, with 
identical means and variances, is used quite often in subsequent chapters. We highlight the 


results for this special case. 
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Mean and 
Variance of If X = (X, + X) + +++ + X,)/p with E(X) = p fori = 1, 2,...,p, 
an Average 


HX) = p (5-28a) 


If X;, X, ... , X, are also independent with V(X;) = Or itary = 1, Dy oor [Dp 


o 


x) = = (5-28b) 


The conclusion for V(X ) is obtained as follows. Using Equation 5-27, with c; = 1/p and 
V(X) = o°, yields 


V(X) = (1/p)’o? + +++ + (1/p)°o? = o7/p 
—<———_ 
p terms 


Another useful result concerning linear functions of random variables is a reproductive 
property that holds for independent, normal random variables. 


Reproductive 
Property of If X, X>,..., X, are independent, normal random variables with E(X;) = p,; and 
the Normal (ie teaG fon ee 
Distribution 


Y= CX, ar 2X9 ap 008 Sp CpXp 


is anormal random variable with 
E(Y) = cypy + Cog +++ + Cpbyp 
and 


VY) = cio, cso, co, (5-29) 


The mean and variance of Y follow from Equations 5-25 and 5-27. The fact that Y has a nor- 
mal distribution can be obtained from supplemental material on moment-generating functions 
on the Web site for the book. 


EXAMPLE 5-32 Linear Function of Independent Normal Random Variables 
Let the random variables X, and X, denote the length and centimeters and the variance of Y is 
width, respectively, of a manufactured part. Assume that X, is 


= 2 2 = 
normal with E(X,) = 2 centimeters and standard deviation Ur) = 4 x OI +4 x 0.2" = 0.2 


0.1 centimeter, and that X, is normal with E(X,) = 5 centime- Now, 

ters and standard deviation 0.2 centimeter. Also, assume that 

X, and X, are independent. Determine the probability that the P(Y > 14.5) = P[(Y — py)/oy > (14.5 — 14)/V0.2] 
perimeter exceeds 14.5 centimeters. = P(Z > 1.12) = 0.13 


Then, Y = 2X, + 2X, is a normal random variable that 
represents the perimeter of the part. We obtain E(Y) = 14 
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EXAMPLE 5-33 Beverage Volume 
Soft-drink cans are filled by an automated filling machine. The 
mean fill volume is 12.1 fluid ounces, and the standard devia- 
tion is 0.1 fluid ounce. Assume that the fill volumes of the cans 
are independent, normal random variables. What is the proba- 
bility that the average volume of 10 cans selected from this 
process is less than 12 fluid ounces? 

Let X,, X5, ... , Xj denote the fill volumes of the 10 cans. 
The average fill volume (denoted as X) is a normal random 
variable with 


EXERCISES FOR SECTION 5-4 


2 


ae = 0.1 
E(X) = 12.1 and V(X) = 0. 0.001 
Consequently, 
= X—py — 12-1211 
P(X < 12) =P 0.001 


= P(Z < —3.16) = 0.00079 


5-54. Xand Yare independent, normal random variables with 
E(X) = 0, V(X) = 4, E(Y) = 10, and ViY) = 9. 

Determine the following: 
(a) E(2X + 3Y) 

(c) P(2X + 3Y < 30) (d) P(2X + 3Y < 40) 

5-55. X and Y are independent, normal random variables 
with E(X) = 2, V(X) = 5, E(Y) = 6, and V(Y) = 8. 
Determine the following: 
(a) E(3X + 2Y) 


(b) V(2X + 3Y) 


(b) V(i3X + 2Y) 

(c) P(3X + 2Y < 18) (d) P(3X + 2Y < 28) 

5-56. Suppose that the random variable X represents the 
length of a punched part in centimeters. Let Y be the length 
of the part in millimeters. If E(x) = 5 and V(X) = 0.25, what 
are the mean and variance of Y? 


5-57. A plastic casing for a magnetic disk is composed of 

two halves. The thickness of each half is normally distributed 

with a mean of 2 millimeters and a standard deviation of 

0.1 millimeter and the halves are independent. 

(a) Determine the mean and standard deviation of the total 
thickness of the two halves. 

(b) What is the probability that the total thickness exceeds 
4.3 millimeters? 


5-58. Making handcrafted pottery generally takes two 
major steps: wheel throwing and firing. The time of wheel 
throwing and the time of firing are normally distributed 
random variables with means of 40 min and 60 min and stan- 
dard deviations of 2 min and 3 min, respectively. 

(a) What is the probability that a piece of pottery will be fin- 

ished within 95 min? 
(b) What is the probability that it will take longer than 110 min? 


5-59. In the manufacture of electroluminescent lamps, sev- 
eral different layers of ink are deposited onto a plastic sub- 
strate. The thickness of these layers is critical if specifications 
regarding the final color and intensity of light are to be met. 
Let X and Y denote the thickness of two different layers of ink. 
It is known that X is normally distributed with a mean of 0.1 
millimeter and a standard deviation of 0.00031 millimeter and 
Y is also normally distributed with a mean of 0.23 millimeter 
and a standard deviation of 0.00017 millimeter. Assume that 
these variables are independent. 


(a) If a particular lamp is made up of these two inks only, 
what is the probability that the total ink thickness is less 
than 0.2337 millimeter? 

(b) A lamp with a total ink thickness exceeding 0.2405 mil- 
limeter lacks the uniformity of color demanded by the 
customer. Find the probability that a randomly selected 
lamp fails to meet customer specifications. 

5-60. The width of a casing for a door is normally distrib- 

uted with a mean of 24 inches and a standard deviation of 

1/8 inch. The width of a door is normally distributed with a 

mean of 23-7/8 inches and a standard deviation of 1/16 inch. 

Assume independence. 

(a) Determine the mean and standard deviation of the differ- 
ence between the width of the casing and the width of the 
door. 

(b) What is the probability that the width of the casing minus 
the width of the door exceeds 1/4 inch? 

(c) What is the probability that the door does not fit in the 
casing? 

5-61. An article in Knee Surgery Sports Traumatology, 
Arthroscopy (“Effect of Provider Volume on Resource 
Utilization for Surgical Procedures” (2005, Vol. 13, pp. 
273-279)| showed a mean time of 129 minutes and a standard 
deviation of 14 minutes for ACL reconstruction surgery for 
high-volume hospitals (with more than 300 such surgeries per 
year). If a high-volume hospital needs to schedule 10 surger- 
ies, what are the mean and variance of the total time to com- 
plete these surgeries? Assume the times of the surgeries are in- 
dependent and normally distributed. 


5-62. Soft-drink cans are filled by an automated filling 

machine and the standard deviation is 0.5 fluid ounce. Assume 

that the fill volumes of the cans are independent, normal 

random variables. 

(a) What is the standard deviation of the average fill volume 
of 100 cans? 

(b) If the mean fill volume is 12.1 ounces, what is the proba- 
bility that the average fill volume of the 100 cans is below 
12 fluid ounces? 

(c) What should the mean fill volume equal so that the proba- 
bility that the average of 100 cans is below 12 fluid ounces 
is 0.005? 
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(d) If the mean fill volume is 12.1 fluid ounces, what should 
the standard deviation of fill volume equal so that the 
probability that the average of 100 cans is below 12 fluid 
ounces is 0.005? 

(e) Determine the number of cans that need to be measured 
such that the probability that the average fill volume is 
less than 12 fluid ounces is 0.01. 

5-63. The photoresist thickness in semiconductor manufac- 

turing has a mean of 10 micrometers and a standard deviation of 

1 micrometer. Assume that the thickness is normally distributed 

and that the thicknesses of different wafers are independent. 

(a) Determine the probability that the average thickness of 10 
wafers is either greater than 11 or less than 9 micrometers. 

(b) Determine the number of wafers that need to be measured 
such that the probability that the average thickness ex- 
ceeds 11 micrometers is 0.01. 

(c) If the mean thickness is 10 micrometers, what should the 
standard deviation of thickness equal so that the probability 
that the average of 10 wafers is either greater than 11 
or less than 9 micrometers is 0.001? 

5-64. Assume that the weights of individuals are indepen- 

dent and normally distributed with a mean of 160 pounds and a 

standard deviation of 30 pounds. Suppose that 25 people 

squeeze into an elevator that is designed to hold 4300 pounds. 

(a) What is the probability that the load (total weight) exceeds 
the design limit? 

(b) What design limit is exceeded by 25 occupants with prob- 
ability 0.0001? 

5-65. Weights of parts are normally distributed with variance 

o”. Measurement error is normally distributed with mean zero 

and variance 0.507, independent of the part weights, and adds to 

the part weight. Upper and lower specifications are centered at 

30 about the process mean. 


(a) Without measurement error, what is the probability that a 
part exceeds the specifications? 

(b) With measurement error, what is the probability that a part 
is measured as beyond specifications? Does this imply it is 
truly beyond specifications? 

(c) What is the probability that a part is measured beyond 
specifications if the true weight of the part is one o below 
the upper specification limit? 


5-66. A U-shaped component is to be formed from the three 
parts A, B, and C. The picture is shown in Fig. 5-20. The length 
of A is normally distributed with a mean of 10 millimeters and 
a standard deviation of 0.1 millimeter. The thickness of parts B 
and C is normally distributed with a mean of 2 millimeters and 
a standard deviation of 0.05 millimeter. Assume all dimensions 
are independent. 


>|/B |< >|C |« 
<———— JD ———> 
B (E 
A 
. ‘ | 


Figure 5-20 Figure for the 
U-shaped component. 


(a) Determine the mean and standard deviation of the length 
of the gap D. 

(b) What is the probability that the gap D is less than 5.9 mil- 
limeters? 
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In many situations in statistics, it is necessary to derive the probability distribution of a func- 
tion of one or more random variables. In this section, we present some results that are helpful 


in solving this problem. 


Suppose that X is a discrete random variable with probability distribution fy(x). Let Y = 
h(X) be a function of X that defines a one-to-one transformation between the values of X and 
Y, and we wish to find the probability distribution of Y. By a one-to-one transformation, we 
mean that each value x is related to one and only one value of y = h(x) and that each value of 
y is related to one and only one value of x, say, x = u(y), where u(y) is found by solving y = 


h(x) for x in terms of y. 


Now, the random variable Y takes on the value y when X takes on the value u(y). 
Therefore, the probability distribution of Y is 


fr(y) 


We may state this result as follows. 


PY=y) = P(x 


u(y)] = fxlu(y)] 
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General 
Puan of Suppose that X is a discrete random variable with probability distribution f,(x). Let 
a Discrete Y = h(X) define a one-to-one transformation between the values of X and Y so that 
Random Variable 


the equation y = h(x) can be solved uniquely for x in terms of y. Let this solution be 
x = u(y). Then the probability mass function of the random variable Y is 


frly) = fxlu(y)] (5-30) 


EXAMPLE 5-34 Function of a Discrete Random Variable 


Let X be a geometric random variable with probability distri- Since X = 0, the transformation is one to one; that is, y = 
bution x and x = Vy. Therefore, Equation 5-30 indicates that the 
f(x) = p(l - py, pe: distribution of the random variable Y is 


Find the probability distribution of Y = X?. fly) =fCV¥) = pL - py", y = 1,4, 9, 16... 


We now consider the situation where the random variables are continuous. Let Y = h(X), 
with X continuous and the transformation one to one. 


General 

Function of Suppose that X is a continuous random variable with probability distribution f,(x). 

2 Coutinmons The function Y = h(X ) is a one-to-one transformation between the values of Y and X 

Random Variable so that the equation y = h(x) can be uniquely solved for x in terms of y. Let this 
solution be x = u(y). The probability distribution of Y is 


fry) = felu(y)] | JI (5-31) 


where J = u'(y) is called the Jacobian of the transformation and the absolute value 
of J is used. 


Equation 5-31 is shown as follows. Let the function y = A(x) be an increasing function of x. Now, 
P[X = u(a)] 

u(a) 

= | slo)ax 


—0c0 


P(Y Sa) 


If we change the variable of integration from x to y by using x = u(y), we obtain dx = u'(y) dy 
and then 

Poy s a) = | faluo)]e() dy 
Since the integral gives the probability that Y < a for all values of a contained in the feasible 
set of values for y, fy[u(y)]u'(y) must be the probability density of Y. Therefore, the proba- 
bility distribution of Y is 


Frly) = fxlu(y) u(y) = Axle) 


If the function y = A(x) is a decreasing function of x, a similar argument holds. 
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EXAMPLE 5-35 Function of a Continuous Random Variable 


Let X be a continuous random variable with probability distri- 
bution 


x 
fx) = 3) 0O=x<4 


Find the probability distribution of Y = h(X) = 2X + 4. 


Note that y = h(x) = 2x + 4 is an increasing function of 
x. The inverse solution is x = u(y) = (y — 4)/2, and from this 


EXERCISES FOR SECTION 5-5 


we find the Jacobian to be J = u' (y) = dx/dy = 1/2. Therefore, 
from Equation 5-31, the probability distribution of Y is 


fry) = 4sy=12 


(y (5) we4 


5-67. Suppose that X is a random variable with probability 
distribution 


fxs) = 1/4, x= 1,2,3,4 


Find the probability distribution of Y = 2X + 1. 

5-68. Let X be a binomial random variable with p = 0.25 
and n = 3. Find the probability distribution of the random 
variable Y = X?. 

5-69. Suppose that X is a continuous random variable with 
probability distribution 


(a) Find the probability distribution of the random variable 
Y=2xX + 10. 
(b) Find the expected value of Y. 


5-70. Suppose that X has a uniform probability distribution 


Show that the probability distribution of the random variable 
Y = —2 In X is chi-squared with two degrees of freedom. 
5-71. A random variable X has the following probability 
distribution: 

FxQx) =e, x20 
(a) Find the probability distribution for Y = X’. 
(b) Find the probability distribution for Y = X!/”. 
(c) Find the probability distribution for Y = In X. 
5-72. The velocity of a particle in a gas is arandom variable 
V with probability distribution 

flv) = ave” v>0 

where bD is a constant that depends on the temperature of the 
gas and the mass of the particle. 
(a) Find the value of the constant a. 


(b) The kinetic energy of the particle is W = mV7/2. Find the 
probability distribution of W. 


5-73. Suppose that X has the probability distribution 


Find the probability distribution of the random variable 
Y=e%. 
5-74. The random variable X has the probability distri- 
bution 


x 


fx(x) = 8 


O=x=4 


Find the probability distribution of Y = (X — 2)”. 


Supplemental Exercises 


5-75. Show that the following function satisfies the proper- 
ties of a joint probability mass function: 


x y f(y) 
0 0 1/4 
0 1 1/8 
1 0 1/8 
1 1 1/4 
2 2, 1/4 

Determine the following: 

(a) P(X <0.5,¥Y< 1.5) (b) P(X <1) 


(c) P(X < 1.5) (d) P(X > 0.5, Y < 1.5) 

(e) Determine F(X), E(Y), V(X), and V(Y). 

(f) Marginal probability distribution of the random vari- 
able X 

(g) Conditional probability distribution of Y given that X = 1 

(h) E(Y|X = 1) 

(i) Are X and Y independent? Why or why not? 

(j) Calculate the correlation between X and Y. 

5-76. The percentage of people given an antirheumatoid 

medication who suffer severe, moderate, or minor side effects 

are 10, 20, and 70%, respectively. Assume that people react 
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independently and that 20 people are given the medication. 

Determine the following: 

(a) The probability that 2, 4, and 14 people will suffer severe, 
moderate, or minor side effects, respectively 

(b) The probability that no one will suffer severe side effects 

(c) The mean and variance of the number of people who will 
suffer severe side effects 

(d) What is the conditional probability distribution of the 
number of people who suffer severe side effects given that 
19 suffer minor side effects? 

(e) What is the conditional mean of the number of people who 
suffer severe side effects given that 19 suffer minor side 
effects? 


5-77. The backoff torque required to remove bolts in a steel 

plate is rated as high, moderate, or low. Historically, the prob- 

ability of a high, moderate, or low rating is 0.6, 0.3, or 0.1, 

respectively. Suppose that 20 bolts are evaluated and that the 

torque ratings are independent. 

(a) What is the probability that 12, 6, and 2 bolts are rated as 
high, moderate, and low, respectively? 

(b) What is the marginal distribution of the number of bolts 
rated low? 

(c) What is the expected number of bolts rated low? 

(d) What is the probability that the number of bolts rated low 
is greater than two? 

(e) What is the conditional distribution of the number of bolts 
rated low given that 16 bolts are rated high? 

(f)) What is the conditional expected number of bolts rated 
low given that 16 bolts are rated high? 

(g) Are the numbers of bolts rated high and low independent 
random variables? 


5-78. To evaluate the technical support from a computer 

manufacturer, the number of rings before a call is answered by 

a service representative is tracked. Historically, 70% of the 

calls are answered in two rings or less, 25% are answered in 

three or four rings, and the remaining calls require five rings 
or more. Suppose you call this manufacturer 10 times and 
assume that the calls are independent. 

(a) What is the probability that eight calls are answered in two 
rings or less, one call is answered in three or four rings, 
and one call requires five rings or more? 

(b) What is the probability that all 10 calls are answered in 
four rings or less? 

(c) What is the expected number of calls answered in four 
rings or less? 

(d) What is the conditional distribution of the number of calls 
requiring five rings or more given that eight calls are 
answered in two rings or less? 

(e) What is the conditional expected number of calls requir- 
ing five rings or more given that eight calls are answered in 
two rings or less? 

(f) Are the number of calls answered in two rings or less and 
the number of calls requiring five rings or more independ- 
ent random variables? 


5-79. Determine the value of c such that the function 
f(x, y) = cx’y for 0 <x <3 and 0 < y <2 satisfies the 
properties of a joint probability density function. 

Determine the following: 

(a) P(X<1,¥Y<1) (b) P(X <235) 

(c) Pl < Y < 2.5) (d) P(X >2,1<Y< 15) 

(e) E(X) (f) E(Y) 

(g) Marginal probability distribution of the random variable X 
(h) Conditional probability distribution of Y given that X = 1 
(i) Conditional probability distribution of X given that Y = 1 


5-80. The joint distribution of the continuous random vari- 

ables X, Y, and Z is constant over the region r+ y = 1, 

0 < z <4. Determine the following: 

(a) P(X? + Y? <= 0.5) 

(b) P(X? + Y? = 0.5, Z < 2) 

(c) Joint conditional probability density function of X and Y 
given that Z = 1 

(d) Marginal probability density function of X 

(e) Conditional mean of Z given that X = 0 and Y = 0 

(f) Conditional mean of Z given that X = x and Y = y 


5-81. Suppose that X and Y are independent, continuous 
uniform random variables for 0 < x < 1 andO < y < 1. Use 
the joint probability density function to determine the proba- 
bility that | X — ¥| < 0.5. 


5-82. The lifetimes of six major components in a copier are 

independent exponential random variables with means of 8000, 

10,000, 10,000, 20,000, 20,000, and 25,000 hours, respectively. 

(a) What is the probability that the lifetimes of all the compo- 
nents exceed 5000 hours? 

(b) What is the probability that at least one component life- 
time exceeds 25,000 hours? 


5-83. Contamination problems in semiconductor manufac- 

turing can result in a functional defect, a minor defect, or no 

defect in the final product. Suppose that 20, 50, and 30% of the 

contamination problems result in functional, minor, and no de- 

fects, respectively. Assume that the defects of 10 contamina- 

tion problems are independent. 

(a) What is the probability that the 10 contamination problems 
result in two functional defects and five minor defects? 

(b) What is the distribution of the number of contamination 
problems that result in no defects? 

(c) What is the expected number of contamination problems 
that result in no defects? 


5-84. The weight of adobe bricks for construction is 

normally distributed with a mean of 3 pounds and a standard 

deviation of 0.25 pound. Assume that the weights of the bricks 

are independent and that a random sample of 25 bricks is 

chosen. 

(a) What is the probability that the mean weight of the sample 
is less than 2.95 pounds? 

(b) What value will the mean weight exceed with probability 
0.99? 
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5-85. The length and width of panels used for interior doors 

(in inches) are denoted as X and Y, respectively. Suppose that X 

and Y are independent, continuous uniform random variables 

for 17.75 < x < 18.25 and 4.75 < y < 5.25, respectively. 

(a) By integrating the joint probability density function over 
the appropriate region, determine the probability that the 
area of a panel exceeds 90 square inches. 

(b) What is the probability that the perimeter of a panel 
exceeds 46 inches? 


5-86. The weight of a small candy is normally distributed 

with a mean of 0.1 ounce and a standard deviation of 0.01 

ounce. Suppose that 16 candies are placed in a package and 

that the weights are independent. 

(a) What are the mean and variance of package net weight? 

(b) What is the probability that the net weight of a package is 
less than 1.6 ounces? 

(c) If 17 candies are placed in each package, what is the 
probability that the net weight of a package is less than 
1.6 ounces? 


5-87. The time for an automated system in a warehouse to 

locate a part is normally distributed with a mean of 45 seconds 

and a standard deviation of 30 seconds. Suppose that inde- 

pendent requests are made for 10 parts. 

(a) What is the probability that the average time to locate 10 
parts exceeds 60 seconds? 

(b) What is the probability that the total time to locate 10 
parts exceeds 600 seconds? 


5-88. A mechanical assembly used in an automobile engine 
contains four major components. The weights of the 
components are independent and normally distributed with 
the following means and standard deviations (in ounces): 


Standard 
Component Mean Deviation 
Left case 4 0.4 
Right case 55 0.5 
Bearing assembly 10 0.2 
Bolt assembly 8 0.5 


(a) What is the probability that the weight of an assembly 
exceeds 29.5 ounces? 

(b) What is the probability that the mean weight of eight 
independent assemblies exceeds 29 ounces? 


5-89. Suppose X and Y have a bivariate normal distribution 
with oy = 4, oy=1, py =4, py =4, and p = —0.2. 
Draw a rough contour plot of the joint probability density 
function. 


5-90. 


If fuy(x y) = ep {= Gr 


1 
1.27 


determine E(X), E(Y), V(X), V(Y), and p by reorganizing the 
parameters in the joint probability density function. 


5-91. The permeability of a membrane used as a moisture 

barrier in a biological application depends on the thickness of 

two integrated layers. The layers are normally distributed with 

means of 0.5 and 1 millimeters, respectively. The standard 

deviations of layer thickness are 0.1 and 0.2 millimeters, 

respectively. The correlation between layers is 0.7. 

(a) Determine the mean and variance of the total thickness of 
the two layers. 

(b) What is the probability that the total thickness is less than 
1 millimeter? 

(c) Let X, and X, denote the thickness of layers 1 and 2, re- 
spectively. A measure of performance of the membrane is 
a function 2X, + 3X, of the thickness. Determine the 
mean and variance of this performance measure. 


5-92. The permeability of a membrane used as a moisture 
barrier in a biological application depends on the thickness of 
three integrated layers. Layers 1, 2, and 3 are normally dis- 
tributed with means of 0.5, 1, and 1.5 millimeters, respec- 
tively. The standard deviations of layer thickness are 0.1, 0.2, 
and 0.3, respectively. Also, the correlation between layers 1 
and 2 is 0.7, between layers 2 and 3 is 0.5, and between layers 
1 and 3 is 0.3. 
(a) Determine the mean and variance of the total thickness of 
the three layers. 
(b) What is the probability that the total thickness is less than 
1.5 millimeters? 


5-93. Asmall company is to decide what investments to use 
for cash generated from operations. Each investment has a 
mean and standard deviation associated with the percentage 
gain. The first security has a mean percentage gain of 5% with 
a standard deviation of 2%, and the second security provides 
the same mean of 5% with a standard deviation of 4%. The 
securities have a correlation of —0.5, so there is a negative 
correlation between the percentage returns. If the company 
invests two million dollars with half in each security, what are 
the mean and standard deviation of the percentage return? 
Compare the standard deviation of this strategy to one that 
invests the two million dollars into the first security only. 


5-94. An order of 15 printers contains four with a 

graphics-enhancement feature, five with extra memory, and 

six with both features. Four printers are selected at random, 

without replacement, from this set. Let the random variables 

X, Y, and Z denote the number of printers in the sample with 

graphics enhancement only, extra memory only, and both, 

respectively. 

(a) Describe the range of the joint probability distribution of 
X, Y, and Z. 

(b) Is the probability distribution of X, Y, and Z multinomial? 
Why or why not? 

(c) Determine the conditional probability distribution of X 
given that Y = 2. 
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Determine the following: 

(d) P(X =1,Y =2,Z = 1) 
(f) E(X) and V(X) 

(g) (PX =1,Y=2|Z=1) (hb) P(X =2|¥ =2) 

(i) Conditional probability distribution of X given that Y = 0 

and Z = 3. 

5-95. A marketing company performed a risk analysis for a 
manufacturer of synthetic fibers and concluded that new com- 
petitors present no risk 13% of the time (due mostly to the 
diversity of fibers manufactured), moderate risk 72% of the 
time (some overlapping of products), and very high risk (com- 
petitor manufactures the exact same products) 15% of the 
time. It is known that 12 international companies are planning 


(ec) P(X = 1, Y= 1) 
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to open new facilities for the manufacture of synthetic fibers 

within the next three years. Assume the companies are inde- 

pendent. Let X, Y, and Z denote the number of new competi- 

tors that will pose no, moderate, and very high risk for the 

interested company, respectively. 

(a) What is the range of the joint probability distribution of 
X, Y, and Z? 

Determine the following: 

(b) P(X = 1, Y=3,Z=1) 

(c) P(Z = 2) 

(d) P(Z = 2|Y = 1,X = 10) 


( (e) P(Z = 1|X = 10) 
(f) P(Y = 1,Z <1|X = 10) 


(g) E(Z|X = 10) 


MIND-EXPANDING EXERCISES 


5-96. Show that if X,, X>,..., X, are independent, 
continuous random variables, P(X, € A), X, © A», ..., 
X, €A,) = P(X, € Ay) P(X, € Aa)... P(X, € A,) for any 


regions Aj, Ay, ..., A, in the range of X), X>,..., X, 


respectively. 
5-97. Show that if X,, X,..., X, are independent 
random variables and Y = c,X, + c).X, + -*- + c,X,, 


V(Y) = c{V(X1) + c3V(Xp) + ++ + CV(X,) 


You may assume that the random variables are continuous. 
5-98. Suppose that the joint probability function of 
the continuous random variables X and Y is constant on 
the rectangle 0 < x < a,0 < y < b. Show that X and Y 
are independent. 

5-99. Suppose that the range of the continuous 
variables X and YisO <x<aand0<y<b. Also 
suppose that the joint probability density function 
Fxv(x, y) = g(x)h(y), where g(x) is a function only of 


IMPORTANT TERMS AND CONCEPTS 


x and h(y), is a function only of y. Show that X and Y 
are independent. 


5-100. This exercise extends the hypergeometric dis- 
tribution to multiple variables. Consider a population 
with N items of k different types. Assume there are N, 
items of type 1, NV, items of type 2, ..., N, items of type k 
so that VN, + N, +...+...,M, = N. Suppose that a ran- 
dom sample of size n is selected, without replacement, 
from the population. Let X,, X, ..., X, denote the number 
of items of each type in the sample so that X, + X5, +... 
+ ...+ X, =n. Show that for feasible values of n, x,, 
Xo, ++ Xp Ni, No, .--, NM, the probability is 


“cay otek) 
- (") 


P(X =x, X. =%,.- 
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Descriptive Statistics 


Statistics is the science of data. An important aspect of dealing with data is organizing and 
summarizing the data in ways that facilitate its interpretation and subsequent analysis. 
This aspect of statistics is called descriptive statistics, and is the subject of this chapter. 
For example, in Chapter 1 we presented eight observations made on the pull-off force of 
prototype automobile engine connectors. The observations (in pounds) were 12.6, 12.9, 
13.4, 12.3, 13.6, 13.5, 12.6, and 13.1. There is obvious variability in the pull-off force val- 
ues. How should we summarize the information in these data? This is the general question 
that we consider. Data summary methods should highlight the important features of the 
data, such as the middle or central tendency and the variability, because these character- 
istics are most often important for engineering decision making. We will see that there are 
both numerical methods for summarizing data and a number of powerful graphical tech- 
niques. The graphical techniques are particularly important. Any good statistical analysis 
of data should always begin with plotting the data. 


CHAPTER OUTLINE 

6-1 NUMERICAL SUMMARIES OF DATA 6-4 BOX PLOTS 

6-2, STEM-AND-LEAF DIAGRAMS 6-5 TIME SEQUENCE PLOTS 

6-3. FREQUENCY DISTRIBUTIONS 6-6 PROBABILITY PLOTS 
AND HISTOGRAMS 

LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 


1. Compute and interpret the sample mean, sample variance, sample standard deviation, sample median, 
and sample range 


2. Explain the concepts of sample mean, sample variance, population mean, and population variance 
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3. Construct and interpret visual data displays, including the stem-and-leaf display, the histogram, 
and the box plot 


4. Explain the concept of random sampling 
5. Construct and interpret normal probability plots 


6. Explain how to use box plots and other data displays to visually compare two or more samples 
of data 


7. Know how to use simple time series plots to visually display the important features of time- 
oriented data 


6-1 NUMERICAL SUMMARIES OF DATA 


Sample Mean 


Well-constructed data summaries and displays are essential to good statistical thinking, be- 
cause they can focus the engineer on important features of the data or provide insight about 
the type of model that should be used in solving the problem. The computer has become an 
important tool in the presentation and analysis of data. While many statistical techniques re- 
quire only a hand-held calculator, much time and effort may be required by this approach, and 
a computer will perform the tasks much more efficiently. 

Most statistical analysis is done using a prewritten library of statistical programs. The 
user enters the data and then selects the types of analysis and output displays that are of 
interest. Statistical software packages are available for both mainframe machines and 
personal computers. We will present examples of output from Minitab (one of the most 
widely used PC packages) throughout the book. We will not discuss the hands-on use of 
Minitab for entering and editing data or using commands. This information is found in the 
software documentation. 

We often find it useful to describe data features numerically. For example, we can char- 
acterize the location or central tendency in the data by the ordinary arithmetic average or 
mean. Because we almost always think of our data as a sample, we will refer to the arithmetic 
mean as the sample mean. 


If the n observations in a sample are denoted by x, x2, ... , x,, the sample mean is 


n 

> 
Xi) te Novae sae oe, i=1 
n ay ea? 


ie) 


(6-1) 


EXAMPLE 6-1 Sample Mean 

Let’s consider the eight observations on pull-off force col- 
lected from the prototype engine connectors from Chapter 1. 
The eight observations are x; = 12.6, x. = 12.9, x3; = 13.4, 
x4 = 12.3, x5 = 13.6, x5 = 13.5, x7 = 12.6, and xg = 13.1. The 
sample mean is 


“12.6 +12.9 ++: +13.1 
n 8 8 


— Xp txgte tx, A 
g= _ 


104 
= “7 = 13.0 pounds 


A physical interpretation of the sample mean as a measure of 
location is shown in the dot diagram of the pull-off force data. 
See Fig. 6-1. Notice that the sample mean x = 13.0 can be 
thought of as a “balance point.” That is, if each observation 
represents | pound of mass placed at the point on the x-axis, a 
fulcrum located at x would exactly balance this system of 
weights. 


Figure 6-1 The 
sample mean as a 
balance point for a 
system of weights. 


Sample 
Variance 
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Pull-off force 


The sample mean is the average value of all the observations in the data set. Usually, 
these data are a sample of observations that have been selected from some larger population 
of observations. Here the population might consist of all the connectors that will be manufac- 
tured and sold to customers. Recall that this type of population is called a conceptual or 
hypothetical population, because it does not physically exist. Sometimes there is an actual 
physical population, such as a lot of silicon wafers produced in a semiconductor factory. 

In previous chapters we have introduced the mean of a probability distribution, denoted w. 
If we think of a probability distribution as a model for the population, one way to think of the 
mean is as the average of all the measurements in the population. For a finite population with NV 
equally likely values, the probability mass function is f(x;) = 1/N and the mean is 


N 
N Di 
p= dxf) = (6-2) 


i=1 


The sample mean, x, is a reasonable estimate of the population mean, w. Therefore, the engi- 
neer designing the connector using a 3/32-inch wall thickness would conclude, on the basis 
of the data, that an estimate of the mean pull-off force is 13.0 pounds. 

Although the sample mean is useful, it does not convey all of the information about a 
sample of data. The variability or scatter in the data may be described by the sample variance 
or the sample standard deviation. 


If x1, Xo, ...,X, is a sample of n observations, the sample variance is 
n 
a Gx i) 
j=1 
a (6-3) 
= Il 


The sample standard deviation, s, is the positive square root of the sample variance. 


The units of measurement for the sample variance are the square of the original units of 
the variable. Thus, if x is measured in pounds, the units for the sample variance are (pounds)’. 
The standard deviation has the desirable property of measuring variability in the original units 
of the variable of interest, x. 


How Does the Sample Variance Measure Variability? 

To see how the sample variance measures dispersion or variability, refer to Fig. 6-2, which 
shows the deviations x; — x for the connector pull-off force data. The greater the amount of 
variability in the pull-off force data, the larger in absolute magnitude some of the deviations 
x; — X will be. Since the deviations x; — x always sum to zero, we must use a measure of vari- 
ability that changes the negative deviations to nonnegative quantities. Squaring the deviations 
is the approach used in the sample variance. Consequently, if s* is small, there is relatively 
little variability in the data, but if s” is large, the variability is relatively large. 
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x2 x8 


Figure 6-2 How the 
sample variance meas- 
ures variability through 
the deviations x; — X. 


EXAMPLE 6-2 Sample Variance 

Table 6-1 displays the quantities needed for calculating the 
sample variance and sample standard deviation for the pull-off 
force data. These data are plotted in Fig. 6-2. The numerator of 


14 15 


so the sample variance is 


> 1.60 


s” is 


= _ 1.60 _ > 
ia ar a a 0.2286 (pounds) 


and the sample standard deviation is 


s = V0.2286 = 0.48 pounds 


Computation of s? 

The computation of s* requires calculation of x, n subtractions, and n squaring and adding oper- 
ations. If the original observations or the deviations x; — x are not integers, the deviations x; — x 
may be tedious to work with, and several decimals may have to be carried to ensure numerical 
accuracy. A more efficient computational formula for the sample variance is obtained as follows: 


n n 
pes + nx — 25 
i=l i=l 


n—-1 


n n 
SG tx — 2k) 
i=1 


n-1 


and since x = (1/n) };_, x;, this last equation reduces to 


n 2 
n > *% 
2_ Sil 7 


(6-4) 


Table 6-1 Calculation of Terms for the Sample Variance and Sample 
Standard Deviation 


i Xx; Ce Se (x; a a 

1 12.6 —0.4 0.16 

2 12.9 —0.1 0.01 

3 13.4 0.4 0.16 

4 12.3 —0.7 0.49 

5 13.6 0.6 0.36 

6 13.5 0.5 0.25 

7 12.6 —0.4 0.16 

8 B.A 04 0.01 
104.0 0.0 1.60 


EXAMPLE 6-3 
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Note that Equation 6-4 requires squaring each individual x;, then squaring the sum of the x;, 
subtracting (>x;)*/n from > x?, and finally dividing by n — 1. Sometimes this is called the 
shortcut method for calculating s? (or s). 


We will calculate the sample variance and standard deviation and 
using the shortcut method, Equation 6-4. The formula gives 


= V0.2286 = 0.48 pounds 


2 
x) (104) These results agree exactly with those obtained previously. 


= 0.2286 (pounds)? 


Sample Range 


Analogous to the sample variance s’, the variability in the population is defined by the 
population variance (o’). As in earlier chapters, the positive square root of o”, or o, will 
denote the population standard deviation. When the population is finite and consists of V 
equally likely values, we may define the population variance as 


N 
_ 
be 2, (Hi — wy (6-5) 
N 


oc = 


We observed previously that the sample mean could be used as an estimate of the population 
mean. Similarly, the sample variance is an estimate of the population variance. In Chapter 7, 
we will discuss estimation of parameters more formally. 

Note that the divisor for the sample variance is the sample size minus one (n — 1), while 
for the population variance it is the population size N. If we knew the true value of the popu- 
lation mean p, we could find the sample variance as the average squared deviation of the sam- 
ple observations about w. In practice, the value of 1 is almost never known, and so the sum of 
the squared deviations about the sample average x must be used instead. However, the obser- 
vations x; tend to be closer to their average, x, than to the population mean, ww. Therefore, to 
compensate for this we use n — | as the divisor rather than n. If we used 7 as the divisor in the 
sample variance, we would obtain a measure of variability that is, on the average, consistently 
smaller than the true population variance o7. 

Another way to think about this is to consider the sample variance s* as being based on 
n — | degrees of freedom. The term degrees of freedom results from the fact that the n devi- 
ations x, — X,X, — X,...,X, — x always sum to zero, and so specifying the values of any 
n — 1 of these quantities automatically determines the remaining one. This was illustrated in 
Table 6-1. Thus, only n — 1 of the n deviations, x; — x, are freely determined. 

In addition to the sample variance and sample standard deviation, the sample range, or 
the difference between the largest and smallest observations, is a useful measure of variabil- 
ity. The sample range is defined as follows. 


If the n observations in a sample are denoted by x, x», ... , x,, the sample range is 


r = max(x;) — min(x;) (6-6) 
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Figure 6-3 Relation- r 
ship between a popula- = } 
tion and a sample. 


For the pull-off force data, the sample range is r = 13.6 — 12.3 = 1.3. Generally, as the vari- 
ability in sample data increases, the sample range increases. 

The sample range is easy to calculate, but it ignores all of the information in the sample 
data between the largest and smallest values. For example, the two samples 1, 3, 5, 8, and 9 
and 1,5, 5,5, and 9 both have the same range (r = 8). However, the standard deviation of the 
first sample is s; = 3.35, while the standard deviation of the second sample is s, = 2.83. The 
variability is actually less in the second sample. 

Sometimes, when the sample size is small, say n < 8 or 10, the information loss associ- 
ated with the range is not too serious. For example, the range is used widely in statistical qual- 
ity control where sample sizes of 4 or 5 are fairly common. We will discuss some of these 


applications in Chapter 15. 


In most statistics problems, we work with a sample of observations selected from the pop- 
ulation that we are interested in studying. Figure 6-3 illustrates the relationship between the 


population and the sample. 


EXERCISES FOR SECTIONS 6-1 


6-1. Will the sample mean always correspond to one of the 
observations in the sample? 

6-2. Will exactly half of the observations in a sample fall be- 
low the mean? 

6-3. Will the sample mean always be the most frequently oc- 
curring data value in the sample? 

6-4. For any set of data values, is it possible for the sample 
standard deviation to be larger than the sample mean? Give an 
example. 

6-5. Can the sample standard deviation be equal to zero? 
Give an example. 

6-6. Suppose that you add 10 to all of the observations in a 
sample. How does this change the sample mean? How does it 
change the sample standard deviation? 

6-7. Eight measurements were made on the inside diameter 
of forged piston rings used in an automobile engine. The data 


(in millimeters) are 74.001, 74.003, 74.015, 74.000, 74.005, 
74.002, 74.005, and 74.004. Calculate the sample mean and 
sample standard deviation, construct a dot diagram, and com- 
ment on the data. 

6-8. In Applied Life Data Analysis (Wiley, 1982), Wayne 
Nelson presents the breakdown time of an insulating fluid 
between electrodes at 34 kV. The times, in minutes, are as 
follows: 0.19, 0.78, 0.96, 1.31, 2.78, 3.16, 4.15, 4.67, 4.85, 
6.50, 7.35, 8.01, 8.27, 12.06, 31.75, 32.52, 33.91, 36.71, 
and 72.89. Calculate the sample mean and sample standard 
deviation. 

6-9. The January 1990 issue of Arizona Trend contains a 
supplement describing the 12 “best” golf courses in the state. 
The yardages (lengths) of these courses are as follows: 6981, 
7099, 6930, 6992, 7518, 7100, 6935, 7518, 7013, 6800, 7041, 
and 6890. Calculate the sample mean and sample standard de- 
viation. Construct a dot diagram of the data. 


6-10. Anarticle in the Journal of Structural Engineering (Vol. 
115, 1989) describes an experiment to test the yield strength of 
circular tubes with caps welded to the ends. The first yields (in 
KN) are 96, 96, 102, 102, 102, 104, 104, 108, 126, 126, 128, 128, 
140, 156, 160, 160, 164, and 170. Calculate the sample mean and 
sample standard deviation. Construct a dot diagram of the data. 


6-11. An article in Human Factors (June 1989) presented 
data on visual accommodation (a function of eye movement) 
when recognizing a speckle pattern on a high-resolution CRT 
screen. The data are as follows: 36.45, 67.90, 38.77, 42.18, 
26.72, 50.77, 39.30, and 49.71. Calculate the sample mean and 
sample standard deviation. Construct a dot diagram of the data. 
6-12. The following data are direct solar intensity measure- 
ments (watts/m/) on different days at a location in southern Spain: 
562, 869, 708, 775, 775, 704, 809, 856, 655, 806, 878, 909, 918, 
558, 768, 870, 918, 940, 946, 661, 820, 898, 935, 952, 957, 693, 
835, 905, 939, 955, 960, 498, 653, 730, and 753. Calculate the 
sample mean and sample standard deviation. Prepare a dot dia- 
gram of these data. Indicate where the sample mean falls on this 
diagram. Give a practical interpretation of the sample mean. 


6-13. The April 22, 1991, issue of Aviation Week and Space 
Technology reported that during Operation Desert Storm, U.S. 
Air Force F-117A pilots flew 1270 combat sorties for a total of 
6905 hours. What is the mean duration of an F-117A mission 
during this operation? Why is the parameter you have calcu- 
lated a population mean? 


6-14. Preventing fatigue crack propagation in aircraft struc- 
tures is an important element of aircraft safety. An engineering 
study to investigate fatigue crack in n = 9 cyclically loaded 
wing boxes reported the following crack lengths (in mm): 
2.13, 2.96, 3.02, 1.82, 1.15, 1.37, 2.04, 2.47, 2.60. Calculate 
the sample mean and sample standard deviation. Prepare a dot 
diagram of the data. 

6-15. Anarticle in the Journal of Physiology [“Response of 
Rat Muscle to Acute Resistance Exercise Defined by 
Transcriptional and Translational Profiling” (2002, Vol. 545, 
pp. 27—41)] studied gene expression as a function of resist- 
ance exercise. Expression data (measures of gene activity) 
from one gene are shown in the following table. One group of 
rats was exercised for six hours while the other received no ex- 
ercise. Compute the sample mean and standard deviation of 
the exercise and no-exercise groups separately. Construct a dot 
diagram for the exercise and no-exercise groups separately. 
Comment on any differences between the groups. 
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6 Hours 6 Hours 
of Exercise of Exercise No Exercise No Exercise 

425.313 208.475 485.396 406.921 
223.306 286.484 159.471 335.209 
388.793 244.242 478.314 

139.262 408.099 245.782 

212.565 157.743 236.212 

324.024 436.37 252.773 


6-16. Exercise 6-11 describes data from an article in Human 
Factors on visual accommodation from an experiment involv- 
ing a high-resolution CRT screen. 

Data from a second experiment using a low-resolution screen 
were also reported in the article. They are 8.85, 35.80, 26.53, 
64.63, 9.00, 15.38, 8.14, and 8.24. Prepare a dot diagram for this 
second sample and compare it to the one for the first sample. 
What can you conclude about CRT resolution in this situation? 


6-17. The pH ofa solution is measured eight times by one op- 
erator using the same instrument. She obtains the following 
data: 7.15, 7.20, 7.18, 7.19, 7.21, 7.20, 7.16, and 7.18. Calculate 
the sample mean and sample standard deviation. Comment on 
potential major sources of variability in this experiment. 


6-18. An article in the Journal of Aircraft (1988) described 
the computation of drag coefficients for the NASA 0012 
airfoil. Different computational algorithms were used at 
M.. = 0.7 with the following results (drag coefficients are in 
units of drag counts; that is, one count is equivalent to a drag 
coefficient of 0.0001): 79, 100, 74, 83, 81, 85, 82, 80, and 84. 
Compute the sample mean, sample variance, and sample stan- 
dard deviation, and construct a dot diagram. 


6-19. The following data are the joint temperatures of the 

O-rings (°F) for each test firing or actual launch of the space 

shuttle rocket motor (from Presidential Commission on the 

Space Shuttle Challenger Accident, Vol. 1, pp. 129-131): 

84, 49, 61, 40, 83, 67, 45, 66, 70, 69, 80, 58, 68, 60, 67, 72, 

73, 70, 57, 63, 70, 78, 52, 67, 53, 67, 75, 61, 70, 81, 76, 79, 

75, 76, 58, 31. 

(a) Compute the sample mean and sample standard deviation 
and construct a dot diagram of the temperature data. 

(b) Set aside the smallest observation (31°F) and recompute 
the quantities in part (a). Comment on your findings. How 
“different” are the other temperatures from this last value? 


The dot diagram is a useful data display for small samples, up to (say) about 20 observations. 
However, when the number of observations is moderately large, other graphical displays may 


be more useful. 


For example, consider the data in Table 6-2. These data are the compressive strengths in 
pounds per square inch (psi) of 80 specimens of a new aluminum-lithium alloy undergoing 
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Table 6-2 Compressive Strength (in psi) of 80 Aluminum-Lithium Alloy Specimens 


105 221 183 

97 154 153 
245 228 174 
163 131 154 
207 180 190 
134 178 76 
218 157 101 
199 151 142 
160 175 149 
196 201 200 


186 121 181 180 143 
174 120 168 167 141 
199 181 158 176 110 
115 160 208 158 133 
193 194 133 156 123 
167 184 135, 229 146 
171 165 172 158 169 
163 145 171 148 158 
87 160 237 150 135, 
176 150 170 118 149 


evaluation as a possible material for aircraft structural elements. The data were recorded in the 
order of testing, and in this format they do not convey much information about compressive 
strength. Questions such as “What percent of the specimens fail below 120 psi?” are not easy to 
answer. Because there are many observations, constructing a dot diagram of these data would 
be relatively inefficient; more effective displays are available for large data sets. 

A stem-and-leaf diagram is a good way to obtain an informative visual display of a data 


set X1,%2,-- 


. , X,, where each number x; consists of at least two digits. To construct a stem- 


and-leaf diagram, use the following steps. 


Steps to Construct a 
Stem-and-Leaf (1) 
Diagram 


Divide each number x; into two parts: a stem, consisting of one or more of the 
leading digits, and a leaf, consisting of the remaining digit. 


(2) List the stem values in a vertical column. 
(3) Record the leaf for each observation beside its stem. 


(4) Write the units for stems and leaves on the display. 


To illustrate, if the data consist of percent defective information between 0 and 100 on 
lots of semiconductor wafers, we can divide the value 76 into the stem 7 and the leaf 6. In gen- 
eral, we should choose relatively few stems in comparison with the number of observations. 
It is usually best to choose between 5 and 20 stems. 


EXAMPLE 6-4 Alloy Strength 

To illustrate the construction of a stem-and-leaf diagram, 
consider the alloy compressive strength data in Table 6-2. 
We will select as stem values the numbers 7, 8, 9,..., 24. 
The resulting stem-and-leaf diagram is presented in Fig. 6-4. 
The last column in the diagram is a frequency count of the 
number of leaves associated with each stem. Inspection of 
this display immediately reveals that most of the compressive 


strengths lie between 110 and 200 psi and that a central 
value is somewhere between 150 and 160 psi. Furthermore, 
the strengths are distributed approximately symmetrically 
about the central value. The stem-and-leaf diagram enables 
us to determine quickly some important features of the data 
that were not immediately obvious in the original display in 
Table 6-2. 


In some data sets, it may be desirable to provide more classes or stems. One way to do this 
would be to modify the original stems as follows: Divide stem 5 (say) into two new stems, 5L 
and 5U. Stem 5L has leaves 0, 1, 2, 3, and 4, and stem 5U has leaves 5, 6,7, 8, and 9. This will 
double the number of original stems. We could increase the number of original stems by four 


Figure 6-4 Stem- 
and-leaf diagram for 
the compressive 
strength data in Table 
6-2. 
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Stem Leaf Frequency 
7 6 1 
8 7 1 
9 7 1 
10 51 2 
11 580 3} 
12 103 3 
13 413535 6 
14 29583169 8 
15 471340886808 12 
16 3073050879 10 
17 8544162106 10 
18 0361410 7 
19 960934 6 
20 7108 4 
21 8 1 
22 189 é] 
23 7 1 
5 1 


Stem : Tens and hundreds digits (psi); Leaf: Ones digits (psi) 


by defining five new stems: 5z with leaves 0 and 1, 5t (for twos and three) with leaves 2 and 3, 
5f (for fours and fives) with leaves 4 and 5, 5s (for six and seven) with leaves 6 and 7, and 5e 
with leaves 8 and 9. 


EXAMPLE 6-5 Chemical Yield 


Figure 6-5 illustrates the stem-and-leaf diagram for 25 obser- 
vations on batch yields from a chemical process. In Fig. 6-5(a) 
we have used 6, 7, 8, and 9 as the stems. This results in too few 
stems, and the stem-and-leaf diagram does not provide much 
information about the data. In Fig. 6-5(b) we have divided 


each stem into two parts, resulting in a display that more 
adequately displays the data. Figure 6-5(c) illustrates a stem- 
and-leaf display with each stem divided into five parts. There 
are too many stems in this plot, resulting in a display that does 
not tell us much about the shape of the data. 


Figure 6-6 shows a stem-and-leaf display of the compressive strength data in Table 6-2 
produced by Minitab. The software uses the same stems as in Fig. 6-4. Note also that the 
computer orders the leaves from smallest to largest on each stem. This form of the plot is 
usually called an ordered stem-and-leaf diagram. This is not usually done when the plot is 
constructed manually because it can be time-consuming. The computer adds a column to the 
left of the stems that provides a count of the observations at and above each stem in the up- 
per half of the display and a count of the observations at and below each stem in the lower 
half of the display. At the middle stem of 16, the column indicates the number of observa- 
tions at this stem. 

The ordered stem-and-leaf display makes it relatively easy to find data features such as 
percentiles, quartiles, and the median. The sample median is a measure of central tendency 
that divides the data into two equal parts, half below the median and half above. If the num- 
ber of observations is even, the median is halfway between the two central values. From 
Fig. 6-6 we find the 40th and 41st values of strength as 160 and 163, so the median is 
(160 + 163)/2 = 161.5. If the number of observations is odd, the median is the central value. 
The sample mode is the most frequently occurring data value. Figure 6-6 indicates that the mode 
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Figure 6-5 Stem- 
and-leaf displays for 
Example 6-5. Stem: 
Tens digits. Leaf: 
Ones digits. 


Figure 6-6 A stem- 
and-leaf diagram from 
Minitab. 


Stem | Leaf Stem Leaf Stem Leaf 
6 134556 6L 134 6z 1 
7 011357889 6U 556 6t 3 
8 1344788 TL 0113 6f 455 
9 235 TU 57889 6s 6 
(a) 8L 1344 6e 
8U 788 TZ O11 
OL 23 Tt 3 
9U 5) Tf 5 
(b) Ts wh 
Te 889 
8z 1 
8t 3 
8f 44 
8s 7 
8e 88 
9z 
ot 23 
of 5 
9s 
9e 
(c) 


is 158; this value occurs four times, and no other value occurs as frequently in the sample. If 
there were more than one value that occurred four times, the data would have multiple modes. 
We can also divide data into more than two parts. When an ordered set of data is divided 
into four equal parts, the division points are called quartiles. The fizst or lower quartile, q, 
is a value that has approximately 25% of the observations below it and approximately 75% 


Character Stem-and-Leaf Display 


Stem-and-leaf of Strength 
N=80 Leaf Unit = 1.0 


1 7 6 

2 8 7 

3 9 7 

5 10 15 

8 11 058 

11 12 013 

17 13 133455 

25 14 12356899 
37 15 001344678888 
(10) 16 0003357789 
33 17 0112445668 
23 18 0011346 

16 19 034699 

10 20 0178 

6 21 8 

5 22 189 

2 23 7 

1 24 5 
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Table 6-3 Summary Statistics for the Compressive Strength Data from Minitab 


Variable N Mean Median StDev SE Mean 
80 162.66 161.50 33.77 3.78 
Min Max Ql Q3 
76.00 245.00 143.50 181.00 


of the observations above. The second quartile, qz, has approximately 50% of the observa- 
tions below its value. The second quartile is exactly equal to the median. The third or upper 
quartile, q3, has approximately 75% of the observations below its value. As in the case of 
the median, the quartiles may not be unique. The compressive strength data in Fig. 6-6 con- 
tain n = 80 observations. Minitab software calculates the first and third quartiles as the 
(n + 1)/4 and 3(n + 1)/4 ordered observations and interpolates as needed, for example, 
(80 + 1)/4 = 20.25 and 3(80 + 1)/4 = 60.75. Therefore, Minitab interpolates between the 
20th and 21st ordered observation to obtain g; = 143.50 and between the 60th and 61st 
observation to obtain g; = 181.00. In general, the 100th percentile is a data value such that 
approximately 100k% of the observations are at or below this value and approximately 
100(1 — k)% of them are above it. Finally, we may use the interquartile range, defined as 
IQR = q3 — q), aS a measure of variability. The interquartile range is less sensitive to the 
extreme values in the sample than is the ordinary sample range. 

Many statistics software packages provide data summaries that include these quantities. 
The output obtained for the compressive strength data in Table 6-2 from Minitab is shown in 
Table 6-3. 


EXERCISES FOR SECTION 6-2 


6-20. When will the median of a sample be equal to the 
sample mean? 


6-23. The following data are the numbers of cycles to fail- 
ure of aluminum test coupons subjected to repeated alternat- 


6-21. When will the median of a sample be equal to the ing stress at 21,000 psi, 18 cycles per second. 
mode? 
6-22. An article in Technometrics (Vol. 19, 1977, p. 425) 1115 865 1015 885 1594 1000 1416 1501 
presented the following data on the motor fuel octane ratings 1310 2130 845 1223 2023 1820 1560 1238 
of several blends of gasoline: 1540 1421 1674 375 1315 1940 1055 990 
1502 1109 1016 2265 1269 1120 1764 1468 
88.5 988 89.6 92.2 92.7 884 87.5 90.9 1258 1481 1102 1910 1260 910 1330 1512 
94.7 88.3 90.4 834 879 926 87.8 89.9 1315 1567 1605 1018 1888 1730 1608 1750 
84.3 904 91.6 91.0 93.0 93.7 883 91.8 1085 1883 706 1452 1782 1102 1535 1642 
90.1 91.2 90.7 88.2 944 965 89.2 89.7 798 1203 2215 1890 1522 1578 1781 
89.0 906 88.6 885 904 843 92.3 92.2 1020 1270 785 2100 1792 758 1750 
89.8 92.2 88.3 93.3 91.2 93.2 88.9 
ae SE ee ee ee ee Construct a stem-and-leaf display for these data. Calculate the 
90.3 91.1 85.3 91.1 94.2 88.7 92.7 ; . : : 
median and quartiles of these data. Does it appear likely that a 
0 “RE ea Se ee coupon will “survive” beyond 2000 cycles? Justify your answer. 
91.5 93.4 89.3 1003 90.1 89.3 86.7 
89.9 96.1 91.1 876 91.8 91.0 91.0 6-24. The percentage of cotton in material used to manu- 


Construct a stem-and-leaf display for these data. Calculate the 
median and quartiles of these data. 


facture men’s shirts follows. Construct a stem-and-leaf dis- 
play for the data. Calculate the median and quartiles of 
these data. 


@ 


@ 


@ ©@ 


@ 
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34.2 37.8 33.6 326 33.8 35.8 347 34.6 
33.1 366 34.7 33.1 342 37.6 336 33.6 
34.5 35.4 35.00 346 33.4 37.3 32.5 34.1 
35.6 346 354 35.9 347 346 341 34.7 
36.3 33.8 36.2 347 346 35.5 35.1 35.7 
35.1 37.1 368 336 35.2 32.8 368 368 
34.7 340 35.1 32.9 35.0 32.1 37.9 34.3 
33.6 341 35.3 335 349 345 364 32.7 
6-25. The following data represent the yield on 90 consecu- 


tive batches of ceramic substrate to which a metal coating has 
been applied by a vapor-deposition process. Construct a stem- 
and-leaf display for these data. Calculate the median and quar- 
tiles of these data. 


94.1 86.1 95.3 849 888 846 944 84.1 
93.2 90.4 94.1 783 864 83.6 96.1 83.7 
90.6 89.1 97.8 89.6 85.1 85.4 98.0 82.9 
91.4 87.3 93.1 90.3 840 89.7 85.4 87.3 
88.2 84.1 864 93.1 93.7 87.6 86.6 86.4 
86.1 90.1 87.6 946 87.7 85.1 91.7 84.5 
95.1 95.2 941 96.3 90.6 89.6 87.5 
90.0 86.1 92.1 94.7 89.4 90.0 84.2 
92.4 94.3 964 91.1 886 90.1 85.1 
87.3 93.2 88.2 924 841 94.3 90.5 
86.6 86.7 864 90.6 826 97.3 95.6 
91.2 83.0 85.0 89.1 83.1 96.8 88.3 


6-26. Calculate the sample median, mode, and mean of the 
data in Exercise 6-22. Explain how these three measures of 
location describe different features of the data. 

6-27. Calculate the sample median, mode, and mean of the 
data in Exercise 6-23. Explain how these three measures of 
location describe different features in the data. 

6-28. Calculate the sample median, mode, and mean for the 
data in Exercise 6-24. Explain how these three measures of 
location describe different features of the data. 

6-29. The net energy consumption (in billions of kilowatt- 
hours) for countries in Asia in 2003 was as follows (source: U.S. 
Department of Energy Web site, http://www.eia.doe.gov/emeu). 
Construct a stem-and-leaf diagram for these data and comment 
on any important features that you notice. Compute the sample 
mean, sample standard deviation, and sample median. 


Billions of Kilowatt-Hours 


Afghanistan 1.04 
Australia 200.66 
Bangladesh 16.20 
Burma 6.88 
China 1671.23 
Hong Kong 38.43 
India 519.04 


Indonesia 101.80 
Japan 946.27 
Korea, North 17.43 
Korea, South 303.33 
Laos 3.30 
Malaysia 73.63 
Mongolia 2.91 
Nepal 2.30 
New Zealand 37.03 
Pakistan 71.54 
Philippines 44.48 
Singapore 30.89 
Sri Lanka 6.80 
Taiwan 154.34 
Thailand 107.34 
Vietnam 36.92 
Total 4393.8 


6-30. The female students in an undergraduate engineer- 
ing core course at ASU self-reported their heights to the 
nearest inch. The data are below. Construct a stem-and-leaf 
diagram for the height data and comment on any important 
features that you notice. Calculate the sample mean, the 
sample standard deviation, and the sample median of 
height. 


62 64 66 67 65 68 61 65 67 65 64 63 67 
68 64 66 68 69 65 67 62 66 68 67 66 65 
69 65 70 65 67 68 65 63 64 67 67 


6-31. The shear strengths of 100 spot welds in a titanium 
alloy follow. Construct a stem-and-leaf diagram for the weld 
strength data and comment on any important features that you 
notice. What is the 95th percentile of strength? 


5408 
5420 
5407 
5463 
5399 
5445 
5381 
5401 
5458 
5383 


5431 
5429 
5469 


5475 5442 
5401 5446 
5416 5377 
5408 5481 5453 
5391 5477 5447 
5436 5454 5453 
5425 5388 5388 
5411 5399 5431 
5485 5431 5416 
5401 5407 5385 


5376 
5487 
5454 
5422 
5329 
5428 
5378 
5440 
5431 
5440 


5388 
5416 
5375 
5354 
5473 
5418 
5481 
5413 
5390 
5422 


5459 5422 
5382 5357 
5409 5459 
5421 5406 
5423 5441 
5465 5427 
5387 5440 
5406 5342 
5399 5435 
5448 5366 


5416 
5388 
5445 
5444 
5412 
5421 
5482 
5452 
5387 
5430 


5435 
5457 
5429 
5466 
5384 
5396 
5406 
5420 
5462 
5418 


6-32. Animportant quality characteristic of water is the con- 
centration of suspended solid material. Following are 60 mea- 
surements on suspended solids from a certain lake. Construct a 
stem-and-leaf diagram for these data and comment on any im- 
portant features that you notice. Compute the sample mean, the 
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sample standard deviation, and the sample median. What is the 
90th percentile of concentration? 


42.4 
54.3 
56.3 
61.4 
42.6 
61.4 


6-33. 


65.7 
54.0 
43.3 
64.0 
7714 
73.1 


29.8 
73.1 
57.4 
64.2 
54.7 
77.3 


58.7 
81.3 
45.3 
72.6 
57.1 
48.5 


52.1 
59.9 
80.1 
72.5 
713 
89.8 


55.8 
56.9 
49.7 
46.1 
39.3 
50.7 


57.0 
62.2 
42.8 
53.1 
76.4 
52.0 


68.7 
69.9 
42.4 
56.1 
59.3 
59.6 


67.3 
66.9 
59.6 
67.2 
51.1 
66.1 


67.3 
59.0 
65.8 
70.7 
73.8 
31.6 


The United States Golf Association tests golf balls to 


680 
677 
649 
681 
652 
717 
704 
683 
662 
681 


669 
669 
675 
679 
720 
727 
652 
723 
644 
715 


719 
700 
701 
691 
660 
653 
664 
710 
683 
665 


699 
718 
721 
683 
695 
637 
702 
680 
695 
676 


670 
690 
683 
705 
701 
660 
661 
684 
678 
665 


710 
681 
735 
746 
724 
693 
720 
705 
674 
675 


722 
702 
688 
706 
668 
679 
695 
681 
656 
655 


663 
696 
763 
649 
698 
682 
670 
748 
667 
659 


658 
692 
672 
668 
668 
724 
656 
697 
683 
720 


634 
690 
698 
672 
660 
642 
718 
703 
691 
675 


720 
694 
659 
690 
680 
704 
660 
660 
680 
697 


203 


690 
660 
704 
724 
739 
695 
648 
722 
685 
663 


ensure that they conform to the rules of golf. Balls are tested 
for weight, diameter, roundness, and overall distance. The 
overall distance test is conducted by hitting balls with a driver 
swung by a mechanical device nicknamed “Iron Byron” after 
the legendary great Byron Nelson, whose swing the machine 
is said to emulate. Following are 100 distances (in yards) 
achieved by a particular brand of golf ball in the overall dis- 
tance test. Construct a stem-and-leaf diagram for these data 
and comment on any important features that you notice. 
Compute the sample mean, sample standard deviation, and 


the sample median. What is the 90th percentile of distances? 


261.3 
258.1 
254.2 
2511 
237.9 
255.8 
241.4 
244.3 
241.2 
254.4 
256.8 
255.3 
255.0 


259.4 
270.5 
270.7 
272.6 
274.0 
260.7 
260.6 
272.2 
251.1 
232.1 
273.0 
266.6 
273.2 


265.7 
255.1 
233.7 
253.7 
264.5 
245.5 
280.3 
248.3 
267.0 
271.5 
240.8 
250.2 
251.4 


270.6 
268.9 
263.5 
262.2 
244.8 
279.6 
212A 
278.7 
273.4 
242.9 
276.6 
255.8 
276.1 


274.2 
267.4 
244.5 
252.0 
264.0 
237.8 
261.0 
236.0 
247.7 
273.6 
264.5 
285.3 
277.8 


261.4 
253.6 
251.8 
280.3 
268.3 
278.5 
260.0 
271.2 
254.8 
256.1 
264.5 
255.4 
266.8 


254.5 
234.3 
259.5 
274.9 
272.1 
273.3 
279.3 
279.8 
272.8 
251.6 
226.8 
240.5 
268.5 


283.7 
263.2 
201:5 
233.7 
260.2 
263.7 
252.1 
245.6 
270.5 


6-34. A semiconductor manufacturer produces devices used 
as central processing units in personal computers. The speed 
of the devices (in megahertz) is important because it deter- 
mines the price that the manufacturer can charge for the 
devices. The following table contains measurements on 120 
devices. Construct a stem-and-leaf diagram for these data and 
comment on any important features that you notice. Compute 
the sample mean, the sample standard deviation, and the sam- 
ple median. What percentage of the devices has a speed ex- 
ceeding 700 megahertz? 


6-35. A group of wine enthusiasts taste-tested a pinot noir 
wine from Oregon. The evaluation was to grade the wine on a 


0-to-100-point scale. The results follow. Construct a stem-and-leaf @ 


diagram for these data and comment on any important features 
that you notice. Compute the sample mean, the sample standard 
deviation, and the sample median. A wine rated above 90 is con- 
sidered truly exceptional. What proportion of the taste-tasters 
considered this particular pinot noir truly exceptional? 


94 #90 92 91 91 86 89 91 91 90 
909 93 87 90 91 92 89 86 89 90 
88 95 91 88 89 92 87 89 95 92 
8 91 8 89 88 84 85 90 90 83 
6-36. In their book Jntroduction to Linear Regression 


Analysis (4th edition, Wiley, 2006), Montgomery, Peck, and 
Vining presented measurements on NbOCI, concentration from 
a tube-flow reactor experiment. The data, in gram-mole per 
liter X 107%, are as follows. Construct a stem-and-leaf dia- 
gram for these data and comment on any important features 
that you notice. Compute the sample mean, the sample stan- 
dard deviation, and the sample median. 


450 450 473 S07 457 452 453 1215 1256 
1145 1085 1066 1111 1364 1254 1396 1575 1617 
1733 2753 3186 3227 3469 1911 2588 2635 2725 


6-37. In Exercise 6-30, we presented height data that was 
self-reported by female undergraduate engineering students in 
acore course at ASU. In the same class, the male students self- 
reported their heights as follows. Construct a comparative 
stem-and-leaf diagram by listing the stems in the center of the 
display and then placing the female leaves on the left and the 
male leaves on the right. Comment on any important features 
that you notice in this display. 


69 67 69 70 65 68 69 70 71 69 66 67 69 75 68 67 68 
69 70 71 72 68 69 69 70 71 68 72 69 69 68 69 73 70 
73 68 69 71 67 68 65 68 68 69 70 74 71 69 70 69 


6-3 FREQUENCY DISTRIBUTIONS AND HISTOGRAMS 


A frequency distribution is a more compact summary of data than a stem-and-leaf dia- 
gram. To construct a frequency distribution, we must divide the range of the data into inter- 
vals, which are usually called class intervals, cells, or bins. If possible, the bins should be 


> 


SS 


204 CHAPTER 6 DESCRIPTIVE STATISTICS 


Table 6-4 Frequency Distribution for the Compressive Strength Data in Table 6-2 


Class 10= = 90) 90S 110) MO=3 130) B0=2— 150) 1503 = 170) 170 == 190) 190 = = 210) 210 230) 230 x= 250) 
Frequency 2 3 6 14 22 17 10 4 2 
Relative 

frequency 0.0250 0.0375 0.0750 0.1750 0.2750 0.2125 0.1250 0.0500 0.0250 
Cumulative 


relative 
frequency 0.0250 


0.0625 0.1375 0.3125 0.5875 0.8000 0.9250 0.9750 1.0000 


Constructing a 
Histogram (Equal 
Bin Widths) 


of equal width in order to enhance the visual information in the frequency distribution. 
Some judgment must be used in selecting the number of bins so that a reasonable display 
can be developed. The number of bins depends on the number of observations and the 
amount of scatter or dispersion in the data. A frequency distribution that uses either too few 
or too many bins will not be informative. We usually find that between 5 and 20 bins is sat- 
isfactory in most cases and that the number of bins should increase with n. There are several 
sets of rules that can be used to determine the member of bins in a histogram. However, 
choosing the number of bins approximately equal to the square root of the number of ob- 
servations often works well in practice. 

A frequency distribution for the comprehensive strength data in Table 6-2 is shown in 
Table 6-4. Since the data set contains 80 observations, and since 80 = 9, we suspect that 
about eight to nine bins will provide a satisfactory frequency distribution. The largest and 
smallest data values are 245 and 76, respectively, so the bins must cover a range of at least 
245 — 76 = 169 units on the psi scale. If we want the lower limit for the first bin to begin 
slightly below the smallest data value and the upper limit for the last bin to be slightly above 
the largest data value, we might start the frequency distribution at 70 and end it at 250. This is 
an interval or range of 180 psi units. Nine bins, each of width 20 psi, give a reasonable 
frequency distribution, so the frequency distribution in Table 6-4 is based on nine bins. 

The second row of Table 6-4 contains a relative frequency distribution. The relative 
frequencies are found by dividing the observed frequency in each bin by the total number of 
observations. The last row of Table 6-4 expresses the relative frequencies on a cumulative ba- 
sis. Frequency distributions are often easier to interpret than tables of data. For example, from 
Table 6-4 it is very easy to see that most of the specimens have compressive strengths between 
130 and 190 psi and that 97.5 percent of the specimens fail below 230 psi. 

The histogram is a visual display of the frequency distribution. The steps for construct- 
ing a histogram follow. 


(1) Label the bin (class interval) boundaries on a horizontal scale. 

(2) Mark and label the vertical scale with the frequencies or the relative 
frequencies. 

(3) Above each bin, draw a rectangle where height is equal to the frequency 
(or relative frequency) corresponding to that bin. 


Figure 6-7 is the histogram for the compression strength data. The histogram, like the stem- 
and-leaf diagram, provides a visual impression of the shape of the distribution of the meas- 
urements and information about the central tendency and scatter or dispersion in the data. 
Notice the symmetric, bell-shaped distribution of the strength measurements in Fig. 6-7. This 
display often gives insight about possible choices of probability distributions to use as a model 


Figure 6-7 Histogram 
of compressive strength 
for 80 aluminum- 
lithium alloy 
specimens. 
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for the population. For example, here we would likely conclude that the normal distribution is 
a reasonable model for the population of compression strength measurements. 

Sometimes a histogram with unequal bin widths will be employed. For example, if the 
data have several extreme observations or outliers, using a few equal-width bins will result 
in nearly all observations falling in just a few of the bins. Using many equal-width bins will 
result in many bins with zero frequency. A better choice is to use shorter intervals in the region 
where most of the data falls and a few wide intervals near the extreme observations. When the 
bins are of unequal width, the rectangle’s area (not its height) should be proportional to the 
bin frequency. This implies that the rectangle height should be 


bin frequency 


Rectangle height = 
a a bin width 


In passing from either the original data or stem-and-leaf diagram to a frequency distribu- 
tion or histogram, we have lost some information because we no longer have the individual ob- 
servations. However, this information loss is often small compared with the conciseness 
and ease of interpretation gained in using the frequency distribution and histogram. 

Figure 6-8 shows a histogram of the compressive strength data from Minitab with 17 bins. 
We have noted that histograms may be relatively sensitive to the number of bins and their 
width. For small data sets, histograms may change dramatically in appearance if the number 
and/or width of the bins changes. Histograms are more stable for larger data sets, preferably of 
size 75 to 100 or more. Figure 6-9 shows the Minitab histogram for the compressive strength 


20 
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bh 
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Figure 6-8 A histogram of the compressive strength Figure 6-9 A histogram of the compressive strength 


data from Minitab with 17 bins. data from Minitab with nine bins. 
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Figure 6-10 A 
cumulative distribution 
plot of the compressive 
strength data from 
Minitab. 


EXAMPLE 6-6 


Cumulative frequency 
as 
Oo 


100 150 200 250 
Strength 


data with nine bins. This used Minitab’s default setting and the result is similar to the original 
histogram shown in Fig. 6-7. Since the number of observations is moderately large (n = 80), 
the choice of the number of bins is not especially important, and both Figs. 6-8 and 6-9 con- 
vey similar information. 

Figure 6-10 shows a variation of the histogram available in Minitab, the cumulative fre- 
quency plot. In this plot, the height of each bar is the total number of observations that are less 
than or equal to the upper limit of the bin. Cumulative distributions are also useful in data in- 
terpretation; for example, we can read directly from Fig. 6-10 that there are approximately 70 
observations less than or equal to 200 psi. 

When the sample size is large, the histogram can provide a reasonably reliable indicator 
of the general shape of the distribution or population of measurements from which the sample 
was drawn. Figure 6-11 presents three cases. The median is denoted as x. Generally, if the data 
are symmetric, as in Fig. 6-11(b), the mean and median coincide. If, in addition, the data have 
only one mode (we say the data are unimodal), the mean, median, and mode all coincide. If the 
data are skewed (asymmetric, with a long tail to one side), as in Fig. 6-11(a) and (c), the mean, 
median, and mode do not coincide. Usually, we find that mode < median < mean if the 
distribution is skewed to the right, whereas mode > median > mean if the distribution is 
skewed to the left. 

Frequency distributions and histograms can also be used with qualitative or categorical 
data. In some applications there will be a natural ordering of the categories (such as freshman, 
sophomore, junior, and senior), whereas in others the order of the categories will be arbitrary 
(such as male and female). When using categorical data, the bins should have equal width. 


Figure 6-12 on the next page presents the production of trans- 737 was the most popular model, followed by the 757, 747, 
port aircraft by the Boeing Company in 1985. Notice that the 767, and 707. 


Figure 6-11 
Histograms for sym- 
metric and skewed 
distributions. 
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A chart of occurrences by category (in which the categories are ordered by the number of 
occurrences) is sometimes referred to as a Pareto chart. An exercise asks you to construct 


such a chart. 


In this section we have concentrated on descriptive methods for the situation in which each 
observation in a data set is a single number or belongs to one category. In many cases, we work 
with data in which each observation consists of several measurements. For example, in a gaso- 
line mileage study, each observation might consist of a measurement of miles per gallon, the size 
of the engine in the vehicle, engine horsepower, vehicle weight, and vehicle length. This is an ex- 
ample of multivariate data. In later chapters, we will discuss analyzing this type of data. 


EXERCISES FOR SECTION 6-3 


6-38. Construct a frequency distribution and histogram for 
the motor fuel octane data from Exercise 6-22. Use eight bins. 


6-39. Construct a frequency distribution and histogram us- 
ing the failure data from Exercise 6-23. 


6-40. Construct a frequency distribution and histogram for 
the cotton content data in Exercise 6-24. 


6-41. Construct a frequency distribution and histogram for 
the yield data in Exercise 6-25. 


6-42. Construct frequency distributions and histograms 
with 8 bins and 16 bins for the motor fuel octane data in 
Exercise 6-22. Compare the histograms. Do both histograms 
display similar information? 

6-43. Construct histograms with 8 and 16 bins for the 
data in Exercise 6-23. Compare the histograms. Do both his- 
tograms display similar information? 

6-44. Construct histograms with 8 and 16 bins for the data 
in Exercise 6-24. Compare the histograms. Do both his- 
tograms display similar information? 


6-45. Construct a histogram for the energy consumption 
data in Exercise 6-29. 


6-46. Construct a histogram for the female student height 
data in Exercise 6-30. 


6-47. Construct a histogram for the spot weld shear strength 
data in Exercise 6-31. Comment on the shape of the his- 
togram. Does it convey the same information as the stem-and- 
leaf display? 


6-48. Construct a histogram for the water quality data in 
Exercise 6-32. Comment on the shape of the histogram. Does 
it convey the same information as the stem-and-leaf display? 


6-49. Construct a histogram for the overall golf distance data 
in Exercise 6-33. Comment on the shape of the histogram. Does 
it convey the same information as the stem-and-leaf display? 


6-50. Construct a histogram for the semiconductor speed data 
in Exercise 6-34. Comment on the shape of the histogram. Does 
it convey the same information as the stem-and-leaf display? 


6-51. Construct a histogram for the pinot noir wine rating data 
in Exercise 6-35. Comment on the shape of the histogram. Does 
it convey the same information as the stem-and-leaf display? 


6-52. The Pareto Chart. An important variation of a his- 
togram for categorical data is the Pareto chart. This chart is 
widely used in quality improvement efforts, and the categories 
usually represent different types of defects, failure modes, or 
product/process problems. The categories are ordered so that 
the category with the largest frequency is on the left, followed 
by the category with the second largest frequency, and so forth. 
These charts are named after the Italian economist V. Pareto, 
and they usually exhibit “Pareto’s law”; that is, most of the 
defects can be accounted for by only a few categories. Suppose 
that the following information on structural defects in automo- 
bile doors is obtained: dents, 4; pits, 4; parts assembled out of 
sequence, 6; parts undertrimmed, 21; missing holes/slots, 8; 
parts not lubricated, 5; parts out of contour, 30; and parts not 
deburred, 3. Construct and interpret a Pareto chart. 
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Figure 6-13 Descrip- 
tion of a box plot. 
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Figure 6-14 Box 
plot for compressive 
strength data in 
Table 6-2. 


The stem-and-leaf display and the histogram provide general visual impressions about a data 
set, while numerical quantities such as x or s provide information about only one feature of 
the data. The box plot is a graphical display that simultaneously describes several important 
features of a data set, such as center, spread, departure from symmetry, and identification of 
unusual observations or outliers. 

A box plot displays the three quartiles, the minimum, and the maximum of the data on a rec- 
tangular box, aligned either horizontally or vertically. The box encloses the interquartile range with 
the left (or lower) edge at the first quartile, g,, and the right (or upper) edge at the third quartile, 3. 
A line is drawn through the box at the second quartile (which is the 50th percentile or the median), 
q2 = x.A line, or whisker, extends from each end of the box. The lower whisker is a line from the 
first quartile to the smallest data point within 1.5 interquartile ranges from the first quartile. The 
upper whisker is a line from the third quartile to the largest data point within 1.5 interquartile 
ranges from the third quartile. Data farther from the box than the whiskers are plotted as individ- 
ual points. A point beyond a whisker, but less than three interquartile ranges from the box edge, is 
called an outlier. A point more than three interquartile ranges from the box edge is called an 
extreme outlier. See Fig. 6-13. Occasionally, different symbols, such as open and filled circles, are 
used to identify the two types of outliers. Sometimes box plots are called box-and-whisker plots. 

Figure 6-14 presents the box plot from Minitab for the alloy compressive strength data 
shown in Table 6-2. This box plot indicates that the distribution of compressive strengths is 
fairly symmetric around the central value, because the left and right whiskers and the lengths 
of the left and right boxes around the median are about the same. There are also two mild out- 
liers at lower strength and one at higher strength. The upper whisker extends to observation 
237 because it is the greatest observation below the limit for upper outliers. This limit is 
q3 + 1.5IQR = 181 + 1.5(181 — 143.5) = 237.25. The lower whisker extends to observation 97 
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because it is the smallest observation above the limit for lower outliers. This limit is 
q, — 1.5IQR = 143.5 — 1.5(181 — 143.5) = 87.25. 

Box plots are very useful in graphical comparisons among data sets, because they have 
high visual impact and are easy to understand. For example, Fig. 6-15 shows the comparative 
box plots for a manufacturing quality index on semiconductor devices at three manufacturing 
plants. Inspection of this display reveals that there is too much variability at plant 2 and that 
plants 2 and 3 need to raise their quality index performance. 


EXERCISES FOR SECTION 6-4 


6-53. The “cold start ignition time” of an automobile engine 

is being investigated by a gasoline manufacturer. The follow- 

ing times (in seconds) were obtained for a test vehicle: 1.75, 

1.92, 2.62, 2.35, 3.09, 3.15, 2.53, 1.91. 

(a) Calculate the sample mean, sample variance, and sample 
standard deviation. 

(b) Construct a box plot of the data. 


6-54. An article in Transactions of the Institution of 

Chemical Engineers (Vol. 34, 1956, pp. 280-293) reported 

data from an experiment investigating the effect of several 

process variables on the vapor phase oxidation of naphtha- 

lene. A sample of the percentage mole conversion of naphtha- 

lene to maleic anhydride follows: 4.2, 4.7, 4.7, 5.0, 3.8, 3.6, 

3.0, 5.1, 3.1, 3.8, 4.8, 4.0, 5.2, 4.3, 2.8, 2.0, 2.8, 3.3, 4.8, 5.0. 

(a) Calculate the sample mean, sample variance, and sample 
standard deviation. 

(b) Construct a box plot of the data. 

6-55. The nine measurements that follow are furnace tem- 

peratures recorded on successive batches in a semiconductor 

manufacturing process (units are °F): 953, 950, 948, 955, 951, 

949, 957, 954, 955. 

(a) Calculate the sample mean, sample variance, and standard 
deviation. 

(b) Find the median. How much could the largest temperature 
measurement increase without changing the median value? 

(c) Construct a box plot of the data. 


6-56. Exercise 6-18 presents drag coefficients for the 
NASA 0012 airfoil. You were asked to calculate the sample 
mean, sample variance, and sample standard deviation of 
those coefficients. 

(a) Find the median and the upper and lower quartiles of the 
drag coefficients. 

(b) Construct a box plot of the data. 

(c) Set aside the largest observation (100) and rework parts 
(a) and (b). Comment on your findings. 

6-57. Exercise 6-19 presented the joint temperatures of 

the O-rings (°F) for each test firing or actual launch of the 

space shuttle rocket motor. In that exercise you were asked 
to find the sample mean and sample standard deviation of 
temperature. 

(a) Find the median and the upper and lower quartiles of 
temperature. 

(b) Set aside the smallest observation (31°F) and recompute 
the quantities in part (a). Comment on your findings. How 
“different” are the other temperatures from this smallest 
value? 

(c) Construct a box plot of the data and comment on the pos- 
sible presence of outliers. 

6-58. Reconsider the motor fuel octane rating data in 

Exercise 6-20. Construct a box plot of the data and write an 

interpretation of the plot. How does the box plot compare in in- 

terpretive value to the original stem-and-leaf diagram? 
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6-59. Reconsider the energy consumption data in Exercise 
6-29. Construct a box plot of the data and write an interpreta- 
tion of the plot. How does the box plot compare in interpretive 
value to the original stem-and-leaf diagram? 


6-60. Reconsider the water quality data in Exercise 6-32. 
Construct a box plot of the concentrations and write an interpre- 
tation of the plot. How does the box plot compare in interpretive 
value to the original stem-and-leaf diagram? 


for leukemia. One group received a high dose of the drug while 
the control group received no treatment. Expression data (mea- 
sures of gene activity) from one gene are shown in the follow- 
ing table. Construct a box plot for each group of patients. Write 
an interpretation to compare the information in these plots. 


Gene Expression 


High Dose Control Control Control 

6-61. Reconsider the weld strength data in Exercise 6-31. 16.1 07.1 820.1 166.5 
Construct a box plot of the data and write an interpretation of 134.9 491.8 82.5 2258.4 
the plot. How does the box plot compare in interpretive value 

a f Ss7) 1332.9 713.9 497.5 
to the original stem-and-leaf diagram? 
6-62. Reconsider the semiconductor speed data in Exercise \44 — _—. eee 
6-34. Construct a box plot of the data and write an interpretation a eee ae ae 
of the plot. How does the box plot compare in interpretive value 99 335.4 31.9 351.4 
to the original stem-and-leaf diagram? 24.3 528.9 86.3 678.9 
6-63. Use the data on heights of female and male engineer- 16.3 24.1 646.6 3010.2 
ing students from Exercises 6-30 and 6-37 to construct 15.2 545.2 169.9 67.1 
comparative box plots. Write an interpretation of the informa- 47.7 92.9 20.2 318.2 
tion that you see in these plots. 12.9 337.1 280.2 2476.4 
6-64. In Exercise 6-53, data were presented on the cold start 72.7 102.3 194.2 181.4 
ignition time of a particular gasoline used in a test vehicle. A 126.7 255.1 408.4 2081.5 
second formulation of the gasoline was tested in the same ve- 46.4 100.5 155.5 424.3 
hicle, with the following times (in seconds): 1.83, 1.99, 3.13, 60.3 159.9 864.6 188.1 
3.29, 2.65, 2.87, 3.40, 2.46, 1.89, and 3.35. Use these new data 23.5 168 355.4 563 
along with the cold start times reported in Exercise 6-53 to 43.6 95.2 634 149.1 
construct comparative box plots. Write an interpretation of the 79.4 132.5 029.9 2122.9 
information that you see in these plots. 38 442.6 362.1 1295.9 
6-65. An article in Nature Genetics [‘Treatment-specific 58.2 15.8 
Changes in Gene Expression Discriminate in Vivo Drug 26.5 175.6 
Response In Human Leukemia Cells” (2003, Vol. 34(1), 95.1 1311 


pp. 85—90)] studied gene expression as a function of treatments 


6-5 TIME SEQUENCE PLOTS 


The graphical displays that we have considered thus far such as histograms, stem-and-leaf 
plots, and box plots are very useful visual methods for showing the variability in data. 
However, we noted in Chapter | that time is an important factor that contributes to variability 
in data, and those graphical methods do not take this into account. A time series or time se- 
quence is a data set in which the observations are recorded in the order in which they occur. 
A time series plot is a graph in which the vertical axis denotes the observed value of the vari- 
able (say, x) and the horizontal axis denotes the time (which could be minutes, days, years, etc.). 
When measurements are plotted as a time series, we often see trends, cycles, or other broad 
features of the data that could not be seen otherwise. 

For example, consider Fig. 6-16(a), which presents a time series plot of the annual sales 
of a company for the last 10 years. The general impression from this display is that sales show 
an upward trend. There is some variability about this trend, with some years’ sales increasing 
over those of the last year and some years’ sales decreasing. Figure 6-16(b) shows the last 
three years of sales reported by quarter. This plot clearly shows that the annual sales in this 
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Sales, x 
Sales, x 


1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 Years 12 3 4 1 2 3 4 1 2 3 = 4 Quarters 
1989 1990 1991 


(a) (b) 
Figure 6-16 Company sales by year (a) and by quarter (b). 


business exhibit a cyclic variability by quarter, with the first- and second-quarter sales being 
generally greater than sales during the third and fourth quarters. 

Sometimes it can be very helpful to combine a time series plot with some of the other 
graphical displays that we have considered previously. J. Stuart Hunter (The American 
Statistician, Vol. 42, 1988, p. 54) has suggested combining the stem-and-leaf plot with a time 
series plot to form a digidot plot. 

Figure 6-17 shows a digidot plot for the observations on compressive strength from 
Table 6-2, assuming that these observations are recorded in the order in which they 
occurred. This plot effectively displays the overall variability in the compressive strength 
data and simultaneously shows the variability in these measurements over time. The general 
impression is that compressive strength varies around the mean value of 162.66, and there 
is no strong obvious pattern in this variability over time. 
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Figure 6-17 A digidot plot of the compressive strength data in Table 6-2. 
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The digidot plot in Fig. 6-18 tells a different story. This plot summarizes 30 observations on 
concentration of the output product from a chemical process, where the observations are recorded 
at one-hour time intervals. This plot indicates that during the first 20 hours of operation this 
process produced concentrations generally above 85 grams per liter, but that following sample 20, 
something may have occurred in the process that resulted in lower concentrations. If this variabil- 
ity in output product concentration can be reduced, operation of this process can be improved. 
Notice that this apparent change in the process output is not seen in the stem-and-leaf portion of 
the digidot plot. The stem-and-leaf plot compresses the time dimension out of the data. This illus- 
trates why it is always important to construct a time series plot for time-oriented data. 


EXERCISES FOR SECTION 6-5 


6-66. The following data are the viscosity measurements 
for a chemical product observed hourly (read down, then left 
to right). Construct and interpret either a digidot plot or a sep- 
arate stem-and-leaf and time series plot of these data. 
Specifications on product viscosity are at 48 + 2. What con- 
clusions can you make about process performance? 


47.9 48.6 48.0 48.1 43.0 43.2 
47.9 48.8 47.5 48.0 42.9 43.6 
48.6 48.1 48.6 48.3 43.6 43.2 
48.0 48.3 48.0 43.2 43.3 43.5 
48.4 47.2 47.9 43.0 43.0 43.0 
48.1 48.9 48.3 43.5 42.8 

48.0 48.6 48.5 43.1 43.1 

6-67. The pull-off force for a connector is measured in a 


laboratory test. Data for 40 test specimens follow (read 
down, then left to right). Construct and interpret either a 
digidot plot or a separate stem-and-leaf and time series plot 
of the data. 


241 203 201 251 236 190 
258 195, 195 238 245 175 
237 249 255 210 209 178 
210 220 245 198 212 175 
194 194 235 199 185 190 
225 245 220 183 187 

248 209 249 213 218 


6-68. In their book Time Series Analysis, Forecasting, and 
Control (Prentice Hall, 1994), G. E. P. Box, G. M. Jenkins, 
and G. C. Reinsel present chemical process concentration 
readings made every two hours. Some of these data follow 
(read down, then left to right). 


17.0 16.7 17.1 17.5 17.6 
16.6 17.4 17.4 18.1 17.5 
16.3 17.2 17.4 17.5 16.5 
16.1 17.4 17.5 17.4 17.8 
17.1 17.4 17.4 17.4 17.3 
16.9 17.0 17.6 17.1 17.3 
16.8 17.3 17.4 17.6 17.1 
17.4 17.2 17.3 17.7 17.4 
17.1 17.4 17.0 17.4 16.9 
17.0 16.8 17.8 17.8 17.3 


Construct and interpret either a digidot plot or a stem-and- 
leaf plot of these data. 


6-69. The 100 annual Wolfer sunspot numbers from 1770 
to 1869 follow. (For an interesting analysis and interpreta- 
tion of these numbers, see the book by Box, Jenkins, and 
Reinsel referenced in Exercise 6-68. Their analysis requires 
some advanced knowledge of statistics and statistical model 
building.) Read down, then left to right. The 1869 result is 
74. Construct and interpret either a digidot plot or a stem- 
and-leaf and time series plot of these data. 


101 31 154 38 83 90 
82 7 125 23 132 67 
66 20 85 10 131 60 
35 92 68 24 118 47 
41 10 16 8 62 94 
21 8 7 13 98 96 
16 2 4 57 124 77 

6 0 122 96 59 
4 1 8 138 66 44 
7 5 17 103 64 47 
14 12 36 86 54 30 
34 14 50 63 39 16 

45 35 62 37 21 7 
43 46 67 24 7 37 
48 41 71 11 4 74 
42 30 48 15 23 
28 24 28 40 55 

6-70. In their book Introduction to Time Series Analysis 


and Forecasting, (Wiley, 2008), Montgomery, Jennings, and 
Kolahci presented the data in Table 6-5, which are the monthly 
total passenger airline miles flown in the United Kingdom 
from 1964 to 1970 (in millions of miles). Comment on any 
features of the data that are apparent. Construct and interpret 
either a digidot plot or a separate stem-and-leaf and time series 
plot of these data. 

6-71. The following table shows the number of earthquakes 
per year of magnitude 7.0 and greater since 1900 (source: 
Earthquake Data Base System of the U.S. Geological Survey, 
National Earthquake Information Center, Golden, Colorado). 


1900 13 1904 16 1908 18 1912 22 
1901 14 1905 26 1909 32 1913 23 
1902 8 1906 32 1910 36 1914 22 
1903 10 1907 27 1911 24 1915 18 


Table 6-5 United Kingdom Passenger Airline Miles Flown 


6-5 TIME SEQUENCE PLOTS 


1916 25 1942 27 1968 30 1994 13 
1917-21 1943 41 1969 27 1995 20 
1918 921 1944 31 1970 29 1996 15 
1919 14 1945 27 1971 23 1997 16 
1920 8 1946 35 1972 20 1998 12 
1921 11 1947. 26 1973 16 1999 18 
1922 14 1948 28 1974 21 2000 = = 15 
1923, 23 1949 36 1975 21 2001 16 
1924 18 1950 39 1976 25 2002 13 
1925, 17 1951 21 1977, 16 =6.2003— ss 15 
1926 19 1952 17 1978 18 2004 16 
1927, 20 1953 22 1979 15 2005 11 
1928 22 1954 17 1980 18 2006 11 
1929, 19 1955 19 1981 14 2007 18 
1930 13 1956 15 1982 10 2008 12 
1931 26 1957 34 1983 15 2009 15 
1932, 13 1958 10 1984 8 
1933 14 1959 15 1985 15 
1934 22 1960 22 1986 6 
1935 24 1961 18 1987 11 
1936 =. 21 1962 15 1988 8 
1937 22 1963 20 1989 7 
1938 26 1964 15 1990 18 
193921 1965 22 1991 16 
1940 23 1966 19 1992 13 
1941 24 1967 16 1993 12 


Construct and interpret either a digidot plot or a separate stem- 
and-leaf and time series plot of these data. 

6-72. The following table shows U.S. petroleum imports, 
imports as a percentage of total, and Persian Gulf imports as a 
percentage of all imports by year since 1973 (source: U.S. 
Department of Energy Web site, http://www.eia.doe.gov/). 
Construct and interpret either a digidot plot or a separate stem- 
and-leaf and time series plot for each column of data. 


Month 1964 1965 1966 1967 1968 1969 1970 
Jan. 7.269 8.350 8.186 8.334 8.639 9.491 10.840 
Feb. 6.775 7.829 7444 7.899 8.772 8.919 10.436 
Mar. 7.819 8.829 8.484 9.994 10.894 11.607 13.589 
Apr 8.371 9.948 9.864 10.078 10.455 8.852 13.402 
May 9.069 10.638 10.252 10.801 11.179 12.537 13.103 
June 10.248 11.253 12.282 12.953 10.588 14.759 14.933 
July 11.030 11.424 11.637 12.222 10.794 13.667 14.147 
Aug. 10.882 11.391 11.577 12.246 12.770 13.731 14.057 
Sept. 10.333 10.665 12.417 13.281 13.812 15.110 16.234 
Oct 9.109 9.396 9.637 10.366 10.857 12.185 12.389 
Nov. 7.685 WAYS 8.094 8.730 9.290 10.645 11.594 
Dec 7.682 7.933 9.280 9.614 10.925 12.161 12.772 


se 
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Total Petroleum Petroleum Imports 

Petroleum Imports Imports as Percent from Persian Gulf 

(thousand barrels of Petroleum as Percent of Total 

Year per day) Products Supplied Petroleum Imports 
1973 6256 36.1 13.5 
1974 6112 36.7 17.0 
1975 6055 37.1 12 
1976 7313 41.8 25.1 
1977 8807 47.7 27.8 
1978 8363 44.3 26.5 
1979 8456 45.6 24.4 
1980 6909 40.5 21.9 
1981 5996 33} 20.3 
1982 5113 33.4 13.6 
1983 5051 33.1 8.7 
1984 5437 34.5 9.3 
1985 5067 SII 6.1 
1986 6224 38.2 14.6 
1987 6678 40.0 16.1 
1988 7402 42.8 20.8 
1989 8061 46.5 23.0 
1990 8018 47.1 24.5 
1991 7627 45.6 24.1 
1992 7888 46.3 22.5 
1993 8620 50.0 20.6 
1994 8996 50.7 19.2 
1995 8835 49.8 17.8 
1996 9478 51.7 16.9 
1997 10,162 54.5 Ie 
1998 10,708 56.6 19.9 
1999 10,852 55.5 Doe 
2000 11,459 58.1 21.7 
2001 11,871 60.4 P32 
2002 11,530 58.3 19.6 
2003 12,264 61.2 20.3 
2004 13,145 63.4 18.9 
2005 13,714 65.9 17.0 
2006 13,707 66.3 16.1 
2007 13,468 65.1 16.1 
2008 12,915 66.2 18.4 


6-6 PROBABILITY PLOTS 


How do we know if a particular probability distribution is a reasonable model for data? 
Sometimes this is an important question, because many of the statistical techniques presented 
in subsequent chapters are based on an assumption that the population distribution is of a spe- 
cific type. Thus, we can think of determining whether data come from a specific probability 
distribution as verifying assumptions. In other cases, the form of the distribution can give 
insight into the underlying physical mechanism generating the data. For example, in reliability 
engineering, verifying that time-to-failure data come from an exponential distribution identifies 
the failure mechanism in the sense that the failure rate is constant with respect to time. 
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Some of the visual displays we have used earlier, such as the histogram, can provide 
insight about the form of the underlying distribution. However, histograms are usually not 
really reliable indicators of the distribution form unless the sample size is very large. A 
probability plot is a graphical method for determining whether sample data conform to a 
hypothesized distribution based on a subjective visual examination of the data. The general 
procedure is very simple and can be performed quickly. It is also more reliable than the his- 
togram for small- to moderate-size samples. Probability plotting typically uses special axes 
that have been scaled for the hypothesized distribution. Software is widely available for the 
normal, lognormal, Weibull, and various chi-square and gamma distributions. We focus pri- 
marily on normal probability plots because many statistical techniques are appropriate only 
when the population is (at least approximately) normal. 

To construct a probability plot, the observations in the sample are first ranked from small- 
est to largest. That is, the sample 2), %2,...,x, is arranged as X(1), X(2)---,X(n), Where 
x(1) is the smallest observation, x,g) is the second-smallest observation, and so forth, with x;,) 
the largest. The ordered observations x,;, are then plotted against their observed cumulative 
frequency (j — 0.5)/n on the appropriate probability paper. If the hypothesized distribution 
adequately describes the data, the plotted points will fall approximately along a straight line; 
if the plotted points deviate significantly from a straight line, the hypothesized model is not ap- 
propriate. Usually, the determination of whether or not the data plot as a straight line is sub- 
jective. The procedure is illustrated in the following example. 


EXAMPLE 6-7 Battery Life 


Ten observations on the effective service life in minutes of 
batteries used in a portable personal computer are as follows: 
176, 191, 214, 220, 205, 192, 201, 190, 183, 185. We 
hypothesize that battery life is adequately modeled by a nor- 
mal distribution. To use probability plotting to investigate this 
hypothesis, first arrange the observations in ascending order 
and calculate their cumulative frequencies (j — 0.5)/10 as 
shown in Table 6-6. 

The pairs of values x;;) and (j — 0.5)/10 are now plotted 
on normal probability axes. This plot is shown in Fig. 6-19. 


Most normal probability plots have 100(j — 0.5)/n on the left 
vertical scale and (sometimes) 100[1 — (j — 0.5)/n] on the 
right vertical scale, with the variable value plotted on the hori- 
zontal scale. A straight line, chosen subjectively, has been 
drawn through the plotted points. In drawing the straight line, 
you should be influenced more by the points near the middle of 
the plot than by the extreme points. A good rule of thumb is to 
draw the line approximately between the 25th and 75th per- 
centile points. This is how the line in Fig. 6-19 was determined. 
In assessing the “closeness” of the points to the straight line, 


99.9 0.1 
Table 6-6 Calculation for Constructing a Normal 
Probability Plot 99 1 
j Xj) (j — 0.5)/10 a 95 5 os 
1 176 0.05 —1.64 & 80 20 S 
2 183 0.15 —1.04 3 ‘ i 
I 50 o 50 > 
3 185 0.25 —0.67 = vd 
4 190 0.35 —0.39 S 20 go 5 
o 
5 191 0.45 =e 5. ae. 
6 192 0.55 0.13 
7 201 0.65 0.39 : 99 
8 205 0.75 0.67 0.1 99.9 
9 214 0.85 1.04 170 180 190 200 210 220 
10 220 0.95 1.64 ue 


Figure 6-19 Normal probability plot for battery life. 
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3.30 
1.65 
2] 
6) 
-1.65 
Figure 6-20 Normal 
probability plot 
obtained from 3.30 
statidlardized 1ionial 170 180 190 200 210 220 
scores. Xj) 
imagine a “fat pencil” lying along the line. If all the points are pass the “fat pencil’ test, we conclude that the normal distri- 
covered by this imaginary pencil, a normal distribution ade- bution is an appropriate model. 


quately describes the data. Since the points in Fig. 6-19 would 


A normal probability plot can also be constructed on ordinary axes by plotting the stan- 
dardized normal scores z; against x,, where the standardized normal scores satisfy 


pr 05 _ 


n 


For example, if (7 — 0.5)/n = 0.05, ®(z;) = 0.05 implies that z; = —1.64. To illustrate, 
consider the data from Example 6-4. In the last column of Table 6-6 we show the standardized 
normal scores. Figure 6-20 presents the plot of z; versus x(; This normal probability plot is 
equivalent to the one in Fig. 6-19. 

We have constructed our probability plots with the probability scale (or the z-scale) on the 
vertical axis. Some computer packages “flip” the axis and put the probability scale on the hor- 
izontal axis. 

The normal probability plot can be useful in identifying distributions that are symmetric 
but that have tails that are “heavier” or “lighter” than the normal. They can also be useful in 
identifying skewed distributions. When a sample is selected from a light-tailed distribution 
(such as the uniform distribution), the smallest and largest observations will not be as extreme 
as would be expected in a sample from a normal distribution. Thus, if we consider the straight 
line drawn through the observations at the center of the normal probability plot, observations 
on the left side will tend to fall below the line, whereas observations on the right side will tend 
to fall above the line. This will produce an S-shaped normal probability plot such as shown in 
Fig. 6-21(a). A heavy-tailed distribution will result in data that also produce an S-shaped nor- 
mal probability plot, but now the observations on the left will be above the straight line and the 
observations on the right will lie below the line. See Fig. 6-19(b). A positively skewed distri- 
bution will tend to produce a pattern such as shown in Fig. 6-19(c), where points on both ends 
of the plot tend to fall below the line, giving a curved shape to the plot. This occurs because 
both the smallest and the largest observations from this type of distribution are larger than 
expected in a sample from a normal distribution. 

Even when the underlying population is exactly normal, the sample data will not plot 
exactly on a straight line. Some judgment and experience are required to evaluate the plot. 
Generally, if the sample size is n < 30, there can be a lot of deviation from linearity in normal 
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Figure 6-21 
distribution. (c) A distribution with positive (or right) skew. 


190 200 210 220 
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Normal probability plots indicating a nonnormal distribution. (a) Light-tailed distribution. (b) Heavy-tailed 


plots, so in these cases only a very severe departure from linearity should be interpreted as a 
strong indication of nonnormality. As n increases, the linear pattern will tend to become 
stronger, and the normal probability plot will be easier to interpret and more reliable as an 
indicator of the form of the distribution. 


EXERCISES FOR SECTION 6-6 


6-73. Construct a normal probability plot of the piston ring 
diameter data in Exercise 6-7. Does it seem reasonable to 
assume that piston ring diameter is normally distributed? 


6-74. Construct a normal probability plot of the insulating 
fluid breakdown time data in Exercise 6-8. Does it seem 
reasonable to assume that breakdown time is normally 
distributed? 


6-75. Construct a normal probability plot of the visual 
accommodation data in Exercise 6-11. Does it seem reasonable 
to assume that visual accommodation is normally distributed? 


6-76. Construct a normal probability plot of the solar inten- 
sity data in Exercise 6-12. Does it seem reasonable to assume 
that solar intensity is normally distributed? 


6-77. Construct a normal probability plot of the O-ring joint 
temperature data in Exercise 6-19. Does it seem reasonable to 
assume that O-ring joint temperature is normally distributed? 
Discuss any interesting features that you see on the plot. 


6-78. Construct a normal probability plot of the octane rat- 
ing data in Exercise 6-22. Does it seem reasonable to assume 
that octane rating is normally distributed? 


6-79. Construct a normal probability plot of the cycles to 
failure data in Exercise 6-23. Does it seem reasonable to as- 
sume that cycles to failure is normally distributed? 


6-80. Construct a normal probability plot of the sus- 
pended solids concentration data in Exercise 6-32. Does it 
seem reasonable to assume that the concentration of 
suspended solids in water from this particular lake is 
normally distributed? 


6-81. Construct two normal probability plots for the 
height data in Exercises 6-30 and 6-37. Plot the data for 
female and male students on the same axes. Does height 
seem to be normally distributed for either group of students? 
If both populations have the same variance, the two normal 
probability plots should have identical slopes. What conclu- 
sions would you draw about the heights of the two groups of 
students from visual examination of the normal probability 
plots? 


6-82. It is possible to obtain a “quick and dirty” estimate of 
the mean of a normal distribution from the fiftieth percentile 
value on a normal probability plot. Provide an argument why 
this is so. It is also possible to obtain an estimate of the stan- 
dard deviation of a normal distribution by subtracting the 
eighty-fourth percentile value from the fiftieth percentile 
value. Provide an argument why this is so. 


Supplemental Exercises 


6-83. The National Oceanic and Atmospheric Admin- 
istration provided the monthly absolute estimates of global 
(land and ocean combined) temperature index (degrees C) 
from 2000. Read January to December from left to right 
(source: http://www.ncdc.noaa. gov/oa/climate/research/anom- 
alies/anomalies.html). Construct and interpret either a digidot 
plot or a separate stem-and-leaf and time series plot of these 
data. 


s 


es 


= 


218 


CHAPTER 6 DESCRIPTIVE STATISTICS 


Global Monthly Temperature 


Year 1 2 3 4 5 6 7 8 9 10 11 12 
2000 12.3 12.6 13.2 14.3 15.3 15.9 16.2 16.0 15.4 14.3 13.1 12.5 
2001 12.4 12.5 13.3 14.2 15.4 16.0 16.3 16.2 15.5 14.5 13.5 12.7 
2002 Ie7 12.9 13.4 14.2 15.3 16.1 16.4 16.1 15.5 14.5 13.5 12.6 
2003 12.6 12.6 13.2 14.2 15.4 16.0 16.3 16.2 15.6 14.7 13.4 12.9 
2004 12.6 12.8 113).3) 14.3 15.2 16.0 16.3 16.1 15.5 14.6 13.6 12.7 
2005 12.6 12.5 13.4 14.4 15.4 16.2 16.4 16.2 15.7 14.6 13.6 12.8 
2006 12.4 12.6 13.2 14.2 15.3 16.1 16.4 16.2 15.6 14.6 13.5 12.9 
2007 12.8 12.7 13.3 14.4 15.3 16.0 16.3 16.1 15.5 14.5 13.4 12.6 
2008 12.2 12.4 13.4 14.1 15.2 16.0 16.3 16.1 15.5 14.6 13.5 12.7 
2009 12.5 12.6 13.2 14.3 15:3 16.1 16.4 16.2 15.6 


6-84. The concentration of a solution is measured six times by 

one operator using the same instrument. She obtains the follow- 

ing data: 63.2, 67.1, 65.8, 64.0, 65.1, and 65.3 (grams per liter). 

(a) Calculate the sample mean. Suppose that the desirable value 
for this solution has been specified to be 65.0 grams per 
liter. Do you think that the sample mean value computed 
here is close enough to the target value to accept the solution 
as conforming to target? Explain your reasoning. 

(b) Calculate the sample variance and sample standard 
deviation. 

(c) Suppose that in measuring the concentration, the operator 
must set up an apparatus and use a reagent material. What 
do you think the major sources of variability are in this ex- 
periment? Why is it desirable to have a small variance of 
these measurements? 


6-85. The table below shows unemployment data for the 
US. that are seasonally adjusted. Construct a time series plot 
of these data and comment on any features (source: U.S. 
Bureau of Labor Web site, http://data.bls.gov). 


6-86. A sample of six resistors yielded the following resis- 


(a) Compute the sample variance and sample standard 
deviation. 

(b) Subtract 35 from each of the original resistance measure- 
ments and compute s* and s. Compare your results with 
those obtained in part (a) and explain your findings. 

(c) If the resistances were 450, 380, 470, 410, 350, and 430 
ohms, could you use the results of previous parts of this 
problem to find s? and s? 


6-87. 
Sample 1: 10, 9, 8, 7, 8, 6, 10, 6 
Sample 2: 10, 6, 10, 6, 8, 10, 8, 6 


(a) Calculate the sample range for both samples. Would you con- 
clude that both samples exhibit the same variability? Explain. 

(b) Calculate the sample standard deviations for both samples. 
Do these quantities indicate that both samples have the 
same variability? Explain. 

(c) Write a short statement contrasting the sample range versus 
the sample standard deviation as a measure of variability. 


Consider the following two samples: 


6-88. An article in Quality Engineering (Vol. 4, 1992, pp. 


tances (ohms): x; = 45, x. = 38, x3 = 47, x4 = 41, x5 = 35, 487-495) presents viscosity data from a batch chemical 
and x5 = 43. process. A sample of these data follows on p. 219: 
Unemployment Percentage 

Year Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec 
1999 43 44 4.2 4.3 4.2 43 4.3 4.2 4.2 4.1 4.1 4.0 
2000 4.0 4.1 4.0 3.8 4.0 4.0 4.0 4.1 3.9 3.9 3.9 3.9 
2001 4.2 4.2 4.3 44 4.3 45 4.6 4.9 5.0 5.3 So) Sell 
2002 5.7 5.7 5.7 5.9 5.8 5.8 5.8 5.7 5.7 5.7 5.9 6.0 
2003 5.8 ae) 3s) 6.0 6.1 6.3 6.2 6.1 6.1 6.0 5.8 Sell 
2004 5.7 5.6 5.8 5.6 5.6 5.6 5.5 5.4 5.4 5.5 5.4 5.4 
2005 37) 5.4 Sy) Sy) 5.1 5.1 5.0 4.9 5.0 5.0 5.0 4.8 
2006 4.7 4.8 4.7 4.7 47 4.6 4.7 4.7 4.5 4.4 45 4.4 
2007 4.6 45 44 45 45 4.6 47 4.7 47 4.8 4.7 4.9 
2008 4.9 4.8 5.1 5.0 5.5 5.6 5.8 6.2 6.2 6.6 6.8 iP) 
2009 7.6 8.1 8.5 8.9 9.4 oS) 9.4 O.7 9.8 


13.3 143 149 152 15.8 142 160 14.0 
145 161 13.7 152 13.7 169 149 144 
15.3 13.1 15.22 15.9 15.1 149 13.6 13.7 
15.3 155 145 165 134 15.2 15.3 13.8 
143 126 153 148 141 #144 143 15.6 
148 146 156 15.1 148 15.2 15.6 14.5 
15.2 143 158 170 143 146 16.1 128 
1445 154 133 149 143 164 13.9 16.1 
146 152 141 148 164 142 15.2 16.6 
14.1 168 154 140 169 15.7. 144 15.6 


(a) Reading down and left to right, draw a time series plot of 
all the data and comment on any features of the data that 
are revealed by this plot. 


(b) 


Consider the notion that the first 40 observations were 


generated from a specific process, whereas the last 40 
observations were generated from a different process. 
Does the plot indicate that the two processes generate 
similar results? 


(c) 


Compute the sample mean and sample variance of the first 


40 observations; then compute these values for the second 
40 observations. Do these quantities indicate that both 
processes yield the same mean level? The same variabil- 


ity? Explain. 
6-89. The total net electricity consumption of the U.S. by 
year from 1980 to 2007 (in billion kilowatt-hours) follows. 
Net consumption excludes the energy consumed by the gener- 
ating units. Read left to right. 
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Construct a time series plot of these data. Construct and inter- 

pret a stem-and-leaf display of these data. 

6-90. Reconsider the data from Exercise 6-88. Prepare 

comparative box plots for two groups of observations: the 

first 40 and the last 40. Comment on the information in the 

box plots. 

6-91. The data shown in Table 6-7 are monthly champagne 

sales in France (1962-1969) in thousands of bottles. 

(a) Construct a time series plot of the data and comment on 
any features of the data that are revealed by this plot. 

(b) Speculate on how you would use a graphical procedure to 
forecast monthly champagne sales for the year 1970. 
6-92. The following data are the temperatures of effluent at 
discharge from a sewage treatment facility on consecutive 

days: 


43 47 51 48 52 50 46 49 
45 52 46 51 44 49 46 51 
49 45 44 50 48 50 49 50 


(a) Calculate the sample mean, sample median, sample vari- 
ance, and sample standard deviation. 

(b) Construct a box plot of the data and comment on the in- 
formation in this display. 

6-93. A manufacturer of coil springs is interested in imple- 

menting a quality control system to monitor his production 

process. As part of this quality system, it is decided to record 

the number of nonconforming coil springs in each production 

batch of size 50. During 40 days of production, 40 batches of 

data were collected as follows: 


1980 2094.4 1981 2147.1 1982 2086.4 1983 2151.0 

1984 2285.8 1985 2324.0 1986 2368.8 1987 2457.3 Read data across. 

1988 2578.1 1989 2755.6 1990 2837.1 1991 2886.1 2 2 6 8 FS & LE +  & F 
1992 2897.2 1993 3000.7 1994 3080.9 1995 3164.0 8 5 9 7 8 1 3 6 7 7 
1996 3253.8 1997 3301.8 1998 3425.1 1999 3483.7 11 4 4 8 7 5 4 5 8 

2000 3592.4 2001 3557.1 2002 3631.7 2003 3662.0 9 19 18 12 11 7 15 17 1313 

2004 3715.9 2005 3811.0 2006 3816.8 2007 3891.7 (a) Construct a stem-and-leaf plot of the data. 

(source: U.S. Department of Energy Web site, http://www.eia.doe. (b) Find the sample average and standard deviation. 

gov/emeu/international/contents.html#InternationalElectricity) 

Table 6-7 Champagne Sales in France 

Month 1962 1963 1964 1965 1966 1967 1968 1969 
Jan. 2.851 2.541 3.113 Sli) 3.633 4.016 2.639 3.934 
Feb. 2.672 2.475 3.006 3.088 4.292 3.957 2.899 3.162 
Mar. 2S) 3.031 4.047 3.718 4.154 4.510 3.370 4.286 
Apr. 2.721 3.266 3.523 4.514 4.121 4.276 3.740 4.676 
May 2.946 3.776 3037 4.520 4.647 4.968 2.927 5.010 
June 3.036 3.230 3.986 4.539 4.753 4.677 3.986 4.874 
July 2.282 3.028 3.260 3.663 3.965 3.523) 4.217 4.633 
Aug. 2.212 1.759 1.573 1.643 1.723 1.821 1.738 1.659 
Sept. DQ) 3.595 3.528 4.739 5.048 5.222) 5.221 5.591 
Oct. 4.301 4.474 5.211 5.428 6.922 6.873 6.424 6.981 
Noy. 5.764 6.838 7.614 8.314 9.858 10.803 9.842 9.851 
Dec 7.132 8.357 9.254 10.651 11.331 13.916 13.076 12.670 
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(c) Construct a time series plot of the data. Is there evidence that 
there was an increase or decrease in the average number of 
nonconforming springs made during the 40 days? Explain. 


6-94. A communication channel is being monitored by 
recording the number of errors in a string of 1000 bits. Data 
for 20 of these strings follow: 


Read data across. 
3 1 0 1 3 2 4 1 3 1 
1 1 2 3 3 2 0 2 0 1 


(a) Construct a stem-and-leaf plot of the data. 

(b) Find the sample average and standard deviation. 

(c) Construct a time series plot of the data. Is there evidence 
that there was an increase or decrease in the number of 
errors in a string? Explain. 


6-95. Reconsider the golf course yardage data in Exercise 
6-9. Construct a box plot of the yardages and write an inter- 
pretation of the plot. 


6-96. Reconsider the data in Exercise 6-88. Construct nor- 
mal probability plots for two groups of the data: the first 40 
and the last 40 observations. Construct both plots on the same 
axes. What tentative conclusions can you draw? 


6-97. Construct anormal probability plot of the effluent dis- 
charge temperature data from Exercise 6-92. Based on the 
plot, what tentative conclusions can you draw? 


6-98. Construct normal probability plots of the cold start 
ignition time data presented in Exercises 6-53 and 6-64. 
Construct a separate plot for each gasoline formulation, but 
arrange the plots on the same axes. What tentative conclusions 
can you draw? 


6-99. Reconsider the golf ball overall distance data in 
Exercise 6-33. Construct a box plot of the yardage distance and 
write an interpretation of the plot. How does the box plot com- 
pare in interpretive value to the original stem-and-leaf diagram? 


6-100. Transformations. In some data sets, a transformation 

by some mathematical function applied to the original data, 

such as Vy or log y, can result in data that are simpler to work 
with statistically than the original data. To illustrate the effect 
of a transformation, consider the following data, which repre- 

sent cycles to failure for a yarn product: 675, 3650, 175, 1150, 

290, 2000, 100, 375. 

(a) Construct a normal probability plot and comment on the 
shape of the data distribution. 

(b) Transform the data using logarithms; that is, let y* (new 
value) = log y (old value). Construct a normal probability 
plot of the transformed data and comment on the effect of 
the transformation. 


6-101. In 1879, A. A. Michelson made 100 determinations 
of the velocity of light in air using a modification of a method 
proposed by the French physicist Foucault. He made the 
measurements in five trials of 20 measurements each. The ob- 
servations (in kilometers per second) follow. Each value has 
299,000 subtracted from it. 


Trial 1 
850 900 930 950 980 
1000 930 760 1000 960 
740 1070 850 980 880 
980 650 810 1000 960 
Trial 2 
960 960 880 850 900 
830 810 880 800 760 
940 940 800 880 840 
790 880 830 790 800 
Trial 3 
880 880 720 620 970 
880 850 840 850 840 
880 860 720 860 950 
910 870 840 840 840 
Trial 4 
890 810 800 760 750 
910 890 880 840 850 
810 820 770 740 760 
920 860 720 850 780 
Trial 5 
890 780 760 790 820 
870 810 810 950 810 
840 810 810 810 850 
870 740 940 800 870 


The currently accepted true velocity of light in a vacuum is 
299,792.5 kilometers per second. Stigler (1977, The Annals of 
Statistics) reported that the “true” value for comparison to 
these measurements is 734.5. Construct comparative box plots 
of these measurements. Does it seem that all five trials are 
consistent with respect to the variability of the measurements? 
Are all five trials centered on the same value? How does each 
group of trials compare to the true value? Could there have 
been “startup” effects in the experiment that Michelson 
performed? Could there have been bias in the measuring 
instrument? 


6-102. In 1789, Henry Cavendish estimated the density of 
the earth by using a torsion balance. His 29 measurements 
follow, expressed as a multiple of the density of water. 


5.50 5.30 5.47 5.10 5.29 5.65 
9 5.61 313 5.63 5.27 5.44 
5.57 5.36 4.88 5.86 5.34 5.39 
5.34 2.3 5.29 4.07 5.85 5.46 
5.42 5.79 5.62 5.58 5.26 


(a) Calculate the sample mean, sample standard deviation, 
and median of the Cavendish density data. 


(b) Construct a normal probability plot of the data. Comment on 
the plot. Does there seem to be a “low” outlier in the data? 

(c) Would the sample median be a better estimate of the 
density of the earth than the sample mean? Why? 


6-103. In their book Introduction to Time Series Analysis 
and Forecasting (Wiley, 2008), Montgomery, Jennings, and 
Kulahci presented the data on the drowning rate for children 
between one and four years old per 100,000 of population in 
Arizona from 1970 to 2004. The data are: 19.9, 16.1, 19.5, 

19.8, 21.3, 15.0, 15.5, 16.4, 18.2, 15.3, 15.6, 19.5, 14.0, 13.1, 

10.5, 11.5, 12.9, 8.4, 9.2, 11.9, 5.8, 8.5, 7.1, 7.9, 8.0, 9.9, 8.5, 

9.1, 9.7, 6.2, 7.2, 8.7, 5.8, 5.7, and 5.2. 

(a) Perform an appropriate graphical analysis of the data. 

(b) Calculate and interpret the appropriate numerical sum- 
maries. 

(c) Notice that the rate appears to decrease dramatically start- 
ing about 1990. Discuss some potential reasons why this 
could have happened. 

(d) If there has been a real change in the drowning rate begin- 
ning about 1990, what impact does this have on the sum- 
mary statistics that you calculated in part (b)? 


6-104. Patients arriving at a hospital emergency department 
present a variety of symptoms and complaints. The following 
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data were collected during one weekend night shift (11:00 P.M. 
to 7:00 A.M.): 


Chest pain 8 
Difficulty breathing vi 
Numbness in extremities 3 
Broken bones 11 
Abrasions 16 
Cuts 21 
Stab wounds 9 
Gunshot wounds 4 
Blunt force trauma 10 
Fainting, loss of consciousness 5 
Other 9 


(a) Calculate numerical summaries of these data. What prac- 
tical interpretation can you give to these summaries? 

(b) Suppose that you knew that a certain fraction of these 
patients leave without treatment (LWOT). This is an im- 
portant problem, because these patients may be seriously 
ill or injured. Discuss what additional data you would re- 
quire to begin a study into the reasons why patients 
LWOT. 


MIND-EXPANDING EXERCISES 


6-105. Consider the airfoil data in Exercise 6-18. 
Subtract 30 from each value and then multiply the re- 
sulting quantities by 10. Now compute s* for the new 
data. How is this quantity related to s for the original 
data? Explain why. 

6-106. Consider the quantity >j_, (x; — a)?. For 
what value of a is this quantity minimized? 

6-107. Using the results of Exercise 6-106, which 
of the two quantities >j_, C= x) and 
Se = yp)? will be smaller, provided that x # w? 
6-108. Coding the Data. Let y, = a + bx;, i = 
1,2,...,n, where a and b are nonzero constants. Find the 
relationship between x and y, and between s, and s,. 


6-109. A sample of temperature measurements in a 
furnace yielded a sample average (°F) of 835.00 and a 
sample standard deviation of 10.5. Using the results 
from Exercise 6-108, what are the sample average and 
sample standard deviations expressed in °C? 

6-110. Consider the sample x,,%,...,x, with 
sample mean xX and sample standard deviation s. Let 
Z; = (x; — x)/s,i = 1, 2,..., n. What are the values of 
the sample mean and sample standard deviation of 
the z;? 


6-111. Anexperiment to investigate the survival time 
in hours of an electronic component consists of placing 
the parts in a test cell and running them for 100 hours 
under elevated temperature conditions. (This is called an 
“accelerated” life test.) Eight components were tested 
with the following resulting failure times: 


75, 63, 100°, 36, 51, 45, 80, 90 


The observation 100° indicates that the unit still func- 

tioned at 100 hours. Is there any meaningful measure of 

location that can be calculated for these data? What is its 

numerical value? 

6-112. Suppose that we have a sample x), x2, ..., x, 

and we have calculated x, and sx for the sample. Now 

an (n + 1)st observation becomes available. Let x, +, 

and s*,, be the sample mean and sample variance for 

the sample using all n + 1 observations. 

(a) Show how x, , can be computed using x,, and x,,,,. 

AXp+1 i mole 
n+1 

(c) Use the results of parts (a) and (b) to calculate the new 
sample average and standard deviation for the data of 
Exercise 6-30, when the new observation is x33, = 64. 


(b) Show that ns?,, = (n — 1)s? 4 
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MIND-EXPANDING EXERCISES 


6-113. Trimmed Mean. Suppose that the data are 
arranged in increasing order, 7% of the observations 
are removed from each end, and the sample mean of 
the remaining numbers is calculated. The resulting 
quantity is called a trimmed mean. The trimmed mean 
generally lies between the sample mean x and the 
sample median x. Why? The trimmed mean with a 
moderate trimming percentage (5% to 20%) is a rea- 
sonably good estimate of the middle or center. It is not 
as sensitive to outliers as the mean but more sensitive 
than the median. 


IMPORTANT TERMS AND CONCEPTS 


(a) Calculate the 10% trimmed mean for the yield data 
in Exercise 6-25. 

(b) Calculate the 20% trimmed mean for the yield data 
in Exercise 6-25 and compare it with the quantity 
found in part (a). 

(c) Compare the values calculated in parts (a) and (b) 
with the sample mean and median for the yield data. 
Is there much difference in these quantities? Why? 

6-114. Trimmed Mean. Suppose that the sample size n 

is such that the quantity 77/100 is not an integer. Develop 

a procedure for obtaining a trimmed mean in this case. 


Box plot Normal probability plot 
Frequency distribution Pareto chart 
and histogram Population mean 
Median, quartiles, and Population standard 
percentiles deviation 


Multivariable data Population variance 


Probability plot Sample variance 
Relative frequency Stem-and-leaf diagram 
distribution Time series plots 


Sample mean 
Sample standard 
deviation 
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Sampling Distributions and 
Point Estimation of Parameters 


INTRODUCTION 


Statistical methods are used to make decisions and draw conclusions about populations. 
This aspect of statistics is generally called statistical inference. These techniques utilize 
the information in a sample in drawing conclusions. This chapter begins our study of the 
statistical methods used in decision making. 

Statistical inference may be divided into two major areas: parameter estimation and 
hypothesis testing. As an example of a parameter estimation problem, suppose that an 
engineer is analyzing the tensile strength of a component used in an automobile chassis. 
Variability is naturally present between the individual components because of differences 
in raw material batches, manufacturing processes, and measurement procedures (for 
example), so the engineer wants to estimate the mean strength of the population of compo- 
nents. In practice, the engineer will use sample data to compute a number that is in some 
sense a reasonable value (a good guess) of the true population mean. This number is 
called a point estimate. We will see that there are procedures for developing point esti- 
mates of parameters that have good statistical properties. We will also be able to establish 
the precision of the point estimate. 

Now let's consider a different type of question. Suppose that two different reaction tem- 
peratures ¢, and f, can be used in a chemical process. The engineer conjectures that f, will 
result in higher yields than ft). Statistical hypothesis testing is the framework for solving 
problems of this type. In this example, the engineer would be interested in formulating 
hypotheses that allow him or her to demonstrate that the mean yield using ¢, is higher than 
the mean yield using f,. Notice that there is no emphasis on estimating yields; instead, the 
focus is on drawing conclusions about a hypothesis that is relevant to the engineering decision. 

This chapter and Chapter 8 discuss parameter estimation. Chapters 9 and 10 focus on 
hypothesis testing. 
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CHAPTER OUTLINE 
7-1 POINT ESTIMATION 7-3.3 Standard Error: Reporting a 
7-2 SAMPLING DISTRIBUTIONS pose Potumate 
AND THE CENTRAL LIMIT 7-3.4 Mean Squared Error of an Estimator 
THEOREM 
7-4 METHODS OF POINT ESTIMATION 
7-3. GENERAL CONCEPTS OF POINT 
ESTIMATION 7-4.1 Method of Moments 
7-3.1 Unbiased Estimators 7-4.2 Method of Maximum Likelihood 
7-3.2 Variance of a Point Estimator 7-4.3 Bayesian Estimation of Parameters 
LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 


1. Explain the general concepts of estimating the parameters of a population or a probability 
distribution 


. Explain the important role of the normal distribution as a sampling distribution 
. Understand the central limit theorem 


. Explain important properties of point estimators, including bias, variance, and mean square error 


nA BW WN 


. Know how to construct point estimators using the method of moments and the method of maximum 


likelihood 
6. Know how to compute and explain the precision with which a parameter is estimated 


7. Know how to construct a point estimator using the Bayesian approach 


7-1 POINT ESTIMATION 


Statistical inference is always focused on drawing conclusions about one or more parameters 
of a population. An important part of this process is obtaining estimates of the parameters. 
Suppose that we want to obtain a point estimate (a reasonable value) of a population param- 
eter. We know that before the data are collected, the observations are considered to be random 
variables, say, X,, X>,..., X,- Therefore, any function of the observation, or any statistic, is 
also a random variable. For example, the sample mean X and the sample variance S? are sta- 
tistics and they are also random variables. 

Since a statistic is a random variable, it has a probability distribution. We call the proba- 
bility distribution of a statistic a sampling distribution. The notion of a sampling distribution 
is very important and will be discussed and illustrated later in the chapter. 

When discussing inference problems, it is convenient to have a general symbol to represent 
the parameter of interest. We will use the Greek symbol 0 (theta) to represent the parameter. The 
symbol @ can represent the mean w, the variance o”, or any parameter of interest to us. The 
objective of point estimation is to select a single number, based on sample data, that is the most 
plausible value for 8. A numerical value of a sample statistic will be used as the point estimate. 

In general, if X is a random variable with probability distribution f(x), characterized by 
the unknown parameter 6, and if X), X>,...,X, is a random sample of size n from X, the 
statistic @ = h(X,, Xo, ..., X,) is called a point estimator of 6. Note that © is a random vari- 
able because it is a function of random variables. After the sample has been selected, © takes 
on a particular numerical value 6 called the point estimate of 0. 
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Point 
Estimator A point estimate of some population parameter @ is a single numerical value 6 of a 
statistic ©. The statistic © is called the point estimator. 


As an example, suppose that the random variable X is normally distributed with an un- 
known mean w. The sample mean is a point estimator of the unknown population mean w. 
That is, (1 = X. After the sample has been selected, the numerical value x is the point estimate 
of w. Thus, if x; = 25, x. = 30, x3 = 29, and x4 = 31, the point estimate of ws is 


_ 25 + 30 + 29 + 31 
oe: 


= 28.7 
1 8.75 


Similarly, if the population variance o7 is also unknown, a point estimator for o7 is the sample 
variance S*, and the numerical value s* = 6.9 calculated from the sample data is called the 
point estimate of 07. 

Estimation problems occur frequently in engineering. We often need to estimate 


e The mean p of a single population 

¢ The variance o” (or standard deviation o) of a single population 

e The proportion p of items in a population that belong to a class of interest 
e The difference in means of two populations, w, — wo 


¢ The difference in two population proportions, p; — p2 
Reasonable point estimates of these parameters are as follows: 


e For p, the estimate is f) = x, the sample mean. 


2 


° Foro”, the estimate is G* = s°, the sample variance. 


e For p, the estimate is p = x/n, the sample proportion, where x is the number of items 
in arandom sample of size n that belong to the class of interest. 


e For 1 — Mo, the estimate is (1; — fiz = xX; — X, the difference between the sample 
means of two independent random samples. 


e For p; — po, the estimate is p,; — po, the difference between two sample proportions 
computed from two independent random samples. 


We may have several different choices for the point estimator of a parameter. For example, if 
we wish to estimate the mean of a population, we might consider the sample mean, the sample me- 
dian, or perhaps the average of the smallest and largest observations in the sample as point esti- 
mators. In order to decide which point estimator of a particular parameter is the best one to use, 
we need to examine their statistical properties and develop some criteria for comparing estimators. 


7-2 SAMPLING DISTRIBUTIONS AND THE CENTRAL 
LIMIT THEOREM 


Statistical inference is concerned with making decisions about a population based on the 
information contained in a random sample from that population. For instance, we may be 
interested in the mean fill volume of a can of soft drink. The mean fill volume in the 
population is required to be 300 milliliters. An engineer takes a random sample of 25 cans 
and computes the sample average fill volume to be x = 298 milliliters. The engineer will 
probably decide that the population mean is », = 300 milliliters, even though the sample mean 
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Random 
Sample 


Statistic 


Sampling 
Distribution 


was 298 milliliters because he or she knows that the sample mean is a reasonable estimate of 
ww and that a sample mean of 298 milliliters is very likely to occur, even if the true population 
mean is p» = 300 milliliters. In fact, if the true mean is 300 milliliters, tests of 25 cans made 
repeatedly, perhaps every five minutes, would produce values of x that vary both above and 
below w = 300 milliliters. 

The link between the probability models in the earlier chapters and the data is made as fol- 
lows. Each numerical value in the data is the observed value of a random variable. Furthermore, 
the random variables are usually assumed to be independent and identically distributed. These 
random variables are known as a random sample. 


The random variables X,, X>,..., X,, are a random sample of size n if (a) the X;’s are 
independent random variables, and (b) every X; has the same probability distribution. 


The observed data are also referred to as a random sample, but the use of the same phrase 
should not cause any confusion. 

The assumption of a random sample is extremely important. If the sample isn’t random 
and is based on judgment or flawed in some other way, then statistical methods will not work 
properly and will lead to incorrect decisions. 

The primary purpose in taking a random sample is to obtain information about the unknown 
population parameters. Suppose, for example, that we wish to reach a conclusion about the pro- 
portion of people in the United States who prefer a particular brand of soft drink. Let p represent 
the unknown value of this proportion. It is impractical to question every individual in the popu- 
lation to determine the true value of p. In order to make an inference regarding the true propor- 
tion p, a more reasonable procedure would be to select a random sample (of an appropriate size) 
and use the observed proportion p of people in this sample favoring the brand of soft drink. 

The sample proportion, p, is computed by dividing the number of individuals in the sample 
who prefer the brand of soft drink by the total sample size n. Thus, p is a function of the 
observed values in the random sample. Since many random samples are possible from a popu- 
lation, the value of p will vary from sample to sample. That is, p is a random variable. Such a 
random variable is called a statistic. 


A statistic is any function of the observations in a random sample. 


We have encountered statistics before. For example, if X, X>, ..., X,, is arandom sample of 
size n, the sample mean X, the sample variance S”, and the sample standard deviation S 
are statistics. Since a statistic is a random variable, it has a probability distribution. 


The probability distribution of a statistic is called a sampling distribution. 


For example, the probability distribution of X is called the sampling distribution of the 
mean. The sampling distribution of a statistic depends on the distribution of the population, 
the size of the sample, and the method of sample selection. We now present perhaps the most 
important sampling distribution. Other sampling distributions and their applications will be 
illustrated extensively in the following two chapters. 


Central Limit 
Theorem 
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Consider determining the sampling distribution of the sample mean X . Suppose that a ran- 
dom sample of size n is taken from a normal population with mean w and variance o*. Now 
each observation in this sample, say, X,, X>, ... , X,,, is anormally and independently distributed 
random variable with mean wp and variance o*. Then, because linear functions of independent, 
normally distributed random variables are also normally distributed (Chapter 5), we conclude 
that the sample mean 


_ X+X t+ +X, 
Y= 1 2— n 


has a normal distribution with mean 


ue en 
Bex = n — pe 


and variance 


If we are sampling from a population that has an unknown probability distribution, the 
sampling distribution of the sample mean will still be approximately normal with mean pw and 
variance o”/n, if the sample size n is large. This is one of the most useful theorems in statis- 
tics, called the central limit theorem. The statement is as follows: 


If X,, X>,..., X, is a random sample of size n taken from a population (either finite 
or infinite) with mean p and finite variance o*, and if X is the sample mean, the 
limiting form of the distribution of 


_X=h 


aia 


7-1) 


as n —> ©, is the standard normal distribution. 


The normal approximation for X depends on the sample size n. Figure 7-1(a) shows the 
distribution obtained for throws of a single, six-sided true die. The probabilities are equal 
(1/6) for all the values obtained, 1, 2, 3, 4, 5, or 6. Figure 7-1(b) shows the distribution of the 
average score obtained when tossing two dice, and Fig. 7-1(c), 7-1(d), and 7-1(e) show the 
distributions of average scores obtained when tossing three, five, and ten dice, respectively. 
Notice that, while the population (one die) is relatively far from normal, the distribution of 
averages is approximated reasonably well by the normal distribution for sample sizes as small 
as five. (The dice throw distributions are discrete, however, while the normal is continuous.) 
Although the central limit theorem will work well for small samples (n = 4, 5) in most cases, 
particularly where the population is continuous, unimodal, and symmetric, larger samples will 
be required in other situations, depending on the shape of the population. In many cases of 
practical interest, if n = 30, the normal approximation will be satisfactory regardless of the 
shape of the population. If n < 30, the central limit theorem will work if the distribution of the 
population is not severely nonnormal. 
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(a) One die 
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(b) Two dice 
ial | ta 
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(c) Three dice 
lll, 
Figure 7-1 1 2 3 4 5 6 x 
Distributions of average (d) Five dice 
scores from throwing 
dice. [Adapted with 
Hunter, and Hunter 1 2 3 4 5 6 x 
(1978).] (e) Ten dice 


EXAMPLE 7-1 Resistors 
An electronics company manufactures resistors that have a 
mean resistance of 100 ohms and a standard deviation of 
10 ohms. The distribution of resistance is normal. Find the 
probability that a random sample of n = 25 resistors will have 
an average resistance less than 95 ohms. __ 

Note that the sampling distribution of X is normal, with 
mean px = 100 ohms and a standard deviation of 

— « 10 | > 
Ein AIR 

Therefore, the desired probability corresponds to the shaded 
area in Fig. 7-2. Standardizing the point X = 95 in Fig. 7-2, 
we find that 


and therefore, 
P(X < 95) = P(Z< —2.5) 
= 0.0062 

Practical Conclusion: This example show that if the dis- 
tribution of resistance is normal with mean 100 ohms and 
standard deviation of 10 ohms, then finding that a random 
sample of resistors with a sample mean smaller than 95 ohms 
is a rare event. If this actually happen, it caste doubt as to 
whether the true mean is really 100 ohms or if the true stan- 
dard deviation is really 10 ohms. 


The following example makes use of the central limit theorem. 


EXAMPLE 7-2 Central Limit Theorem 


Suppose that a random variable X has a continuous uniform 
distribution 


1/72, 45x=6 
soy = {¥ : 


0, otherwise 


Find the distribution of the sample mean of a random sample 
of size n = 40. 


The mean and variance of X are w=5 and 
o? = (6 — 4/12 = 1/3. The central limit theorem indicates 
that the distribution of X is approximately normal with mean 
wx = 5 and variance of = o?/n = 1/[3(40)] = 1/120. The 
distributions of X and X are shown in Fig. 7-3. 


Now consider the case in which we have two independent populations. Let the first pop- 
ulation have mean w, and variance o7 and the second population have mean ., and variance 
03. Suppose that both populations are normally distributed. Then, using the fact that linear 
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95 100 
Figure 7-2. Probability for Example 7-1. 


4 5 6 


x 
Figure 7-3 The distributions of X and X 
for Example 7-2. 


R] 


combinations of independent normal random variables follow a normal distribution (see 
Chapter 5), we can say that the sampling distribution of X, — X is normal with mean 


and variance 


by, -x, = Bx, — Bx, = Mi — Be (7-2) 
2, 2 
2 2 2 OT 07 
O2,-2,- 94 1 9s = a a (7-3) 


If the two populations are not normally distributed and if both sample sizes n, and ny are 
greater than 30, we may use the central limit theorem and assume that X, and X, follow 
approximately independent normal distributions. Therefore, the sampling distribution of 
X, — X> is approximately normal with mean and variance given by Equations 7-2 and 7-3, 
respectively. If either n, or n is less than 30, the sampling distribution of X, — X> will still be 
approximately normal with mean and variance given by Equations 7-2 and 7-3, provided that 
the population from which the small sample is taken is not dramatically different from the 
normal. We may summarize this with the following definition. 


Approximate 
Sampling 
Distribution of 
a Difference in 
Sample Means 


If we have two independent populations with means 2, and 2 and variances of and 
o3, and if X, and X> are the sample means of two independent random samples of 
sizes n, and ny from these populations, then the sampling distribution of 


GG (i) 


Zi 


(7-4) 


standard normal. 


is approximately standard normal, if the conditions of the central limit theorem 
apply. If the two populations are normal, the sampling distribution of Z is exactly 


Vo02/n ar 03/n> 


EXAMPLE 7-3 Aircraft Engine Life 

The effective life of a component used in a jet-turbine aircraft 
engine is a random variable with mean 5000 hours and stan- 
dard deviation 40 hours. The distribution of effective life is 
fairly close to a normal distribution. The engine manufacturer 


introduces an improvement into the manufacturing process 
for this component that increases the mean life to 5050 hours 
and decreases the standard deviation to 30 hours. Suppose 
that a random sample of n, = 16 components is selected from 
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Figure 7-4 The 

sampling distribution 

of X, — X,in 

Example 7-3. 0) 25 50 75 


the “old” process and a random sample of n. = 25 components 
is selected from the “improved” process. What is the probabil- 
ity that the difference in the two sample means X, — X; is at 
least 25 hours? Assume that the old and improved processes 
can be regarded as independent populations. 

To solve this problem, we first note that the distribution of 
X, is normal with mean w., = 5000 hours and standard devia- 
tion o,/Vn, = 40/V16 = 10 hours, and the distribution of 
X, is normal with mean yw. = 5050 hours and standard devia- 
tion o5/Vny = 30/V25 = 6 hours. Now the distribution of 
X, — X, is normal with mean p. — py; = 5050 — 5000 = 50 
hours and variance o3/n, + o7/n, = (6)? + (10)? = 136 
hours”. This sampling distribution is shown in Fig. 7-4. The 
probability that X, — X, = 25 is the shaded portion of the 
normal distribution in this figure. 


EXERCISES FOR SECTION 7-2 
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100 %-x, 


Corresponding to the value x, — x, = 25 in Fig. 7-4, we 
find that 
— 25-50 _ 


Z = = 22.14 
V 136 


and consequently, 


P(X, — X, = 25) = P(Z = -2.14) 
= 0.9838 


Therefore, there is a high probability (0.9838) that the differ- 
ence in sample means between the new and the old process 
will be at least 25 hours if the sample sizes are n, = 16 and 
Ny = 25. 


7-1. Consider the hospital emergency room data from 
Exercise 6-104. Estimate the proportion of patients who arrive 
at this emergency department experiencing chest pain. 


7-2. Consider the compressive strength data in Table 6-2. 
What proportion of the specimens exhibit compressive 
strength of at least 200 psi? 


7-3. PVC pipe is manufactured with a mean diameter of 
1.01 inch and a standard deviation of 0.003 inch. Find the 
probability that a random sample of n = 9 sections of pipe 
will have a sample mean diameter greater than 1.009 inch and 
less than 1.012 inch. 


7-4. Suppose that samples of size n = 25 are selected at 
random from a normal population with mean 100 and standard 
deviation 10. What is the probability that the sample mean falls 
in the interval from wx — 1.80% to pe + 1.00%? 

7-5. Asynthetic fiber used in manufacturing carpet has ten- 
sile strength that is normally distributed with mean 75.5 psi 
and standard deviation 3.5 psi. Find the probability that a ran- 
dom sample of n = 6 fiber specimens will have sample mean 
tensile strength that exceeds 75.75 psi. 

7-6. Consider the synthetic fiber in the previous exercise. 
How is the standard deviation of the sample mean changed 
when the sample size is increased from n = 6 ton = 49? 
7-7. The compressive strength of concrete is normally 
distributed with ~ = 2500 psi and o = 50 psi. Find the prob- 
ability that a random sample of n = 5 specimens will have a 


sample mean diameter that falls in the interval from 2499 psi 
to 2510 psi. 

7-8. Consider the concrete specimens in the previous 
exercise. What is the standard error of the sample mean? 


7-9. A normal population has mean 100 and variance 25. 
How large must the random sample be if we want the standard 
error of the sample average to be 1.5? 

7-10. Suppose that the random variable X has the continu- 
ous uniform distribution 


F(a) {a O<x<1 
x) = 
0, otherwise 
Suppose that a random sample of n = 12 observations is 
selected from this distribution. What is the approximate 
probability distribution of X — 6? Find the mean and vari- 
ance of this quantity. 
7-11. Suppose that X has a discrete uniform distribution 
7 = 1,2,3 
= 35 Xt 24 
FQ) { 0, otherwise 
A random sample of n = 36 is selected from this population. 
Find the probability that the sample mean is greater than 2.1 
but less than 2.5, assuming that the sample mean would be 
measured to the nearest tenth. 


7-12. The amount of time that a customer spends waiting 
at an airport check-in counter is a random variable with mean 
8.2 minutes and standard deviation 1.5 minutes. Suppose that 
a random sample of n = 49 customers is observed. Find the 
probability that the average time waiting in line for these 
customers is 

(a) Less than 10 minutes 

(b) Between 5 and 10 minutes 

(c) Less than 6 minutes 


7-13. A random sample of size n, = 16 is selected from a 
normal population with a mean of 75 and a standard deviation 
of 8. A second random sample of size n, = 9 is taken from 
another normal population with mean 70 and standard devia- 
tion 12. Let X, and X, be the two sample means. Find: 

(a) The probability that X, — X> exceeds 4 

(b) The probability that 3.5 = X, — X, = 5.5 
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7-14. A consumer electronics company is comparing the 
brightness of two different types of picture tubes for use in its 
television sets. Tube type A has mean brightness of 100 and 
standard deviation of 16, while tube type B has unknown 
mean brightness, but the standard deviation is assumed to be 
identical to that for type A. A random sample of n = 25 tubes 
of each type is selected, and X, — X,q is computed. If pp 
equals or exceeds jta, the manufacturer would like to adopt 
type B for use. The observed difference is x, — x, = 3.5. 
What decision would you make, and why? 


7-15. The elasticity of a polymer is affected by the concen- 
tration of a reactant. When low concentration is used, the true 
mean elasticity is 55, and when high concentration is used the 
mean elasticity is 60. The standard deviation of elasticity is 4, 
regardless of concentration. If two random samples of size 16 
are taken, find the probability that Xhign — Xow = 2. 
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7-3.1 Unbiased Estimators 


An estimator should be “close” in some sense to the true value of the unknown parameter. 
Formally, we say that @ is an unbiased estimator of 0 if the expected value of @ is equal to 0. 
This is equivalent to saying that the mean of the probability distribution of © (or the mean of 
the sampling distribution of @) is equal to 0. 


Bias of 
an Estimator 


The point estimator © is an unbiased estimator for the parameter 0 if 


If the estimator is not unbiased, then the difference 


is called the bias of the estimator 0. 


(7-5) 


E(O) — 6 (7-6) 


When an estimator is unbiased, the bias is zero; that is, E(O) —-08=0. 


EXAMPLE 7-4 Sample Mean and Variance Are Unbiased 


Suppose that X is a random variable with mean p and variance 
o*. Let X), X>,..., X, be a random sample of size n from the 
population represented by X. Show that the sample mean X 
and sample variance S? are unbiased estimators of w and 0°, 
respectively. 


First consider the sample mean. In Section 5.5 in Chapter 5, 


we showed that E(X) = yp. Therefore, the sample mean X is an 
unbiased estimator of the population mean p. 
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Now consider the sample variance. We have The last equality follows the equation for the mean of a linear 
sh function in Chapter 5. However, since E(X?) = wp? + o” and 
> &@ = xy , : E(X’) = w? + o7/n, we have 
i=1 = 
E(S*) = E = EY = Ay n 
= = , i 1 
n 1 n 1 i=l E(S?) = aor y (uw? ae 0°) = n(w> a o?/n) 
i=1 
1 ae 4 2 
= = . 1 
f1 2 +X 2X Xj) _ (np2 + no? — np2 — 0?) = 2 
i 
1 . = : : : : 
= i E ( X ? — nX ») Therefore, the sample variance S 2 is an unbiased estimator of 
n— i=1 2. 


the population variance o~. 


Although S” is unbiased for 0”, S is a biased estimator of o. For large samples, the bias is very 
small. However, there are good reasons for using S as an estimator of o in samples from _nor- 
mal distributions, as we will see in the next three chapters when we discuss confidence 
intervals and hypothesis testing. 

Sometimes there are several unbiased estimators of the sample population parameter. For 
example, suppose we take a random sample of size n = 10 from a normal population and 
obtain the data x, = 12.8, x, = 9.4, x3 = 8.7, x4 = 11.6, x5 = 13.1, x6 = 9.8, x, = 14.1, 
Xg = 8.5, X9 = 12.1, x19 = 10.3. Now the sample mean is 


— 1284944 87+ 11.6 4+ 13.1 + 9.8 + 14.1 + 8.5 + 12.1 + 10.3 


x 10 = 11.04 
the sample median is 
10.3 + 11.6 
X= aa an 10.95 


and a 10% trimmed mean (obtained by discarding the smallest and largest 10% of the sample 
before averaging) is 


_ 8.7+ 9.4 + 9.8 + 10.3 + 11.6 + 12.1 + 12.8 + 13.1 
Xn(10) = 8 = 10.98 


We can show that all of these are unbiased estimates of jw. Since there is not a unique unbiased 
estimator, we cannot rely on the property of unbiasedness alone to select our estimator. We 
need a method to select among unbiased estimators. We suggest a method in the following 
section. 


7-3.2 Variance of a Point Estimator 


Suppose that @, and @, are unbiased estimators of 8. This indicates that the distribution of 
each estimator is centered at the true value of 8. However, the variance of these distributions 
may be different. Figure 7-5 illustrates the situation. Since @, has a smaller variance than O,, 
the estimator @, is more likely to produce an estimate close to the true value 0. A logical prin- 
ciple of estimation, when selecting among several estimators, is to choose the estimator that 
has minimum variance. 


Figure 7-5 The 
sampling distributions 
of two unbiased estima- 
tors ©, and 0). 


Minimum 
Variance 
Unbiased 
Estimator 
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AN 
Distribution of ©, 


A 
Distribution of ©, 


If we consider all unbiased estimators of 8, the one with the smallest variance is 
called the minimum variance unbiased estimator (MVUE). 


In a sense, the MVUE is most likely among all unbiased estimators to produce an estimate 6 
that is close to the true value of 0. It has been possible to develop methodology to identify the 
MVUE in many practical situations. While this methodology is beyond the scope of this book, 
we give one very important result concerning the normal distribution. 


If X;, Xz, ..., X, is arandom sample of size n from a normal distribution with mean 
yw. and variance o”, the sample mean X is the MVUE for p. 


In situations in which we do not know whether an MVUE exists, we could still use a mini- 
mum variance principle to choose among competing estimators. Suppose, for example, we 
wish to estimate the mean of a population (not necessarily a normal population). We have a 
random sample of n observations X,, X5, ..., X,, and we wish to compare two possible esti- 
mators for w: the sample mean X and a single observation from the sample, say, X;. Note that 
both X and X; are unbiased estimators of 1; for the sample mean, we have V(X y= o?/n 
from Chapter 5 and the variance of any observation is V(X;) = 07. Since V(X) < V(X;) for 
sample sizes n = 2, we would conclude that the sample mean is a better estimator of » than 
a single observation X;. 


7-3.3 Standard Error: Reporting a Point Estimate 


Standard 
Error of an 
Estimator 


When the numerical value or point estimate of a parameter is reported, it is usually desirable 
to give some idea of the precision of estimation. The measure of precision usually employed 
is the standard error of the estimator that has been used. 


The standard error of an estimator © is its standard deviation, given by 
o@ = V V(@). If the standard error involves unknown parameters that can be esti- 
mated, substitution of those values into o @ produces an estimated standard error, 


denoted by 4. 
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Sometimes the estimated standard error is denoted by sg or se(@). 

Suppose we are sampling from a normal distribution with mean p and variance o°. 
Now the distribution of X is normal with mean wp and variance o*/n, so the standard error 
of X is 


Ox Vn 


If we did not know o but substituted the sample standard deviation S into the above equation, 
the estimated standard error of X would be 


When the estimator follows a normal distribution, as in the above situation, we can be rea- 
sonably confident that the true value of the parameter lies within two standard errors of the 
estimate. Since many point estimators are normally distributed (or approximately so) for large 
n, this is a very useful result. Even in cases in which the point estimator is not normally 
distributed, we can state that so long as the estimator is unbiased, the estimate of the parameter 
will deviate from the true value by as much as four standard errors at most 6 percent of the time. 
Thus a very conservative statement is that the true value of the parameter differs from the point 
estimate by at most four standard errors. See Chebyshev’s inequality in the supplemental mate- 
rial on the Web site. 


EXAMPLE 7-5 Thermal Conductivity 

An article in the Journal of Heat Transfer (Trans. ASME, Sec. C, 
96, 1974, p. 59) described a new method of measuring the 
thermal conductivity of Armco iron. Using a temperature of 
100°F and a power input of 550 watts, the following 10 
measurements of thermal conductivity (in Btu/hr-ft-°F) were 
obtained: 


41.60, 41.48, 42.34, 41.95, 41.86, 
42.18, 41.72, 42.26, 41.81, 42.04 


A point estimate of the mean thermal conductivity at 100°F 
and 550 watts is the sample mean or 


x = 41.924 Btu/hr-ft-°F 


7-3.4 Mean Squared Error of an Estimator 


The standard error of the sample mean is ox = o/ Vn, and 
since O is unknown, we may replace it by the sample standard 
deviation s = 0.284 to obtain the estimated standard error of 
X as 


ag = 2 = 2 = 0.0808 
°X Vn VIO 


Practical Interpretation: Notice that the standard error is about 
0.2 percent of the sample mean, implying that we have obtained 
a relatively precise point estimate of thermal conductivity. If we 
can assume that thermal conductivity is normally distributed, 
2 times the standard error is 26 ¢ = 2(0.0898) = 0.1796, and 
we are highly confident that the true mean thermal conductiv- 
ity is within the interval 41.924 + 0.1796, or between 41.744 
and 42.104. 


Mean Squared 
Error of 
an Estimator 


Sometimes it is necessary to use a biased estimator. In such cases, the mean squared error of 
the estimator can be important. The mean squared error of an estimator © is the expected 
squared difference between © and 0. 


The mean squared error of an estimator © of the parameter 0 is defined as 


MSE(®) = E(0 — 6) Gea 


Figure 7-6 A biased 
estimator ©, that has 
smaller variance than 
the unbiased estimator 


>. 
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A 
Distribution of ©; 


A 
Distribution of Og 


8 5(0,) 


The mean squared error can be rewritten as follows: 


ey 


MSE(®) = E[6 — E(6)}? + [0 — E(6)/? 


That is, the mean squared error of @ is equal to the variance of the estimator plus the squared 
bias. If © is an unbiased estimator of 0, the mean squared error of @ is equal to the variance 
of ©. 

The mean squared error is an important criterion for comparing two estimators. Let 
0, and 0, be two estimators of the parameter 8, and let MSE (@,) and MSE (O) be the 
mean squared errors of 0, and 0. Then the relative efficiency of 0, to 0, is defined as 


MSE(0,) 
MSE(@,) uo) 
If this relative efficiency is less than 1, we would conclude that On is a more efficient estima- 
tor of 6 than @,, in the sense that it has a smaller mean squared error. 

Sometimes we find that biased estimators are preferable to unbiased estimators because 
they have smaller mean squared error. That is, we may be able to reduce the variance of the 
estimator considerably by introducing a relatively small amount of bias. As long as the reduc- 
tion in variance is greater than the squared bias, an improved estimator from a mean squared 
error viewpoint will result. For example, Fig. 7-6 shows the probability distribution of a biased 
estimator 0, that has a smaller variance than the unbiased estimator 05. An estimate based on 
0, would more likely be close to the true value of @ than would an estimate based on >. 
Linear regression analysis (Chapters 11 and 12) is an example of an application area in which 
biased estimators are occasionally used. 

An estimator © that has a mean squared error that is less than or equal to the mean 
squared error of any other estimator, for all values of the parameter 9, is called an optimal 
estimator of 6. Optimal estimators rarely exist. 


EXERCISES FOR SECTION 7-3 


7-16. A computer software package was used to calculate 
some numerical summaries of a sample of data. The results are 
displayed here: 


Variable | N | Mean | SE Mean | StDev | Variance 
lisie | 2 


x [20] 50184] 2 


(a) Fill in the missing quantities. 
(b) What is the estimate of the mean of the population from 
which this sample was drawn? 


7-17. A computer software package was used to calculate 
some numerical summaries of a sample of data. The results are 
displayed here: 


SE Sum of 
Variable| N | Mean| Mean| StDev| Variance} Sum |Squares 
x |?! 2 |205l10251 2 3761.70! 2 


(a) Fill in the missing quantities. 
(b) What is the estimate of the mean of the population from 
which this sample was drawn? 
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7-18. Let X, and X, be independent random variables with 
mean pw and variance a. Suppose that we have two estimators 
of ws: 


e X, +X 
0, = 
: 2 
and 
a X, + 3X 
0, = ! z 
4 


(a) Are both estimators unbiased estimators of j? 
(b) What is the variance of each estimator? 


7-19. Suppose that we have a random sample X,, X>,..., X,, 
from a population that is N(u, 0°). We plan to use 
O= di. \(X; — X)°/c to estimate o*. Compute the bias in (6) 
as an estimator of o” as a function of the constant c. 

7-20. Suppose we have a random sample of size 2n from a 
population denoted by X, and E(X) = p. and V(X) = o”. Let 


= 1 2n = 1 2 
X=— XX, ad Y=- dy, 
1~ 5) Pa j an 2 >> i 


be two estimators of jt. Which is the better estimator of w? 
Explain your choice. 
7-21. Let Xj, X5,...,X, denote a random sample from a 


population having mean yw and variance o”. Consider the 
following estimators of w: 


6, Xt Xt + Xy 
7 
ia —Xet+ X, 
2 
(a) Is either estimator unbiased? 

(b) Which estimator is best? In what sense is it best? 
Calculate the relative efficiency of the two estimators. 
7-22. Suppose that ©, and @, are unbiased estimators of the 
parameter 6. We know that V(@,) = 10 and V(@,) = 4. 
Which estimator is best and in what sense is it best? Calculate 

the relative efficiency of the two estimators. 

7-23. Suppose that ©, and ©, are estimators of the parame- 
ter 0. We know that E(@,) = 0, E(@) = 0/2, V(@;) = 10, 
V(@ ) = 4. Which estimator is best? In what sense is it best? 
7-24. Suppose that ©,, @5, and ©; are estimators of 6. We 
know that E(0,) = E(O,) = 0, E(O3) # 0, V(O,) = 12, 
V(®,) = 10, and E(®; — 6)* = 6. Compare these three 
estimators. Which do you prefer? Why? 

7-25. Let three random samples of sizes n, = 20, n. = 10, 
and n; = 8 be taken from a population with mean p and 
variance a. Let Si, S3, and S3 be the sample variances. 
Show that S? = (20S7 + 1053 + 853)/38 is an unbiased 
estimator of 0”. 


7-26. (a) Show that oe (X, — X)’/n is a biased estima- 
tor of o*. 
(b) Find the amount of bias in the estimator. 


(c) What happens to the bias as the sample size n increases? 


7-27. Let X,, Xo, ..., X, be arandom sample of size n from 
a population with mean p. and variance o”. 

(a) Show that X? is a biased estimator for pw. 

(b) Find the amount of bias in this estimator. 

(c) What happens to the bias as the sample size n increases? 


7-28. Data on pull-off force (pounds) for connectors used in 
an automobile engine application are as follows: 79.3, 75.1, 
78.2, 74.1, 73.9, 75.0, 77.6, 77.3, 73.8, 74.6, 75.5, 74.0, 74.7, 
75.9, 72.9, 73.8, 74.2, 78.1, 75.4, 76.3, 75.3, 76.2, 74.9, 78.0, 
75.1, 76.8. 

(a) Calculate a point estimate of the mean pull-off force of all 
connectors in the population. State which estimator you 
used and why. 

(b) Calculate a point estimate of the pull-off force value that 
separates the weakest 50% of the connectors in the popu- 
lation from the strongest 50%. 

(c) Calculate point estimates of the population variance and 
the population standard deviation. 

(d) Calculate the standard error of the point estimate found in 
part (a). Provide an interpretation of the standard error. 

(e) Calculate a point estimate of the proportion of all con- 
nectors in the population whose pull-off force is less than 
73 pounds. 

7-29. Data on oxide thickness of semiconductors are as 

follows: 425, 431, 416, 419, 421, 436, 418, 410, 431, 433, 423, 

426, 410, 435, 436, 428, 411, 426, 409, 437, 422, 428, 413, 416. 

(a) Calculate a point estimate of the mean oxide thickness for 
all wafers in the population. 

(b) Calculate a point estimate of the standard deviation of 
oxide thickness for all wafers in the population. 

(c) Calculate the standard error of the point estimate from 
part (a). 

(d) Calculate a point estimate of the median oxide thickness 
for all wafers in the population. 

(e) Calculate a point estimate of the proportion of wafers in 
the population that have oxide thickness greater than 430 
angstroms. 


7-30. Suppose that X is the number of observed “successes” 

in a sample of n observations, where p is the probability of 

success on each observation. 

(a) Show that P = X/nis an unbiased estimator of D. 

(b) Show that the standard error of P is Vp(1 — p)/n. 
How would you estimate the standard error? 


7-31. Xj, and S? are the sample mean and sample variance 
from a population with mean ., and variance 7. Similarly, X, 
and $3 are the sample mean and sample variance from a sec- 
ond independent population with mean « and variance o3. 


The sample sizes are n, and n», respectively. 


(a) Show that X,— X>, is an unbiased estimator of wy — pL. 


(b) Find the standard error of X; — X>. How could you 
estimate the standard error? 

(c) Suppose that both populations have the same variance; that 
is, of = 03 = o”. Show that 


(nm — 1)S7 + (m — 1)83 
ny +n, — 2 


So= 


is an unbiased estimator of 0”. 


7-32. Two different plasma etchers in a semiconductor fac- 
tory have the same mean etch rate jz. However, machine 1| is 
newer than machine 2 and consequently has smaller variabil- 
ity in etch rate. We know that the variance of etch rate for 
machine 1 is oj and for machine 2 it is o5 = aoy. Suppose that 
we have n, independent observations on etch rate from 
machine | and n, independent observations on etch rate from 
machine 2. 

(a) Show that f. = a X, + (1 — a) X> is an unbiased estima- 

tor of w for any value of a between 0 and 1. 

(b) Find the standard error of the point estimate of py. in part (a). 
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(c) What value of a would minimize the standard error of 
the point estimate of j.? 

(d) Suppose that a = 4 and n, = 25. What value of a would 
you select to minimize the standard error of the point esti- 
mate of 1? How “bad” would it be to arbitrarily choose 
a = 0.5 in this case? 

7-33. Ofn, randomly selected engineering students at ASU, 

X, owned an HP calculator, and of n. randomly selected 

engineering students at Virginia Tech, X, owned an HP calcu- 

lator. Let p, and p, be the probability that randomly selected 

ASU and Virginia. Tech engineering students, respectively, 

own HP calculators. 

(a) Show that an unbiased estimate for p, — pp is (X,/n,) — 
(X2/ Ny). 

(b) What is the standard error of the point estimate in part (a)? 

(c) How would you compute an estimate of the standard error 
found in part (b)? 

(d) Suppose that n, = 200, X, = 150, n. = 250, and X, = 185. 
Use the results of part (a) to compute an estimate of p, — p>. 

(e) Use the results in parts (b) through (d) to compute an 
estimate of the standard error of the estimate. 


The definitions of unbiasedness and other properties of estimators do not provide any guid- 
ance about how good estimators can be obtained. In this section, we discuss methods for 
obtaining point estimators: the method of moments and the method of maximum likelihood. 
We also briefly discuss a Bayesian approach to parameter estimation. Maximum likelihood 
estimates are generally preferable to moment estimators because they have better efficiency 
properties. However, moment estimators are sometimes easier to compute. Both methods can 
produce unbiased point estimators. 


7-4.1 Method of Moments 


Moments 


The general idea behind the method of moments is to equate population moments, which are 
defined in terms of expected values, to the corresponding sample moments. The population 
moments will be functions of the unknown parameters. Then these equations are solved to 
yield estimators of the unknown parameters. 


Let X,, X>,..., X, be a random sample from the probability distribution f(x), where 
f(x) can be a discrete probability mass function or a continuous probability density 
function. The kth population moment (or distribution moment) is E(X"), k = 
1, 2, .... The corresponding kth sample moment is (1/n) S;_, X*, k = 1,2,.... 
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To illustrate, the first population moment is E(X) = p, and the first sample moment is 
(1/n) Sj, X; = X. Thus by equating the population and sample moments, we find that 

j. = X. That is, the sample mean is the moment estimator of the population mean. In the 
general case, the population moments will be functions of the unknown parameters of the 
distribution, say, 0), 0, ..., On. 


Moment 
Estimators Let X,, X>,..., X, be a random sample from either a probability mass function or a 
probability denis function with m unknown parameters 6), 05,...,9,,. The 
moment estimators 0), 0,...,©,, are found by equating the first m population 


moments to the first m sample TORR and solving the resulting equations for the 
unknown parameters. 


EXAMPLE 7-6 Exponential Distribution Moment Estimator 

Suppose that X,, X>,..., X, is arandom sample from an expo- sy a EX ) = 1/A. Therefore E(X) = X results in 
nential distribution with parameter A. Now there is only one 1/A = X,so\ = 1/X is the moment estimator of A. 
parameter to estimate, so we must equate E(X) to X. For the 


As an example, suppose that the time to failure of an electronic module used in an automobile 
engine controller is tested at an elevated temperature to accelerate the failure mechanism. 
The time to failure is exponentially distributed. Eight units are randomly selected and 
tested, resulting in the following failure time (in hours): x, = 11.96, x. = 5.03, x; = 67.40, 
4 = 16.07, x5 = 31.50, x6 = 7.73, x7 = 11.10, and xg = 22.38. Because x = 21.65, the 
moment estimate of A is A = 1/X = 1/21.65 = 0.0462. 


EXAMPLE 7-7 Normal Distribution Moment Estimators 


Suppose that X,, X5,... , X,, is arandom sample from a normal Solving these equations gives the moment estimators 
distribution with parameters y and o”. For the normal distribu- i : 4 7 
tion, E(X) = wand E(X’) = p2 + o?. Equating E(X)to X and ») x= o( 3 Ss x,) > &— xP 
E(X?) to} SX? gives 5 =X a2 _ i= ” FI eee el 
Me ? sd n n 
p=, > x? Practical Conclusion: Notice that the moment estimator of 


i=1 o” is not an unbiased estimator. 


EXAMPLE 7-8 Gamma Distribution Moment Estimators 

Suppose that X,, X,,..., X,, is arandom sample from a gamma To illustrate, consider the time to failure data introduced fol- 
distribution with parameters r and \. For the gamma distribution, Tye Example 7-6. For these data, x = 21.65 and 
E(X) = r/A and E(X”) = r(r + 1)/A2. The moment estima- D}_ 1x? = 6639.40, so the moment estimates are 

tors are found by solving 


; (21.65) 
p= 5 = 1.29, 
ee veony 13x (1/8)6645.43 — (21.65) 
r/A = xX, rr 
« 21.65 
= 5 = 0.0598 


The resulting estimators are (1/8)6645.43 — (21.65) 

Interpretation: When r = 1, the gamma reduces to the expo- 

- nential distribution. Because 7 slightly exceeds unity, it is 

(1/n) S x2 — ¥2 (1/n) x x2 — ¥2 quite possible that either the gamma or the exponential distri- 
i=1 =i bution would provide a reasonable model for the data. 


x x 
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7-4.2 Method of Maximum Likelihood 


Maximum 
Likelihood 
Estimator 


One of the best methods of obtaining a point estimator of a parameter is the method of maxi- 
mum likelihood. This technique was developed in the 1920s by a famous British statistician, 
Sir R. A. Fisher. As the name implies, the estimator will be the value of the parameter that 
maximizes the likelihood function. 


Suppose that X is a random variable with probability distribution f(x; 8), where 0 is 
a single unknown parameter. Let x,, x,,..., x, be the observed values in a random 
sample of size n. Then the likelihood function of the sample is 


L(0) = (ais 8) » f(x; 8) +++ + Fn 8) (7-9) 


Note that the likelihood function is now a function of only the unknown parameter 0. 
The maximum likelihood estimator (MLE) of 0 is the value of 6 that maximizes 
the likelihood function L(8). 


In the case of a discrete random variable, the interpretation of the likelihood function is 
simple. The likelihood function of the sample L(6) is just the probability 


P(X, = x1, X_ = X,...,Xp_ = Xp) 
That is, L(0) is just the probability of obtaining the sample values x,, x,,..., x,. Therefore, in 


the discrete case, the maximum likelihood estimator is an estimator that maximizes the prob- 
ability of occurrence of the sample values. 


EXAMPLE 7-9 Bernoulli Distribution MLE 


Let X be a Bernoulli random variable. The probability mass n n 
function is In L(p) = 2. x; |Inp +(n—- x; }In(1 — p) 
i= i= 
Bas pa - i ae x=0,1 Now, 
i) { 0, otherwise 

n n 

where p is the parameter to be estimated. The likelihood func- din L(p) Xj (> — > x) 
i=1 i=1 


tion of a random sample of size n is 


L(p) = p"(1 — p)™"p@(1 — p)' 2 p(1 = p)i™ 
I 


dp P 1—p 
Equating this to zero and solving for p yields p = (1/n) }~3;.- 


ui ” 
= Il pi(l- p)'* = pet (1 — p) a Therefore, the maximum likelihood estimator of p is 
i=l 


n 12 
We observe that if p maximizes L(p), p also maximizes om >> X; 


In L(p). Therefore, 


Suppose that this estimator was applied to the following situation: n items are selected 
at random from a production line, and each item is judged as either defective (in which case 
we set x, = 1) or nondefective (in which case we set x; = 0). Then >}_, x; is the number of 
defective units in the sample, and p is the sample proportion defective. The parameter p is 
the population proportion defective, and it seems intuitively quite reasonable to use p as an 
estimate of p. 
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Although the interpretation of the likelihood function given above is confined to the dis- 
crete random variable case, the method of maximum likelihood can easily be extended to a 
continuous distribution. We now give two examples of maximum likelihood estimation for 


continuous distributions. 


EXAMPLE 7-10 Normal Distribution MLE 

Let X be normally distributed with unknown p and known 
variance o”. The likelihood function of a random sample of 
size n, say X,, Xp, ... , X,, iS 


n? 


and 


OY 2 Sw w) 


du i=1 
” 1 21992 1 =13 2 . : ‘ : 
L(p) = I] ar ei BY/20") — Gnoy? C202 = hi) Equating this last result to zero and solving for wp yields 
> x; 
Now, i = i — -~X 
In L(w) = —(n/2) In(2107) — (207)! > (x; — p)* Conclusion: The sample mean is the maximum likelihood esti- 
i=1 


mator of 2. Notice that this is identical to the moment estimator. 


EXAMPLE 7-11 Exponential Distribution MLE 


Let X be exponentially distributed with parameter \. The like- Now, 
lihood function of a random sample of size n, say, X,, X2,... , dinL() n n 
Xn is a _ Xr — » Xj 


The log likelihood is 


n 
= A x 
Il Ae = A"e Az i 


n 
InL(A) =nIndA-A > x; 


i=1 


and upon equating this last result to zero we obtain 


faal> H=1/k 


i=l 
Conclusion: Thus, the maximum likelihood estimator of \ is 


the reciprocal of the sample mean. Notice that this is the same 
i=1 as the moment estimator. 


It is easy to illustrate graphically just how the method of maximum likelihood works. 
Figure 7-7(a) plots the log of the likelihood function for the exponential parameter from 
Example 7-11, using the n = 8 observations on failure time given following Example 7-6. It 
is common for the log likelihood function to be negative. We found that the estimate of \ was 
\ = 0.0462. From Example 7-11, we know that this is a maximum likelihood estimate. Figure 
7-7(a) shows clearly that the log likelihood function is maximized at a value of A that is ap- 
proximately equal to 0.0462. Notice that the log likelihood function is relatively flat in the re- 
gion of the maximum. This implies that the parameter is not estimated very precisely. If the 
parameter were estimated precisely, the log likelihood function would be very peaked at the 
maximum value. The sample size here is relatively small, and this has led to the imprecision 
in estimation. This is illustrated in Fig. 7-7(b) where we have plotted the difference in log like- 
lihoods for the maximum value, assuming that the sample sizes were n = 8, 20, and 40 but 
that the sample average time to failure remained constant at x = 21.65. Notice how much 
steeper the log likelihood is for = 20 in comparison to n = 8, and for = 40 in comparison 
to both smaller sample sizes. 

The method of maximum likelihood can be used in situations where there are several un- 
known parameters, say, 0), 95, ... , 8, to estimate. In such cases, the likelihood function is a func- 
tion of the & unknown parameters 0), 95, ..., 6,, and the maximum likelihood estimators {0;} 
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Figure 7-7 Log likelihood for the exponential distribution, using the failure time data. (a) Log likelihood with n = 8 (original 
data). (b) Log likelihood if n = 8, 20, and 40. 


would be found by equating the k partial derivatives 0L(81, 95, ..., 0,)/00; i = 1, 2,...,k to 
zero and solving the resulting system of equations. 


EXAMPLE 7-12 Normal Distribution MLEs for p and o? 
Let X be normally distributed with mean w and variance oc”, Now, 
where both p and o? are unknown. The likelihood function for 


2 
a random sample of size n is d In L(p, 0”) ieee = 
au om (x; — B) 
i= 
us 1 “fe. — W292 din Lip o°) n 1 a 
L(p, 02) = e Gi BY /20°) : = : 2-9 
ed II oV2T a(o”) 207 204 & (i ~ H) 
= | ie ir BP The solutions to the above equations yield the maximum 
(21107)? likelihood estimators 
and ie ee v2 
=X #=—->%,-X) 
i=1 
1 n 
In L(p, 07) = —> In(2mr07) — 462 (x; — py Conclusion: Once again, the maximum likelihood estimators 


are equal to the moment estimators. 


Properties of the Maximum Likelihood Estimator 

The method of maximum likelihood is often the estimation method that mathematical statisti- 
cians prefer, because it produces estimators with good statistical properties. We summarize 
these properties as follows. 


Properties of 


ss Maxinnent Under very general and not restrictive conditions, when the sample size n is large and 
Likelihood if © is the maximum likelihood estimator of the parameter 0, 
Estimator 


(1) © is an approximately unbiased estimator for 6 [E() ~ @], 


(2) the variance of @ is nearly as small as the variance that could be obtained 
with any other estimator, and 


(3) @has an approximate normal distribution. 


242 


CHAPTER 7 SAMPLING DISTRIBUTIONS AND POINT ESTIMATION OF PARAMETERS 


Properties 1 and 2 essentially state that the maximum likelihood estimator is approxi- 
mately an MVUE. This is a very desirable result and, coupled with the fact that it is fairly easy 
to obtain in many situations and has an asymptotic normal distribution (the “asymptotic” 
means “when n is large’), explains why the maximum likelihood estimation technique is 
widely used. To use maximum likelihood estimation, remember that the distribution of the 


population must be either known or assumed. 
To illustrate the “large-sample” or asymptotic nature of the above properties, consider the 
maximum likelihood estimator for a7, the variance of the normal distribution, in Example 7-12. 


It is easy to show that 


The bias is 


Because the bias is negative, G” tends to underestimate the true variance o7. Note that the bias 


approaches zero as n increases. Therefore, & 


2 2 


is an asymptotically unbiased estimator for 0°. 


We now give another very important and useful property of maximum likelihood 


estimators. 
Invariance 
Property Let 0, 0,, Foes On be the maximum likelihood estimators of the parameters 6), 
Q5,..., 8,. Then the maximum likelihood estimator of any function h(8,, 6, ... , 9;) 
of these parameters is the same function h(,, @,,..., ©,) of the estimators 
0, O, ..., Op. 


EXAMPLE 7-13 

In the normal distribution case, the maximum likelihood esti- 
mators of pw and o? were (2 = X and G* = D}_,(X; — X)*/n. 
To obtain the maximum likelihood estimator of the function 
h(p, 0°) = Vo? = o, substitute the estimators ji and 6? into 
the function h, which yields 


6=V@= E > (X; x] 


Conclusion: The maximum likelihood estimator of the standard 
deviation o is not the sample standard deviation S. 


Complications in Using Maximum Likelihood Estimation 


While the method of maximum likelihood is an excellent technique, sometimes complications 
arise in its use. For example, it is not always easy to maximize the likelihood function because 
the equation(s) obtained from dL(8)/d6 = 0 may be difficult to solve. Furthermore, it may 
not always be possible to use calculus methods directly to determine the maximum of L(68). 


These points are illustrated in the following two examples. 


EXAMPLE 7-14 Uniform Distribution MLE 

Let X be uniformly distributed on the interval 0 to a. Since the 
density function is f(x) = 1/a for 0 = x = a and zero other- 
wise, the likelihood function of a random sample of size n is 


for 


Note that the slope of this function is not zero anywhere. That 
is, as long as max(x;) = a, the likelihood is 1/a”, which is pos- 
itive, but when a < max(x;), the likelihood goes to zero, as 
illustrated in Fig. 7-8. Therefore, calculus methods cannot be 
used directly because the maximum value of the likelihood 
function occurs at a point of discontinuity. However, since 
d/da(a~") = —n/a"* ‘is less than zero for all values of a > 0, 
a "is adecreasing function of a. This implies that the maximum 
of the likelihood function L(a) occurs at the lower boundary 


L(a) | 
Figure 7-8 The like- 
lihood function for the 
uniform distribution in 
Example 7-14. 0) Max (x;) 


point. The figure clearly shows that we could maximize L(a) by 
setting a equal to the smallest value that it could logically take 
on, which is max(x;). Clearly, a cannot be smaller than the 


EXAMPLE 7-15 Gamma Distribution MLE 
Let X,, X,... , X, be arandom sample from the gamma distri- 
bution. The log of the likelihood function is 


icra) 


i=1 


=nrin(ad) + (r-1) & Inv) 
i=1 
= nin[P(r)]-2 Sx 
i=1 
The derivatives of the log likelihood are 
d In L(r, A) n I(r) 
ee nin(a) 4 = In(x;) — 1 TW 


dln L(r, A) nr i 


an ak 


(a) 
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largest sample observation, so setting a equal to the largest 
sample value is reasonable. 


When the derivatives are equated to zero, we obtain the equa- 
tions that must be solved to find the maximum likelihood esti- 
mators of r and X: 


i 
x 
nIn(A) + S In(x;) ae @) 


There is no closed form solution to these equations. 

Figure 7-9 shows a graph of the log likelihood for the 
gamma distribution using the n = 8 observations on failure 
time introduced previously. Figure 7-9(a) shows the log likeli- 
hood surface as a function of r and \, and Figure 7-9(b) is a 
contour plot. These plots reveal that the log likelihood is 
maximized at approximately * = 1.75 and A = 0.08. Many 
statistics computer programs use numerical techniques to 
solve for the maximum likelihood estimates when no simple 
solution exists. 


a 0.081 Fe". 


0.079 |" -32,009 
31.398 

0.077% 

0.075 * 
1.58 1.62 


(b) 


Figure 7-9 Log likelihood for the gamma distribution using the failure time data. (a) Log likelihood surface. (b) Contour plot. 
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7-4.3 Bayesian Estimation of Parameters 


This book uses methods of statistical inference based on the information in the sample data. 
In effect, these methods interpret probabilities as relative frequencies. Sometimes we call 
probabilities that are interpreted in this manner objective probabilities. There is another 
approach to statistical inference, called the Bayesian approach, that combines sample infor- 
mation with other information that may be available prior to collecting the sample. In this sec- 
tion we briefly illustrate how this approach may be used in parameter estimation. 

Suppose that the random variable X has a probability distribution that is a function of one 
parameter 6. We will write this probability distribution as f(x| 6). This notation implies that 
the exact form of the distribution of X is conditional on the value assigned to 0. The classical ap- 
proach to estimation would consist of taking a random sample of size n from this distribution 
and then substituting the sample values x; into the estimator for 0. This estimator could have 
been developed using the maximum likelihood approach, for example. 

Suppose that we have some additional information about @ and that we can summarize 
that information in the form of a probability distribution for 0, say, f(6). This probability dis- 
tribution is often called the prior distribution for 6, and suppose that the mean of the prior is 
\., and the variance is 0. This is a very novel concept insofar as the rest of this book is con- 
cerned because we are now viewing the parameter 0 as a random variable. The probabilities 
associated with the prior distribution are often called subjective probabilities, in that they 
usually reflect the analyst’s degree of belief regarding the true value of 8. The Bayesian 
approach to estimation uses the prior distribution for 0, f(®), and the joint probability distri- 
bution of the sample, say, f(x), %,...,x,|9), to find a posterior distribution for 0, say, 
fO | x1, Xo, ... , X,)- This posterior distribution contains information from both the sample and 
the prior distribution for 0. In a sense, it expresses our degree of belief regarding the true value 
of 6 after observing the sample data. It is easy conceptually to find the posterior distribution. 
The joint probability distribution of the sample X,, X>,..., X,, and the parameter @ (remember 
that 8 is a random variable) is 


Fs %, soe Uns 8) = f (XX, sna Xn | 9) f(9) 


and the marginal distribution of X,, X>,..., X,, is 
D fer cighe 8), @ discrete 
| F(X, Xp. +++ %n, 8) d8, 9 continuous 


Therefore, the desired distribution is 


hicce XQ, +++ Xp 9) 


Ricce XQ, +++ 5 a) 


F(O| x1, X2, «-+ 5 Xp) ~ 


We define the Bayes estimator of @ as the value 6 that corresponds to the mean of the poste- 
rior distribution f(0| x1, x2, .-. ; X,)- 

Sometimes, the mean of the posterior distribution of @ can be determined easily. As a 
function of 6, f(8| x1, ..., x,) is a probability density function and x,, ... , x, are just constants. 


Because 9 enters into f(0|x;,.. 


.,X,) only through f(x, .. 
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Xp, 9) if f(x, ...,X_, 9), as a 


function of 6 is recognized as a well-known probability function, the posterior mean of 8 can 
be deduced from the well-known distribution without integration or even calculation of 


Sf (m1, pers Xn) 


EXAMPLE 7-16 Bayes Estimator for the Mean of a Normal Distribution 


Let X,, X>,... , X, be a random sample from the normal distri- 
bution with mean p and variance o”, where p is unknown and 
o” is known. Assume that the prior distribution for y is normal 
with mean 19 and variance 09; that is, 


1 


_ 1 
f(p) = Vinay” 


V2103 


—(h- Bo) /(208) = ewe —2 pon + p.5)/(205) 


The joint probability distribution of the sample is 


= YS (~,—y)2 
f(a, May ses 4 Xp | LB) A (2002)? e (120 2% ¥) 


= 1 e (1/20?) BaxF ~ 2px; + np?) 
(200°)? 
Thus, the joint probability distribution of the sample and p is 
1 
X1, XQ, --+ Xn BW) = 
f( 1> 42: n L) (2n0°)"?V/2709 
K eo /AM(/oo+n/o") pw —(2o/og + 2Ex;/o°) + Zxj/o" + 15/0] 


-0a(a+ eA a + 
= (1) of o/n Me 06 aa) h 


=eé 


(1, see Xn o, Mo» 55) 


Upon completing the square in the exponent, 


wn(2 a. )| ; C= m8 ) 
Fx X x ) =e ) ao o/n te oito'/n opto’/n 
9 XQ. +++ Xn 


2 2 
X ho(X1, --- Xn» Os Lo, OO) 


2 


where h,(x1,... ; Xys 67s bios 09) is a function of the observed 
values and the parameters 0”, \1o, and o. 


Now, because f(x), .. 
1 1 2 eres 
~~ 1 y) — 
(u/ (3 r aa ( 05 +o0°/n 


2 2 
X 3(X1, --- Xp» Os Mo, FO) 


., X,) does not depend on p, 


Flu| x1. 0%) =e 


This is recognized as a normal probability density function 
with posterior mean 


(o7/n) po + 06x 
06 + 0?/n 


and posterior variance 


(at em) 
0 ' o2/n 


Consequently, the Bayes estimate of is a weighted average 
of p49 and x. For purposes of comparison, note that the maxi- 
mum likelihood estimate of is { = x. 

To illustrate, suppose that we have a sample of size n = 
10 from a normal distribution with unknown mean w and 
variance o* = 4. Assume that the prior distribution for w is 
normal with mean jp = 0 and variance of = 1. If the sample 
mean is 0.75, the Bayes estimate of is 


09(0°/n) 


05 + o°/n 


(4/10)0 + 1(0.75) 0.75 
1 + (4/10) 1.4 


= 0.536 


Conclusion: Note that the maximum likelihood estimate of w 


is x = 0.75. The Bayes estimate is between the maximum like- 
lihood estimate and the prior mean. 


There is a relationship between the Bayes estimator for a parameter and the maximum 
likelihood estimator of the same parameter. For large sample sizes, the two are nearly 
equivalent. In general, the difference between the two estimators is small compared to 
1/Vn. In practical problems, a moderate sample size will produce approximately the same 
estimate by either the Bayes or maximum likelihood method, if the sample results are con- 
sistent with the assumed prior information. If the sample results are inconsistent with the 
prior assumptions, the Bayes estimate may differ considerably from the maximum likeli- 
hood estimate. In these circumstances, if the sample results are accepted as being correct, 
the prior information must be incorrect. The maximum likelihood estimate would then be 
the better estimate to use. 

If the sample results are very different from the prior information, the Bayes estimator 
will always tend to produce an estimate that is between the maximum likelihood estimate and 
the prior assumptions. If there is more inconsistency between the prior information and the 
sample, there will be more difference between the two estimates. 
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EXERCISES FOR SECTION 7-4 


7-34. Let X be a geometric random variable with parameter 
p. Find the maximum likelihood estimator of p, based on a 
random sample of size n. 

7-35. Consider the Poisson distribution with parameter A. 
Find the maximum likelihood estimator of A, based on a 
random sample of size n. 

7-36. Let X be a random variable with the following proba- 
bility distribution: 


(0+ 1)x9, O<x<=1 


so) = { — 


Find the maximum likelihood estimator of 6, based on a random 
sample of size n. 


7-37. 


otherwise 


Consider the shifted exponential distribution 


f(x) = AVN AY), x=@6 

When 0 = 0, this density reduces to the usual exponential 

distribution. When 0 > 0, there is only positive probability to 

the right of 0. 

(a) Find the maximum likelihood estimator of A and 6, based 
on arandom sample of size n. 

(b) Describe a practical situation in which one would suspect 
that the shifted exponential distribution is a plausible 
model. 


7-38. Consider the probability density function 


1 
fe) = gaxe™, 


Os=x<0, 0<0<x% 


Find the maximum likelihood estimator for 0. 


7-39. Let X,, X5,..., X,, be uniformly distributed on the in- 
terval 0 to a. Show that the moment estimator of a is @ = 2X. 
Is this an unbiased estimator? Discuss the reasonableness of 
this estimator. 


7-40. Consider the probability density function 


f(x) =c(1 + 6x), -lsx=1 

(a) Find the value of the constant c. 

(b) What is the moment estimator for 0? 

(c) Show that 6 = 3X is an unbiased estimator for 0. 
(d) Find the maximum likelihood estimator for 0. 
7-41. 
function 


The Rayleigh distribution has probability density 


(a) It can be shown that E(X) = 26. Use this information to 
construct an unbiased estimator for 0. 

(b) Find the maximum likelihood estimator of 8. Compare 
your answer to part (a). 

(c) Use the invariance property of the maximum likelihood 
estimator to find the maximum likelihood estimator of the 
median of the Raleigh distribution. 


7-42. Let X;, Xz,..., X, be uniformly distributed on the 
interval 0 to a. Recall that the maximum likelihood estimator 
of ais G = max(X)). 

(a) Argue intuitively why @ cannot be an unbiased estimator 
for a. 

(b) Suppose that E(@) = na/(n + 1). Is it reasonable that @ 
consistently underestimates a? Show that the bias in the 
estimator approaches zero as n gets large. 

(c) Propose an unbiased estimator for a. 

(d) Let Y = max(X;). Use the fact that Y = y if and only 
if each X;=y to derive the cumulative distribution 
function of Y. Then show that the probability density func- 
tion of Y is 


0 , otherwise 


Use this result to show that the maximum likelihood esti- 
mator for a is biased. 

(e) We have two unbiased estimators for a: the moment 
estimator @, = 2X and 4G = [(n + 1)/n] max(X)), 
where max(X;) is the largest observation in a random 
sample of size n. It can be shown that V(a,) = a*/(3n) 
and that V(a@) = a’/[n(n + 2)]. Show that if n > 1, a 
is a better estimator than @. In what sense is it a better 
estimator of a? 


7-43. Consider the Weibull distribution 


O<x 


0 , otherwise 


(a) Find the likelihood function based on a random sample of 
size n. Find the log likelihood. 

(b) Show that the log likelihood is maximized by solving the 
equations 


> xf In(xi) 


i=1 i=1 


n 
>t 
i=1 


n 1/B 
aw 
i=1 


n 


56 = 


(c) What complications are involved in solving the two equa- 
tions in part (b)? 


7-44. Reconsider the oxide thickness data in Exercise 7-29 
and suppose that it is reasonable to assume that oxide thick- 
ness is normally distributed. 

(a) Compute the maximum likelihood estimates of . and o°. 

(b) Graph the likelihood function in the vicinity of fi and 6”, 
the maximum likelihood estimates, and comment on its 
shape. 

(c) Suppose that the sample size was larger (n = 40) but the 
maximum likelihood estimates were numerically equal to 
the values obtained in part (a). Graph the likelihood func- 
tion for n = 40, compare it to the one from part (b), and 
comment on the effect of the larger sample size. 


7-45. Suppose that X is a normal random variable with un- 
known mean yp and known variance o”. The prior distribution 
for w is anormal distribution with mean jt, and variance 03. 
Show that the Bayes estimator for 1. becomes the maximum 
likelihood estimator when the sample size n is large. 


7-46. Suppose that X is a normal random variable with un- 
known mean w and known variance o”. The prior distribution 
for pis a uniform distribution defined over the interval [a, b]. 
(a) Find the posterior distribution for wp. 

(b) Find the Bayes estimator for wp. 


7-47. Suppose that X is a Poisson random variable with pa- 

rameter \. Let the prior distribution for \ be a gamma distri- 

bution with parameters m + 1 and (m + 1)/Ao. 

(a) Find the posterior distribution for A. 

(b) Find the Bayes estimator for A. 

7-48. Suppose that X is a normal random variable with 

unknown mean and known variance o* = 9. The prior distri- 

bution for pz is normal with wo = 4 and of = 1. A random 

sample of n = 25 observations is taken, and the sample mean 

is x = 4.85. 

(a) Find the Bayes estimate of wp. 

(b) Compare the Bayes estimate with the maximum likeli- 
hood estimate. 


7-49. The weight of boxes of candy is a normal random 
variable with mean p and variance 1/10 pound. The prior dis- 
tribution for w is normal, with mean 5.03 pound and variance 
1/25 pound. A random sample of 10 boxes gives a sample 
mean of x = 5.05 pounds. 

(a) Find the Bayes estimate of wp. 

(b) Compare the Bayes estimate with the maximum likeli- 

hood estimate. 


7-50. The time between failures of a machine has an expo- 


nential distribution with parameter \. Suppose that the prior 
distribution for \ is exponential with mean 100 hours. Two 
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machines are observed, and the average time between failures 

is x = 1125 hours. 

(a) Find the Bayes estimate for i. 

(b) What proportion of the machines do you think will fail 
before 1000 hours? 


Supplemental Exercises 


7-51. Transistors have a life that is exponentially distributed 
with parameter \. A random sample of n transistors is taken. 
What is the joint probability density function of the sample? 


7-52. Suppose that a random variable is normally distrib- 
uted with mean p and variance o7, and we draw a random 
sample of five observations from this distribution. What is the 
joint probability density function of the sample? 


7-53. Suppose that X is uniformly distributed on the interval 
from 0 to 1. Consider a random sample of size 4 from X. What 
is the joint probability density function of the sample? 


7-54. A procurement specialist has purchased 25 resistors 
from vendor 1 and 30 resistors from vendor 2. Let X,4, 
Xj9,.-., X15 represent the vendor 1 observed resistances, 
which are assumed to be normally and independently distrib- 
uted with mean 100 ohms and standard deviation 1.5 ohms. 
Similarly, let X,,, X>,..., X39 represent the vendor 2 ob- 
served resistances, which are assumed to be normally and in- 
dependently distributed with mean 105 ohms and standard 
deviation of 2.0 ohms. What is the sampling distribution of 
X, — X,? What is the standard error of X; — X,? 


7-55. A random sample of 36 observations has been drawn 
from a normal distribution with mean 50 and standard deviation 
12. Find the probability that the sample mean is in the interval 
47 < X < 53. Is the assumption of normality important? Why? 


7-56. A random sample of n = 9 structural elements is 
tested for compressive strength. We know that the true mean 
compressive strength 2 = 5500 psi and the standard deviation 
is o = 100 psi. Find the probability that the sample mean 
compressive strength exceeds 4985 psi. 


7-57. A normal population has a known mean 50 and 
known variance o”? = 2. A random sample of n = 16 is se- 
lected from this population, and the sample mean is x = 52. 
How unusual is this result? 


7-58. Arandom sample of size n = 16 is taken from a nor- 
mal population with j. = 40 and o? = 5. Find the probability 
that the sample mean is less than or equal to 37. 

7-59. A manufacturer of semiconductor devices takes a 
random sample of 100 chips and tests them, classifying each 
chip as defective or nondefective. Let X; = 0 if the chip is 
nondefective and X; = 1 if the chip is defective. The sample 
fraction defective is 


a + Xy + +++ + X09 
100 


What is the sampling distribution of the random variable P? 
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7-60. Let X be a random variable with mean pw and variance 
o”. Given two independent random samples of sizes n, and ny, 
with sample means X, and X>, show that 


>< 


=aX,+(l-a)X, O0<a<1 


is an unbiased estimator for jw. If X; and X> are independent, 
find the value of a that minimizes the standard error of X. 


7-61. A random variable x has probability density function 


O<x<a%, 0<0<m 


Find the maximum likelihood estimator for 0. 

7-62. Let f(x) = 0x9!) 0O<@0<%, and O<x<1. 
Show that @ = —n/(In TI/_, X;) is the maximum likelihood 
estimator for 0. 


7-63. Letf(x) = (1/0)x"- 9" 0<x<land0<0 <~, 
Show that @ = —(1/n) 57_, In(X;) is the maximum likelihood 


estimator for @ and that © is an unbiased estimator for 0. 


CHAPTER 7 SAMPLING DISTRIBUTIONS AND POINT ESTIMATION OF PARAMETERS 


7-64. You plan to use a rod to lay out a square, each side of 
which is the length of the rod. The length of the rod is , which 
is unknown. You are interested in estimating the area of the 
square, which is 2. Because pw is unknown, you measure it n 
times, obtaining observations X,, X5,..., X,. Suppose that 
each measurement is unbiased for jt. with variance o”. 


(a) Show that X° is a biased estimate of the area of the square. 
(b) Suggest an estimator that is unbiased. 


7-65. An electric utility has placed special meters on 10 
houses in a subdivision that measures the energy consumed 
(demand) at each hour of the day. They are interested in the en- 
ergy demand at one specific hour, the hour at which the system 
experiences the peak consumption. The data from these 10 me- 
ters are as follows (in KW): 23.1, 15.6, 17.4, 20.1, 19.8, 26.4, 
25.1, 20.5, 21.9, and 28.7. If pz is the true mean peak demand 
for the ten houses in this group of houses having the special 
meters, estimate yp. Now suppose that the utility wants to esti- 
mate the demand at the peak hour for all 5,000 houses in this 
subdivision. Let @ be this quantity. Estimate @ using the data 
given above. Estimate the proportion of houses in the subdivi- 
sion that demand at least 20KW at the hour of system peak. 


MIND-EXPANDING EXERCISES 


7-66. A lot consists of N transistors, and of these, M 
(M = N) are defective. We randomly select two transis- 
tors without replacement from this lot and determine 
whether they are defective or nondefective. The 
random variable 


1, if the ith transistor 
is nondefective 

0, if the ith transistor 
is defective 


i= 12 


Determine the joint probability function for X, and X. 
What are the marginal probability functions for X, and 
X,? Are X, and X, independent random variables? 

7-67. When the sample standard deviation is based on a 
random sample of size n from a normal population, it can 
be shown that S is a biased estimator for o. Specifically, 


E(S) = oV2/(n — 1) V(n/2)/T (a — 1/2] 
(a) Use this result to obtain an unbiased estimator for o 


of the form c,S, when the constant c, depends on the 
sample size n. 


(b) Find the value of c, for n= 10 and n= 25. 
Generally, how well does S perform as an estimator 
of o for large n with respect to bias? 


7-68. A collection of n randomly selected parts is 

measured twice by an operator using a gauge. Let X; and 

Y, denote the measured values for the ith part. Assume 

that these two random variables are independent and 

normally distributed and that both have true mean j1; and 

variance o°. 

(a) Show that the maximum likelihood estimator of o? 
is 6? = (1/4n) >j_,(X; — Y;). 

(b) Show that G? is a biased estimator for G?. What 
happens to the bias as n becomes large? 

(c) Find an unbiased estimator for o°. 


7-69. Consistent Estimator. Another way to measure 
the closeness of an estimator 6 to the parameter @ is in 
terms of consistency. If 6, is an estimator of 0 based on a 
random sample of n observations, 6, is consistent for 0 if 


lim P(|6, — 6| <«) =1 


no 
Thus, consistency is a large-sample property, describing 
the limiting behavior of ©, as n tends to infinity. It is 
usually difficult to prove consistency using the above 
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MIND-EXPANDING EXERCISES 


definition, although it can be done from other ap- 
proaches. To illustrate, show that X is a consistent esti- 
mator of w (when o* < ~) by using Chebyshev’s 
inequality from the supplemental material on the Web. 
7-70. Order Statistics. Let X,, X5, ... , X,, be a random 
sample of size n from X, a random variable having distri- 
bution function F(x). Rank the elements in order of in- 
creasing numerical magnitude, resulting in Xj), Xq, ..., 
Xn, Where Xq) is the smallest sample element (X() = 
min{X,, X>,..., X,}) and X;,) is the largest sample ele- 
ment (X;,, = max{X), X5,..., X,}). Xj is called the ith 
order statistic. Often the distribution of some of the order 
Statistics is of interest, particularly the minimum and 
maximum sample values X,;) and X(,), respectively. 
(a) Prove that the cumulative distribution functions of 
these two order statistics, denoted respectively by 
Fy ,(t) and Fy. (t), are 


(b) Prove that if X is continuous with probability den- 
sity function f(x), the probability distributions of 
Xi) and X,,) are 


fa (t) = nll — FOP FO 
Fag) = nLF)Y" FO) 


(c) Let X,, X>,..., X, be arandom sample of a Bernoulli 
random variable with parameter p. Show that 


P(X) c= 1) = i= (1 = janie 
P(X) = 0) = 1 — p" 


(d) Let X,, X>,..., X,, be a random sample of a normal 
random variable with mean ws and variance o”. Derive 
the probability density functions of X(,) and X(,). 

(e) Let X;, X>,..., X,, be a random sample of an expo- 
nential random variable of parameter \. Derive the 
cumulative distribution functions and probability 
density functions for X(,, and X;,,). 


7-71. Let X,, X,..., X, be a random sample of a 
continuous random variable with cumulative distribu- 
tion function F(x). Find 


and 


7-72. Let X be a random variable with mean p and 
variance o”, and let X;, X>,..., X, be a random sample 
of size n from X. Show that the statistic V = k>;_ : 
(X;4, — X;)° is an unbiased estimator for o? for an ap- 
propriate choice for the constant k. Find this value for k. 


7-73. When the population has a normal distribution, 
the estimator 


& = median (|X, — X|, |X. — X|, 


++, |X, — X|)/0.6745 


is sometimes used to estimate the population standard 

deviation. This estimator is more robust to outliers than the 

usual sample standard deviation and usually does not dif- 

fer much from S when there are no unusual observations. 

(a) Calculate G and S for the data 10, 12, 9, 14, 18, 15, 
and 16. 

(b) Replace the first observation in the sample (10) with 
50 and recalculate both S and G. 


7-74. Censored Data. A common problem in indus- 
try is life testing of components and systems. In this 
problem, we will assume that lifetime has an exponen- 
tial distribution with parameter A, so (1 = 1/A = X is 
an unbiased estimate of jz. When n components are tested 
until failure and the data X,, X>,..., X,, represent actual 
lifetimes, we have a complete sample, and X is indeed an 
unbiased estimator of . However, in many situations, the 
components are only left under test until r < n failures 
have occurred. Let Y, be the time of the first failure, Y, 
be the time of the second failure, ..., and Y, be the 
time of the last failure. This type of test results in censored 
data. There are n — r units still running when the test 
is terminated. The total accumulated test time at 
termination is 


(a) Show that ji. = T,/r is an unbiased estimator for w. 
[Hint: You will need to use the memoryless property 
of the exponential distribution and the results of 
Exercise 7-70 for the distribution of the minimum of 
a sample from an exponential distribution with 
parameter A.] 

(b) It can be shown that V(T,/r) = 1/(A2r). How does 
this compare to V(X ) in the uncensored experiment? 
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IMPORTANT TERMS AND CONCEPTS 


Bayes estimator Mean square error of an = Normal distribution as Prior distribution 

Bias in parameter estimator the sampling distribu- Sample moments 
estimation Minimum variance tion of the difference Sampling distribution 

Central limit theorem unbiased estimator in two sample means Standard error and 

Estimator versus Moment estimator Parameter estimation estimated standard 
estimate Normal distribution Point estimator error of an estimator 

Likelihood function as the sampling Population or distribu- Statistic 

Maximum likelihood distribution of a tion moments Statistical inference 


estimator sample mean Posterior distribution Unbiased estimator 


© Steve Rosset/iStockphoto 


Statistical Intervals for 
a Single Sample 


INTRODUCTION 


Engineers are often involved in estimating parameters. For example, there is an ASTM 
Standard E23 that defines a technique called the Charpy V-notch method for notched 
bar impact testing of metallic materials. The impact energy is often used to determine 
if the material experiences a ductile-to-brittle transition as the temperature decreases. 
Suppose that you have tested a sample of 10 specimens of a particular material with 
this procedure. You know that you can use the sample average X to estimate the true 
mean impact energy 2. However, we also know that the true mean impact energy is un- 
likely to be exactly equal to your estimate. Reporting the results of your test as a single 
number is unappealing, because there is nothing inherent in X that provides any infor- 
mation about how close it is to w. Your estimate could be very close, or it could be con- 
siderably far from the true mean. A way to avoid this is to report the estimate in terms 
of a range of plausible values called a confidence interval. A confidence interval al- 
ways specifies a confidence level, usually 90%, 95%, or 99%, which is a measure of 
the reliability of the procedure. So if a 95% confidence interval on the impact energy 
based on the data from your 10 specimens has a lower limit of 63.84/ and an upper 
limit of 65.08J/, then we can say that at the 95% level of confidence any value of mean 
impact energy between 63.84 J and 65.08 J is a plausible value. By reliability, we mean 
that if we repeated this experiment over and over again, 95% of all samples would pro- 
duce a confidence interval that contains the true mean impact energy, and only 5% of 
the time would the interval be in error. In this chapter you will learn how to construct 
confidence intervals and other useful types of statistical intervals for many important 
types of problem situations. 
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CHAPTER 8 STATISTICAL INTERVALS FOR A SINGLE SAMPLE 


CHAPTER OUTLINE 

8-1 CONFIDENCE INTERVAL ON THE 8-3 CONFIDENCE INTERVAL ON THE 
MEAN OF A NORMAL DISTRIBU- VARIANCE AND STANDARD 
TION, VARIANCE KNOWN DEVIATION OF A NORMAL 


8-1.1 Development of the Confidence DISTRIBUTION 
Interval and Its Basic Properties 8-4 LARGE-SAMPLE CONFIDENCE 
8-1.2 Choice of Sample Size INTERVAL FOR A POPULATION 


PROPORTION 
8-1.3 One-Sided Confidence Bounds 
8-5 GUIDELINES FOR CONSTRUCT- 
8-1.4 General Method to Derive a ING CONFIDENCE INTERVALS 
Confidence Interval 
8-6 TOLERANCE AND PREDICTION 
8-1.5 Large-Sample Confidence INTERVALS 


Interval for p 


8-2. CONFIDENCE INTERVAL ON THE 
MEAN OF A NORMAL DISTRIBU- 
TION, VARIANCE UNKNOWN 8-6.2 Tolerance Interval for a Normal 


Distribution 
8-2.1 t Distribution 


8-6.1 Prediction Interval for a Future 
Observation 


8-2.2 t Confidence Interval on 


LEARNING OBJECTIVES 


After careful study of this chapter, you should be able to do the following: 


1. Construct confidence intervals on the mean of a normal distribution, using either the normal 
distribution or the t distribution method 


. Construct confidence intervals on the variance and standard deviation of a normal distribution 
. Construct confidence intervals on a population proportion 

. Use a general method for constructing an approximate confidence interval on a parameter 

. Construct prediction intervals for a future observation 


. Construct a tolerance interval for a normal population 


NWN Bh WwW WN 


. Explain the three types of interval estimates: confidence intervals, prediction intervals, and 
tolerance intervals 


In the previous chapter we illustrated how a point estimate of a parameter can be estimated 
from sample data. However, it is important to understand how good is the estimate obtained. 
For example, suppose that we estimate the mean viscosity of a chemical product to be 
iL = x = 1000. Now because of sampling variability, it is almost never the case that the true 
mean w. is exactly equal to the estimate x. The point estimate says nothing about how close jt 
is to w. Is the process mean likely to be between 900 and 1100? Or is it likely to be between 
990 and 1010? The answer to these questions affects our decisions regarding this process. 
Bounds that represent an interval of plausible values for a parameter are an example of an in- 
terval estimate. Surprisingly, it is easy to determine such intervals in many cases, and the same 
data that provided the point estimate are typically used. 

An interval estimate for a population parameter is called a confidence interval. 
Information about the precision of estimation is conveyed by the length of the interval. A short 
interval implies precise estimation. We cannot be certain that the interval contains the true, 
unknown population parameter—we only use a sample from the full population to compute 
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the point estimate and the interval. However, the confidence interval is constructed so that we 
have high confidence that it does contain the unknown population parameter. Confidence in- 
tervals are widely used in engineering and the sciences. 

A tolerance interval is another important type of interval estimate. For example, the 
chemical product viscosity data might be assumed to be normally distributed. We might like 
to calculate limits that bound 95% of the viscosity values. For a normal distribution, we know 
that 95% of the distribution is in the interval 


w — 1.960, wp + 1.960 


However, this is not a useful tolerance interval because the parameters . and o are unknown. 
Point estimates such as x and s can be used in the above equation for 1 and o. However, we 
need to account for the potential error in each point estimate to form a tolerance interval for 
the distribution. The result is an interval of the form 


x — ks,x + ks 


where k is an appropriate constant (that is larger than 1.96 to account for the estimation 
error). As in the case of a confidence interval, it is not certain that the tolerance interval 
bounds 95% of the distribution, but the interval is constructed so that we have high confi- 
dence that it does. Tolerance intervals are widely used and, as we will subsequently see, they 
are easy to calculate for normal distributions. 

Confidence and tolerance intervals bound unknown elements of a distribution. In this 
chapter you will learn to appreciate the value of these intervals. A prediction interval pro- 
vides bounds on one (or more) future observations from the population. For example, a 
prediction interval could be used to bound a single, new measurement of viscosity—another 
useful interval. With a large sample size, the prediction interval for normally distributed data 
tends to the tolerance interval, but for more modest sample sizes the prediction and tolerance 
intervals are different. 

Keep the purpose of the three types of interval estimates clear: 


e A confidence interval bounds population or distribution parameters (such as the mean 
viscosity). 
e A tolerance interval bounds a selected proportion of a distribution. 


e A prediction interval bounds future observations from the population or distribution. 


8-1 CONFIDENCE INTERVAL ON THE MEAN OF A NORMAL 


8-1.1 


DISTRIBUTION, VARIANCE KNOWN 


The basic ideas of a confidence interval (CI) are most easily understood by initially consider- 
ing a simple situation. Suppose that we have a normal population with unknown mean wp and 
known variance o”. This is a somewhat unrealistic scenario because typically both the mean 
and variance are unknown. However, in subsequent sections we will present confidence inter- 
vals for more general situations. 


Development of the Confidence Interval and Its Basic Properties 


Suppose that X,, X>,..., X, 1s a random sample from a normal distribution with unknown 
mean wand known variance o”. From the results of Chapter 5 we know that the sample mean 
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Confidence 
Interval on the 
Mean, Variance 
Known 


X is normally distributed with mean p and variance o”/n. We may standardize X by sub- 
tracting the mean and dividing by the standard deviation, which results in the variable 


_X7p 
~ o/Va 


Z (8-1) 


The random variable Z has a standard normal distribution. 

A confidence interval estimate for p is an interval of the form / = jw. = u, where the end- 
points / and u are computed from the sample data. Because different samples will produce 
different values of / and u, these end-points are values of random variables L and U, respec- 
tively. Suppose that we can determine values of L and U such that the following probability 
statement is true: 


P{LspSU}=1-a (8-2) 


where 0 = a = 1. There is a probability of 1 — a of selecting a sample for which the CI will 
contain the true value of . Once we have selected the sample, so that X; = x), X. = %,..., 
X,, = X,, and computed / and u, the resulting confidence interval for w is 


l=psu (8-3) 


The end-points or bounds / and u are called the lower- and upper-confidence limits, respec- 
tively, and 1 — a is called the confidence coefficient. 

In our problem situation, because Z = (X — 2)/(o/V/n) has a standard normal distribu- 
tion, we may write 


X= pi 
P alt = Gin Fa? =l-a 


Now manipulate the quantities inside the brackets by (1) multiplying through by o/ Vn, (2) 
subtracting X from each term, and (3) multiplying through by —1. This results in 


— oO — oO 
pfx =ne¥t uo S}=1-4 (8-4) 


From consideration of Equation 8-4, the lower and upper limits of the inequalities in Equation 
8-4 are the lower- and upper-confidence limits L and U, respectively. This leads to the fol- 
lowing definition. 


If x is the sample mean of a random sample of size n from a normal population with 
known variance o7, a 100(1 — a)% Cl on p is given by 


J = 20/ v= a Zq20/Vn (8-5) 


where Z,./2 is the upper 100a/2 percentage point of the standard normal distribution. 
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EXAMPLE 8-1 Metallic Material Transition 

ASTM Standard E23 defines standard test methods for 
notched bar impact testing of metallic materials. The Charpy 
V-notch (CVN) technique measures impact energy and is of- 
ten used to determine whether or not a material experiences a 
ductile-to-brittle transition with decreasing temperature. Ten 
measurements of impact energy (J) on specimens of A238 
steel cut at 60°C are as follows: 64.1, 64.7, 64.5, 64.6, 64.5, 
64.3, 64.6, 64.8, 64.2, and 64.3. Assume that impact energy is 
normally distributed with o = 1J. We want to find a 95% CI 
for , the mean impact energy. The required quantities are 
1.96,n = 10,0 = 1, and x = 64.46. The resulting 


Za/2 = 20,025 


Interpreting a Confidence Interval 


95% CI is found from Equation 8-5 as follows: 


oO _ oO 
a SU Spsxt afd Fe 


1 1 
64.46 — 1.96 —— <= vu = 64.46 + 1.96 ——— 
vio V10 


63.84 <p < 65.08 


Practical Interpretation: Based on the sample data, a range of 
highly plausible values for mean impact energy for A238 steel 
at 60°C is 63.84/ = p = 65.08 J. 


How does one interpret a confidence interval? In the impact energy estimation problem in 
Example 8-1, the 95% CT is 63.84 = ww = 65.08, so it is tempting to conclude that p is within 
this interval with probability 0.95. However, with a little reflection, it’s easy to see that this 
cannot be correct; the true value of w is unknown and the statement 63.84 <= pp <= 65.08 is ei- 
ther correct (true with probability 1) or incorrect (false with probability 1). The correct 
interpretation lies in the realization that a CI is a random interval because in the probability 
statement defining the end-points of the interval (Equation 8-2), L and U are random variables. 
Consequently, the correct interpretation of a 100(1 — «)% CI depends on the relative fre- 
quency view of probability. Specifically, if an infinite number of random samples are collected 
and a 100(1 — a)% confidence interval for 2 is computed from each sample, 100(1 — a)% of 
these intervals will contain the true value of w. 

The situation is illustrated in Fig. 8-1, which shows several 100(1 — a)% confidence 
intervals for the mean yw of a normal distribution. The dots at the center of the intervals indicate 
the point estimate of js (that is, x). Notice that one of the intervals fails to contain the true value of 
w. If this were a 95% confidence interval, in the long run only 5% of the intervals would fail to 


contain j. 


Now in practice, we obtain only one random sample and calculate one confidence 
interval. Since this interval either will or will not contain the true value of 1, it is not reason- 
able to attach a probability level to this specific event. The appropriate statement is that the ob- 
served interval [/, u] brackets the true value of » with confidence 100(1 — a). This statement 
has a frequency interpretation; that is, we don’t know if the statement is true for this specific 
sample, but the method used to obtain the interval [/, u] yields correct statements 10001 — a)% 


of the time. 


Figure 8-1 Repeated 
construction of a con- 1 


fidence interval for p. Interval number 


23 4 5 6 7 8 9 10 1112 13 14 15 16 
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Confidence Level and Precision of Estimation 

Notice in Example 8-1 that our choice of the 95% level of confidence was essentially arbi- 
trary. What would have happened if we had chosen a higher level of confidence, say, 99%? In fact, 
doesn’t it seem reasonable that we would want the higher level of confidence? At a = 0.01, we 
find Zy/2 = 2001/2 = 2000s = 2-58, while for a = 0.05, Zoo25 = 1.96. Thus, the length of the 95% 
confidence interval is 


2(1.960/Vn) = 3.920/Vn 
whereas the length of the 99% CI is 


2(2.580/Vn) = 5.160/Vn 


Thus, the 99% Clis longer than the 95% CI. This is why we have a higher level of confidence 
in the 99% confidence interval. Generally, for a fixed sample size n and standard deviation o, 
the higher the confidence level, the longer the resulting CI. 

The length of a confidence interval is a measure of the precision of estimation. From the 
preceeding discussion, we see that precision is inversely related to the confidence level. It is 
desirable to obtain a confidence interval that is short enough for decision-making purposes 
and that also has adequate confidence. One way to achieve this is by choosing the sample size 
n to be large enough to give a CI of specified length or precision with prescribed confidence. 


8-1.2 Choice of Sample Size 


Sample Size for 
Specified Error 
on the Mean, 
Variance Known 


Figure 8-2 Error in 
estimating j with x. 


The precision of the confidence interval in Equation 8-5 is 224/20/ Vn. This means that in 
using X to estimate , the error E = |x — p| is less than or equal to zy,0/Vn with 
confidence 100(1 — a). This is shown graphically in Fig. 8-2. In situations where the sam- 
ple size can be controlled, we can choose n so that we are 100(1 — a) percent confident that 
the error in estimating w is less than a specified bound on the error E. The appropriate 
sample size is found by choosing n such that Z,0/ Vn = E. Solving this equation gives the 
following formula for n. 


If x is used as an estimate of , we can be 100(1 — a)% confident that the error 
|x — p| will not exceed a specified amount E when the sample size is 


2, 
AS (= 2°) (8-6) 


E 


If the right-hand side of Equation 8-6 is not an integer, it must be rounded up. This will ensure 
that the level of confidence does not fall below 100(1 — a)%. Notice that 2E is the length of 
the resulting confidence interval. 


E = error = lz-ul 


= 


1l=X-Zzgpol fn x Lu U=Kt2Zg0/ fn 
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EXAMPLE 8-2 Metallic Material Transition 

To illustrate the use of this procedure, consider the CVN test 
described in Example 8-1, and suppose that we wanted to de- 
termine how many specimens must be tested to ensure that the 
95% Cl on pw for A238 steel cut at 60°C has a length of at most 
1.0J. Since the bound on error in estimation E is one-half of 
the length of the CI, to determine n we use Equation 8-6 with 
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E= 05,0 = l,andZy/2 = 1.96. The required sample size is 16, 


£a/2F 2 (1.96)1 i 
= = = 15.37 
7 ( E ) 0.5 a 


and because n must be an integer, the required sample size is 
n= 16. 


Notice the general relationship between sample size, desired length of the confidence 
interval 2E, confidence level 100(1 — a), and standard deviation o: 


e As the desired length of the interval 2E decreases, the required sample size n increases 
for a fixed value of o and specified confidence. 


e Aso increases, the required sample size n increases for a fixed desired length 2E and 


specified confidence. 


e As the level of confidence increases, the required sample size n increases for fixed 
desired length 2F and standard deviation o. 


8-1.3 One-Sided Confidence Bounds 


The confidence interval in Equation 8-5 gives both a lower confidence bound and an upper 
confidence bound for jt. Thus, it provides a two-sided CI. It is also possible to obtain one-sided 
confidence bounds for «ws by setting either / = —9 or u = © and replacing Z,/2 by z,. 


One-Sided 
Confidence 
Bounds on the 
Mean, Variance 
Known 


A 100(1 — a)% upper-confidence bound for py is 
psu =x+z,0/Vn 
and a 100(1 — «)% lower-confidence bound for ju is 


%—2,9/Vn —1= p 


(8-7) 


(8-8) 


EXAMPLE 8-3 One-Sided Confidence Bound 

The same data for impact testing from Example 8-1 are used 
to construct a lower, one-sided 95% confidence interval for 
the mean impact energy. Recall that x = 64.46, 0 = LJ, and 
n = 10. The interval is 


ae en SS 
Bal 

1 
4.46 —-164—— < 
eg 


Practical Interpretation: The lower limit for the two-sided in- 
terval in Example 8-1 was 63.84. Because z, < Z,y,2, the lower 
limit of a one-sided interval is always greater than the lower 
limit of a two-sided interval of equal confidence. The one- 
sided interval does not bound p from above so that it still 
achieves 95% confidence with a slightly greater lower limit. If 
our interest is only in the lower limit for 2, then the one-sided 
interval is preferred because it provides equal confidence with 
a greater lower limit. Similarly, a one-sided upper limit is 
always less than a two-sided upper limit of equal confidence. 
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8-1.4 General Method to Derive a Confidence Interval 


It is easy to give a general method for finding a confidence interval for an unknown 
parameter 0. Let X,, X>,..., X,, be a random sample of n observations. Suppose we can find a 
Statistic g(X,, X>,..., X,,3 8) with the following properties: 


1. g(X,, X,..., X,,; 8) depends on both the sample and 80. 


2. The probability distribution of g(X), Xz,..., X,; 8) does not depend on 6 or any other 
unknown parameter. 


In the case considered in this section, the parameter 8 = w. The random variable g(X,, Xo,..., 
X,; &) = (X — p)/(o/Vn) and satisfies both conditions above; it depends on the sample and 
on pt, and it has a standard normal distribution since o is known. Now one must find constants 
C, and Cy so that 


P(C, = g(X, Xp, ey Xns 9) = Cy] = 1a (8-9) 


Because of property 2, C, and Cy do not depend on @. In our example, Cy = —Zq/2 
and Cy = Z,q,2. Finally, you must manipulate the inequalities in the probability statement so that 


PLLOG, Mo... X,) 30 = UK, Mo, X)] = 1 = (8-10) 


This gives L(X,, X>,..., X,) and U(X), X>,..., X,,) as the lower and upper confidence limits 
defining the 100(1 — «)% confidence interval for 6. The quantity g(X,, Xp, ..., X,,; 9) is often 
called a “pivotal quantity” because we pivot on this quantity in Equation 8-9 to produce 
Equation 8-10. In our example, we manipulated the pivotal quantity (X — w)/(o/Vn) to 


obtain L(X), Xz, ..., Xp) = X — Zqy20/Vn and U(X, Xo, ..., Xp) = X + Zqyn0/Vn. 


8-1.5 Large-Sample Confidence Interval for 1 


Large-Sample 
Confidence 
Interval on 

the Mean 


We have assumed that the population distribution is normal with unknown mean and known stan- 
dard deviation o. We now present a large-sample CI for jx that does not require these assumptions. 
Let X,, X>,..., X,, be a random sample from a population with unknown mean p. and variance 0. 
Now if the sample size n is large, the central limit theorem implies that X has approximately a nor- 
mal distribution with mean w and variance o”/n. Therefore, Z = (X — )/(o/V/n) has approxi- 
mately a standard normal distribution. This ratio could be used as a pivotal quantity and manipu- 
lated as in Section 8-1.1 to produce an approximate CI for jz. However, the standard deviation o is 
unknown. It turns out that when n is large, replacing o by the sample standard deviation S has lit- 
tle effect on the distribution of Z. This leads to the following useful result. 


When zn is large, the quantity 
X= ih 
S/Vn 


has an approximate standard normal distribution. Consequently, 


= S ee, S 
TED dt Is CO a (8-11) 


is a large sample confidence interval for j1, with confidence level of approximately 
10001 — a)%. 
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Equation 8-11 holds regardless of the shape of the population distribution. Generally n should 
be at least 40 to use this result reliably. The central limit theorem generally holds for n = 30, 
but the larger sample size is recommended here because replacing o by S in Z results in addi- 


tional variability. 


EXAMPLE 8-4 Mercury Contamination 

An article in the 1993 volume of the Transactions of the 
American Fisheries Society reports the results of a study to in- 
vestigate the mercury contamination in largemouth bass. A 


sample of fish was selected from 53 Florida lakes, and mer- 
cury concentration in the muscle tissue, was measured (ppm). 
The mercury concentration values are 


1.230 0.490 0.490 1.080 0.590 0.280 0.180 0.100 0.940 
1.330 0.190 1.160 0.980 0.340 0.340 0.190 0.210 0.400 
0.040 0.830 0.050 0.630 0.340 0.750 0.040 0.860 0.430 
0.044 0.810 0.150 0.560 0.840 0.870 0.490 0.520 0.250 
1.200 0.710 0.190 0.410 0.500 0.560 1.100 0.650 0.270 
0.270 0.500 0.770 0.730 0.340 0.170 0.160 0.270 
The summary statistics from Minitab are displayed below: 
Descriptive Statistics: Concentration 
Variable N Mean Median TrMean StDev SE Mean 
Concentration 53 0.5250 0.4900 0.5094 0.3486 0.0479 
Variable Minimum Maximum Ql Q3 
Concentration 0.0400 1.3300 0.2300 0.7900 
Figure 8-3(a) and (b) presents the histogram and normal prob- BA: Se 2 ye pk a 
ability plot of the mercury concentration data. Both plots indi- =e Vn dese, eS Vn 
cate that the distribution of mercury concentration is not nor- 0.3486 0.3486 
mal and is positively skewed. We want to find an approximate 0.5250 — 1.96 = p = 0.5250 + 1.96 


95% Cl on pw. Because n > 40, the assumption of normality is 
not necessary to use Equation 8-11. The required quantities 
are n = 53, X = 0.5250, s = 0.3486, and Zo 25 = 1.96. The 
approximate 95% CI on wp is 


Percentage 


Frequency 


OrRrNWHF ODN WO OO 


0.0 0.5 1.0 
Concentration 


(a) 


V53 


0.4311 S p S 0.6189 


V53 


Practical Interpretation: This interval is fairly wide because 
there is a lot of variability in the mercury concentration meas- 
urements. A larger sample size would have produced a shorter 
interval. 


0.5 


1.0 


Concentration 
(b) 


Figure 8-3 Mercury concentration in largemouth bass. (a) Histogram. (b) Normal probability plot. 
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Large-Sample Confidence Interval for a Parameter 

The large-sample confidence interval for jz in Equation 8-11 is a special case of a more general 
result. Suppose that 0 is a parameter of a probability distribution, and let © be an estimator of 
6. If O (1) has an approximate normal distribution, (2) is approximately unbiased for 0, and 
(3) has standard deviation o@ that can be estimated from the sample data, then the quantity 
(O — 6)/o@ has an approximate standard normal distribution. Then a large-sample approximate 
CI for @ is given by 


O20, 0 = 0 ing (8-12) 


Maximum likelihood estimators usually satisfy the three conditions listed above, so Equation 
8-12 is often used when © is the maximum likelihood estimator of 0. Finally, note that 
Equation 8-12 can be used even when o¢@ is a function of other unknown parameters (or of 8). 
Essentially, all one does is to use the sample data to compute estimates of the unknown 
parameters and substitute those estimates into the expression for o@. 


EXERCISES FOR SECTION 8-1 


8-1. For a normal population with known variance o”, 

answer the following questions: 

(a) What is the confidence level for the interval X — 2.140/Vn 
Spsx4+2.140/Vn? 

(b) What is the confidence level for the interval X — 2.490/Vn 
Sp sxX4+ 2.490/Vn? 

(c) What is the confidence level for the interval X — 1.850/Vn 
<p <=x+ 1.850/Vn? 

8-2. Fora normal population with known variance o”: 

(a) What value of z,,2 in Equation 8-5 gives 98% confidence? 

(b) What value of z,./2 in Equation 8-5 gives 80% confidence? 

(c) What value of 2/2 in Equation 8-5 gives 75% confidence? 


8-3. Consider the one-sided confidence interval expressions 
for a mean of a normal population. 

(a) What value of z, would result in a 90% CI? 

(b) What value of z, would result in a 95% CI? 

(c) What value of z, would result in a 99% CI? 


8-4. A confidence interval estimate is desired for the gain in 
a circuit on a semiconductor device. Assume that gain is nor- 
mally distributed with standard deviation o = 20. 

(a) Find a 95% CI for when n = 10 and x = 1000. 

(b) Find a 95% CI for p when n = 25 and x = 1000. 

(c) Find a 99% CI for w when n = 10 and x = 1000. 

(d) Find a 99% CI for when n = 25 and x = 1000. 

(e) How does the length of the CIs computed above change 
with the changes in sample size and confidence level? 
8-5. A random sample has been taken from a normal distri- 
bution and the following confidence intervals constructed us- 
ing the same data: (38.02, 61.98) and (39.95, 60.05) 

(a) What is the value of the sample mean? 


(b) One of these intervals is a 95% CI and the other is a 90% 
CI. Which one is the 95% CI and why? 


8-6. A random sample has been taken from a normal distri- 

bution and the following confidence intervals constructed us- 

ing the same data: (37.53, 49.87) and (35.59, 51.81) 

(a) What is the value of the sample mean? 

(b) One of these intervals is a 99% CI and the other is a 95% 
CI. Which one is the 95% CI and why? 


8-7. Consider the gain estimation problem in Exercise 8-4. 

(a) How large must n be if the length of the 95% Clis to be 40? 

(b) How large must n be if the length of the 99% CI is to 
be 40? 


8-8. Following are two confidence interval estimates of the 

mean p of the cycles to failure of an automotive door latch 

mechanism (the test was conducted at an elevated stress level 

to accelerate the failure). 

3124.9 = p = 3215.7 3110.5 S p S 3230.1 

(a) What is the value of the sample mean cycles to failure? 

(b) The confidence level for one of these Cls is 95% and the 
confidence level for the other is 99%. Both Cls are calcu- 
lated from the same sample data. Which is the 95% CI? 
Explain why. 

8-9. Suppose that n = 100 random samples of water from a 

freshwater lake were taken and the calcium concentration 

(milligrams per liter) measured. A 95% CI on the mean cal- 

cium concentration is 0.49 = pw = 0.82. 

(a) Would a 99% CI calculated from the same sample data be 
longer or shorter? 
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(b) Consider the following statement: There is a 95% chance 
that is between 0.49 and 0.82. Is this statement correct? 
Explain your answer. 

(c) Consider the following statement: If n = 100 random 
samples of water from the lake were taken and the 95% CI 
on wt computed, and this process were repeated 1000 
times, 950 of the CIs would contain the true value of w. Is 
this statement correct? Explain your answer. 


8-10. Past experience has indicated that the breaking 
strength of yarn used in manufacturing drapery material is 
normally distributed and that o = 2 psi. A random sample of 
nine specimens is tested, and the average breaking strength is 
found to be 98 psi. Find a 95% two-sided confidence interval 
on the true mean breaking strength. 


8-11. The yield of a chemical process is being studied. From 
previous experience, yield is known to be normally distributed 
and o = 3. The past five days of plant operation have resulted in 
the following percent yields: 91.6, 88.75, 90.8, 89.95, and 91.3. 
Find a 95% two-sided confidence interval on the true mean yield. 


8-12. The diameter of holes for a cable harness is known to 
have a normal distribution with o = 0.01 inch. A random sample 
of size 10 yields an average diameter of 1.5045 inch. Find a 99% 
two-sided confidence interval on the mean hole diameter. 


8-13. A manufacturer produces piston rings for an auto- 

mobile engine. It is known that ring diameter is normally dis- 

tributed with o = 0.001 millimeters. A random sample of 15 

rings has a mean diameter of x = 74.036 millimeters. 

(a) Construct a 99% two-sided confidence interval on the 
mean piston ring diameter. 

(b) Construct a 99% lower-confidence bound on the mean 
piston ring diameter. Compare the lower bound of this 
confidence interval with the one in part (a). 


8-14. The life in hours of a 75-watt light bulb is known to be 

normally distributed with o = 25 hours. A random sample of 

20 bulbs has a mean life of x = 1014 hours. 

(a) Construct a 95% two-sided confidence interval on the 
mean life. 

(b) Construct a 95% lower-confidence bound on the mean life. 
Compare the lower bound of this confidence interval with 
the one in part (a). 


8-15. Acivil engineer is analyzing the compressive strength 
of concrete. Compressive strength is normally distributed with 
o* = 1000(psi)’. A random sample of 12 specimens has a 
mean compressive strength of x = 3250 psi. 


(a) Construct a 95% two-sided confidence interval on mean 
compressive strength. 

(b) Construct a 99% two-sided confidence interval on mean 
compressive strength. Compare the width of this confi- 
dence interval with the width of the one found in part (a). 

8-16. Suppose that in Exercise 8-14 we wanted the error in 

estimating the mean life from the two-sided confidence inter- 

val to be five hours at 95% confidence. What sample size 
should be used? 

8-17. Suppose that in Exercise 8-14 we wanted the total 

width of the two-sided confidence interval on mean life to be 

six hours at 95% confidence. What sample size should be used? 

8-18. Suppose that in Exercise 8-15 it is desired to estimate 

the compressive strength with an error that is less than 15 psi 

at 99% confidence. What sample size is required? 

8-19. By how much must the sample size n be increased if 

the length of the CI on pw in Equation 8-5 is to be halved? 

8-20. If the sample size n is doubled, by how much is the 

length of the CI on w in Equation 8-5 reduced? What happens 

to the length of the interval if the sample size is increased by a 

factor of four? 

8-21. An article in the Journal of Agricultural Science [“The 

Use of Residual Maximum Likelihood to Model Grain Quality 

Characteristics of Wheat with Variety, Climatic and Nitrogen 

Fertilizer Effects” (1997, Vol. 128, pp. 135—142)] investigated 

means of wheat grain crude protein content (CP) and Hagberg 

falling number (HFN) surveyed in the UK. The analysis used a 

variety of nitrogen fertilizer applications (kg N/ha), tempera- 

ture (°C), and total monthly rainfall (mm). The data shown be- 
low describe temperatures for wheat grown at Harper Adams 

Agricultural College between 1982 and 1993. The tempera- 

tures measured in June were obtained as follows: 


15.2 14.2 14.0 12.2 14.4 12.5 
14.3 14.2 13.5 11.8 15.2 


Assume that the standard deviation is known to be o = 0.5. 

(a) Construct a 99% two-sided confidence interval on the 
mean temperature. 

(b) Construct a 95% lower-confidence bound on the mean 
temperature. 

(c) Suppose that we wanted to be 95% confident that the error 
in estimating the mean temperature is less than 2 degrees 
Celsius. What sample size should be used? 

(d) Suppose that we wanted the total width of the two-sided con- 
fidence interval on mean temperature to be 1.5 degrees 
Celsius at 95% confidence. What sample size should be used? 


8-2 CONFIDENCE INTERVAL ON THE MEAN OF A NORMAL 
DISTRIBUTION, VARIANCE UNKNOWN 


When we are constructing confidence intervals on the mean wp of a normal population when 
o° is known, we can use the procedure in Section 8-1.1. This CI is also approximately valid 
(because of the central limit theorem) regardless of whether or not the underlying population 
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is normal, so long as n is reasonably large (n = 40, say). As noted in Section 8-1.5, we can 
even handle the case of unknown variance for the large-sample-size situation. However, when 
the sample is small and o? is unknown, we must make an assumption about the form of the un- 
derlying distribution to obtain a valid CI procedure. A reasonable assumption in many cases is 
that the underlying distribution is normal. 

Many populations encountered in practice are well approximated by the normal distribu- 
tion, so this assumption will lead to confidence interval procedures of wide applicability. In 
fact, moderate departure from normality will have little effect on validity. When the assump- 
tion is unreasonable, an alternative is to use nonparametric statistical procedures that are valid 
for any underlying distribution. 

Suppose that the population of interest has a normal distribution with unknown mean ju 
and unknown variance o”. Assume that a random sample of size n, say, X,, X>,..., X,, is avail- 
able, and let X and S” be the sample mean and variance, respectively. 

We wish to construct a two-sided CI on w. If the variance o” is known, we know that 
Z = (X — p)/(o/Vn) has a standard normal distribution. When o? is unknown, a logical pro- 
cedure is to replace o with the sample standard deviation S. The random variable Z now 
becomes T = (X — p)/(S/V/n). A logical question is, what effect does replacing o by S have on 
the distribution of the random variable T? If n is large, the answer to this question is “very little,” 
and we can proceed to use the confidence interval based on the normal distribution from 
Section 8-1.5. However, n is usually small in most engineering problems, and in this situation 
a different distribution must be employed to construct the CI. 


t Distribution 


t Distribution 


Let X,, X>,..., X, be a random sample from a normal distribution with unknown 
mean and unknown variance o*. The random variable 


paral! (8-13) 
— «S/Va : 
has a ¢ distribution with n — 1 degrees of freedom. 
The ¢ probability density function is 
T[(k + 1)/2 1 
( 72 . ocx <0 (8-14) 


fx) = VakE (k/2) [G2/k) + 192 


where k is the number of degrees of freedom. The mean and variance of the f distribution are 
zero and k/(k — 2) (for k > 2), respectively. 

Several ¢ distributions are shown in Fig. 8-4. The general appearance of the ¢ distribution is 
similar to the standard normal distribution in that both distributions are symmetric and 
unimodal, and the maximum ordinate value is reached when the mean pw = 0. However, the ¢ 
distribution has heavier tails than the normal; that is, it has more probability in the tails than the 
normal distribution. As the number of degrees of freedom k — ~, the limiting form of the ¢ dis- 
tribution is the standard normal distribution. Generally, the number of degrees of freedom for ¢ 
is the number of degrees of freedom associated with the estimated standard deviation. 

Appendix Table V provides percentage points of the f distribution. We will let t, , be the 
value of the random variable T with k degrees of freedom above which we find an area 
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k=10 


k = [N (0, 1)] 


a a 
) x ti-ak =—tak O lok t 
Figure 8-4 Probability density functions of several t Figure 8-5 Percentage points of the t 
distributions. distribution. 


(or probability) a. Thus, 7, ;, is an upper-tail 100a percentage point of the ¢ distribution with k 
degrees of freedom. This percentage point is shown in Fig. 8-5. In the Appendix Table V the 
a values are the column headings, and the degrees of freedom are listed in the left column. To 
illustrate the use of the table, note that the value with 10 degrees of freedom having an area 
of 0.05 to the right is fo,95,19 = 1.812. That is, 


P(Ti9 > 10,05,10) = P(Ti0 = 1.812) = 0.05 


Since the f distribution is symmetric about zero, we have t)_ 4, = —t,n; that is, the t-value hav- 
ing an area of | — a to the right (and therefore an area of « to the left) is equal to the nega- 
tive of the f-value that has area a in the right tail of the distribution. Therefore, fo95 19 = 
—to05,10 = —1.812. Finally, because ¢,... is the standard normal distribution, the familiar z, 
values appear in the last row of Appendix Table V. 


8-2.2 t Confidence Interval on p 


It is easy to find a 100(1 — a) percent confidence interval on the mean of a normal distribu- 
tion with unknown variance by proceeding essentially as we did in Section 8-1.1. We know 
that the distribution of T= (X — »)/(S/Vn) is t with n — 1 degrees of freedom. Letting 
ty2n-1 be the upper 1000/2 percentage point of the ¢ distribution with n — 1 degrees of 
freedom, we may write 


Pl =tafigi STs to/2,n-1) =l-a 


or 


Xp 
P\ —toyon 1 Va foe =l-a 


Rearranging this last equation yields 


P(X - tejrn—1S/Vn Syups X+ ta/2.n-15/ Vn) =l-a (8-15) 


This leads to the following definition of the 100(1 — a)% two-sided confidence inter- 
val on p. 
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Confidence 
Interval on the 
Mean, Variance 


Unknown given by 


degrees of freedom. 


If x and s are the mean and standard deviation of a random sample from a normal 
distribution with unknown variance o”, a 100(1 — «)% confidence interval on yp is 


x= lap ais) = ye Si toj2n—15/ Vn 


where fy/2,,—1 18 the upper 1000/2 percentage point of the f distribution with n — 1 


(8-16) 


One-sided confidence bounds on the mean of a normal distribution are also of interest 
and are easy to find. Simply use only the appropriate lower or upper confidence limit from 
Equation 8-16 and replace fy/2,,—1 by ton—1- 


EXAMPLE 8-5 Alloy Adhesion 

An article in the journal Materials Engineering (1989, Vol. II, 
No. 4, pp. 275-281) describes the results of tensile adhesion 
tests on 22 U-700 alloy specimens. The load at specimen 
failure is as follows (in megapascals): 


19.8 10.1 14.9 1:3 15.4 15.4 
15.4 18.5 7.9 12.7 11.9 11.4 
11.4 14.1 17.6 16.7 15.8 
19.5 8.8 13.6 11.9 11.4 


The sample mean is x = 13.71, and the sample standard devi- 
ation is s = 3.55. Figures 8-6 and 8-7 show a box plot and 
a normal probability plot of the tensile adhesion test data, 
respectively. These displays provide good support for the 


assumption that the population is normally distributed. We want 
to find a 95% CI on wp. Since n = 22, we have n — 1 = 21 de- 
grees of freedom for ¢, 80 fo925,21 = 2.080. The resulting CI is 


GS tejrn—18/Vn = ww i tayrn—1s/Vn 
13.71 — 2.080(3.55)/V22 S p S 13.71 + 2.080(3.55)/V22 
13.71 — 1.57 Sp S 13.71 + 1.57 
12.14 <p S 15.28 
Practical Interpretation: The CI is fairly wide because there is a 


lot of variability in the tensile adhesion test measurements. A 
larger sample size would have led to a shorter interval. 


It is not as easy to select a sample size n to obtain a specified length (or precision of estima- 
tion) for this CI as it was in the known-o case, because the length of the interval involves s (which 
is unknown before the data are collected), n, and ty/7,,—. Note that the t-percentile depends on the 
sample size n. Consequently, an appropriate n can only be obtained through trial and error. The re- 
sults of this will, of course, also depend on the reliability of our prior “guess” for o. 


20.5 
18.0 | 
oO 
3 
= 15.5 
8 
g 130 
oa 
10.5 
8.0 


Figure 8-6 Box and whisker plot for the 
load at failure data in Example 8-5. 


Normal probability plot 


Percent 
fon) 
oO 


5 10 15 20 25 
Load at failure 


Figure 8-7 Normal probability 
plot of the load at failure data from 
Example 8-5. 
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EXERCISES FOR SECTION 8-2 


8-22. Find the values of the following percentiles: f9 995 15, 
10.05,10> £0.10,20> 40,005,257 2d fo 091,30- 

8-23. Determine the f-percentile that is required to construct 
each of the following two-sided confidence intervals: 

(a) Confidence level = 95%, degrees of freedom = 12 

(b) Confidence level = 95%, degrees of freedom = 24 

(c) Confidence level = 99%, degrees of freedom = 13 

(d) Confidence level = 99.9%, degrees of freedom = 15 


8-24. Determine the t-percentile that is required to construct 
each of the following one-sided confidence intervals: 

(a) Confidence level = 95%, degrees of freedom = 14 

(b) Confidence level = 99%, degrees of freedom = 19 

(c) Confidence level = 99.9%, degrees of freedom = 24 


8-25. A random sample has been taken from a normal dis- 
tribution. Output from a software package is given below: 


Variable N Mean SEMean _ StDev Variance Sum 
x 10 Wa 0.507 1.605 ? 251.848 


(a) Fill in the missing quantities. 
(b) Find a 95% CI on the population mean. 


8-26. A random sample has been taken from a normal dis- 
tribution. Output from a software package is given below: 


Variable N Mean SEMean_ StDev Variance Sum 
x ? Wa 1.58 6.11 ? 751.40 


(a) Fill in the missing quantities. 
(b) Find a 95% CI on the population mean. 


8-27. A research engineer for a tire manufacturer is investi- 
gating tire life for a new rubber compound and has built 16 tires 
and tested them to end-of-life in a road test. The sample mean 
and standard deviation are 60,139.7 and 3645.94 kilometers. 
Find a 95% confidence interval on mean tire life. 


8-28. An Izod impact test was performed on 20 specimens of 
PVC pipe. The sample mean is x = 1.25 and the sample stan- 
dard deviation is s = 0.25. Find a 99% lower confidence 
bound on Izod impact strength. 


8-29. A postmix beverage machine is adjusted to release a 
certain amount of syrup into a chamber where it is mixed with 
carbonated water. A random sample of 25 beverages was 
found to have a mean syrup content of x = 1.10 fluid ounce 
and a standard deviation of s = 0.015 fluid ounce. Find a 95% 
CI on the mean volume of syrup dispensed. 


8-30. An article in Medicine and Science in Sports and 
Exercise [“Maximal Leg-Strength Training Improves Cycling 
Economy in Previously Untrained Men” (2005, Vol. 37, 
pp. 131-136)] studied cycling performance before and after 
eight weeks of leg-strength training. Seven previously untrained 
males performed leg-strength training three days per week for 
eight weeks (with four sets of five replications at 85% of one 
repetition maximum). Peak power during incremental cycling 


increased to a mean of 315 watts with a standard deviation of 
16 watts. Construct a 95% confidence interval for the mean 
peak power after training. 

8-31. An article in Obesity Research [“Impaired Pressure 
Natriuresis in Obese Youths” (2003, Vol. 11, pp. 745-751)] de- 
scribed a study in which all meals were provided for 14 lean boys 
for three days followed by one stress (with a video-game task). 
The average systolic blood pressure (SBP) during the test was 
118.3 mm HG with a standard deviation of 9.9 mm HG. Construct 
a 99% one-sided upper confidence interval for mean SBP. 
8-32. An article in the Journal of Composite Materials 
(December 1989, Vol. 23, p. 1200) describes the effect of delam- 
ination on the natural frequency of beams made from composite 
laminates. Five such delaminated beams were subjected to 
loads, and the resulting frequencies were as follows (in hertz): 


230.66, 233.05, 232.58, 229.48, 232.58 


Check the assumption of normality in the population. Calculate 
a 90% two-sided confidence interval on mean natural frequency. 
8-33. The Bureau of Meteorology of the Australian 
Government provided the mean annual rainfall (in milli- 
meters) in Australia 1983-2002 as follows (http://www.bom.gov. 
au/climate/change/rainO3.txt): 


499.2, 555.2, 398.8, 391.9, 453.4, 459.8, 483.7, 417.6, 469.2, 
452.4, 499.3, 340.6, 522.8, 469.9, 527.2, 565.5, 584.1, 727.3, 
558.6, 338.6 


Check the assumption of normality in the population. Construct 
a 95% confidence interval for the mean annual rainfall. 

8-34. The solar energy consumed (in trillion BTU) in the 
US. by year from 1989 to 2004 (source: U.S. Department of 
Energy Web site, http://www.eia.doe.gov/emeu) is shown in 
the table below. Read down, then right for year. 


55.291 66.458 70.237 65.454 
59.718 68.548 69.787 64.391 
62.688 69.857 68.793 63.62 

63.886 70.833 66.388 63.287 


Check the assumption of normality in the population. Construct 
a 95% confidence interval for the mean solar energy consumed. 
8-35. The brightness of a television picture tube can be eval- 
uated by measuring the amount of current required to achieve a 
particular brightness level. A sample of 10 tubes results in 
x = 317.2 and s = 15.7. Find (in microamps) a 99% confi- 
dence interval on mean current required. State any necessary 
assumptions about the underlying distribution of the data. 
8-36. A particular brand of diet margarine was analyzed to 
determine the level of polyunsaturated fatty acid (in percent- 
ages). A sample of six packages resulted in the following data: 
16.8, 17.2, 17.4, 16.9, 16.5, 17.1. 

(a) Check the assumption that the level of polyunsaturated 

fatty acid is normally distributed. 


266 CHAPTER 8 STATISTICAL INTERVALS FOR A SINGLE SAMPLE 


(b) Calculate a 99% confidence interval on the mean p. 
Provide a practical interpretation of this interval. 

(c) Calculate a 99% lower confidence bound on the mean. 
Compare this bound with the lower bound of the two-sided 
confidence interval and discuss why they are different. 

8-37. The compressive strength of concrete is being tested 

by a civil engineer. He tests 12 specimens and obtains the 

following data. 


2216 2237 2249 2204 
2225 2301 2281 2263 
2318 2255 2275 2295 


(a) Check the assumption that compressive strength is normally 
distributed. Include a graphical display in your answer. 

(b) Construct a 95% two-sided confidence interval on the 
mean strength. 

(c) Construct a 95% lower confidence bound on the mean 
strength. Compare this bound with the lower bound of the 
two-sided confidence interval and discuss why they are 
different. 

8-38. A machine produces metal rods used in an automobile 

suspension system. A random sample of 15 rods is selected, 

and the diameter is measured. The resulting data (in millime- 
ters) are as follows: 


8.24 8.25 8.20 8.23 8.24 
8.21 8.26 8.26 8.20 8.25 
8.23 8.23 8.19 8.28 8.24 


(a) Check the assumption of normality for rod diameter. 

(b) Calculate a 95% two-sided confidence interval on mean 
rod diameter. 

(c) Calculate a 95% upper confidence bound on the mean. 
Compare this bound with the upper bound of the two-sided 
confidence interval and discuss why they are different. 


8-39. An article in Computers & Electrical Engineering 
[Parallel Simulation of Cellular Neural Networks” (1996, Vol. 22, 
pp. 61—84)] considered the speed-up of cellular neural networks 
(CNN) for a parallel general-purpose computing architecture 
based on six transputers in different areas. The data follow: 


3.775302 3.350679 = 4.217981 = 4.030324 = 4.639692 
4.139665 4.395575 = 4.824257 = 4.268119 = 4.584193 
4.930027 4.315973 4.600101 


(a) Is there evidence to support the assumption that speed-up 
of CNN is normally distributed? Include a graphical dis- 
play in your answer. 

(b) Construct a 95% two-sided confidence interval on the 
mean speed-up. 

(c) Construct a 95% lower confidence bound on the mean 
speed-up. 

8-40. The wall thickness of 25 glass 2-liter bottles was mea- 

sured by a quality-control engineer. The sample mean was 

x = 4.05 millimeters, and the sample standard deviation was 

s = 0.08 millimeter. Find a 95% lower confidence bound for 

mean wall thickness. Interpret the interval you have obtained. 

8-41. An article in Nuclear Engineering International 

(February 1988, p. 33) describes several characteristics of fuel 

rods used in a reactor owned by an electric utility in Norway. 

Measurements on the percentage of enrichment of 12 rods 

were reported as follows: 


2.94 3.00 2.90 2.75 3.00 2.95 
2.90 2.75 2.95 2.82 2.81 3.05 


(a) Use a normal probability plot to check the normality as- 
sumption. 

(b) Find a 99% two-sided confidence interval on the mean per- 
centage of enrichment. Are you comfortable with the state- 
ment that the mean percentage of enrichment is 2.95%? 
Why? 


8-3 CONFIDENCE INTERVAL ON THE VARIANCE AND 
STANDARD DEVIATION OF A NORMAL DISTRIBUTION 


Sometimes confidence intervals on the population variance or standard deviation are needed. 
When the population is modeled by a normal distribution, the tests and intervals described in 
this section are applicable. The following result provides the basis of constructing these con- 


fidence intervals. 


a Distribution 
ep xXGe XG. ee 


X,, be a random sample from a normal distribution with mean w and 
variance a”, and let S* be the sample variance. Then the random variable 


has a chi-square (°) distribution with n — 1 degrees of freedom. 


= yee 
ore aes (8-17) 
oO 
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Figure 8-8 Proba- 
bility density functions 
of several x? distribu- 
tions. 


f(x) 


The probability density function of a x’ random variable is 


1 ape 
f(x) = Prey le a x>0 (8-18) 


where k is the number of degrees of freedom. The mean and variance of the yx? distribution are 
k and 2k, respectively. Several chi-square distributions are shown in Fig. 8-8. Note that the 
chi-square random variable is nonnegative and that the probability distribution is skewed to 
the right. However, as k increases, the distribution becomes more symmetric. As k > ™, the 
limiting form of the chi-square distribution is the normal distribution. 

The percentage points of the x? distribution are given in Table IV of the Appendix. 
Define Xk as the percentage point or value of the chi-square random variable with k degrees 
of freedom such that the probability that X* exceeds this value is «. That is, 


00 


PO? > x2) = | Pd du = a 
Xink 
This probability is shown as the shaded area in Fig. 8-9(a). To illustrate the use of Table IV, 


note that the areas a are the column headings and the degrees of freedom k are given in the left 
column. Therefore, the value with 10 degrees of freedom having an area (probability) of 0.05 


f(x) f(x) 


a 0.05 0.05 


0) Xek ~ 0 Xéo5, 10 X6.05, 10 
= 3.94 = 18.31 


(a) (b) 


Figure 8-9 Percentage point of the x’ distribution. (a) The percentage point Xovke (b) The upper 
percentage point X6.95,19 = 18.31 and the lower percentage point X595.10 = 3.94. 
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Confidence 
Interval on 
the Variance 


One-Sided 
Confidence 
Bounds on 
the Variance 


to the right is X6.05,10 = 18.31.This value is often called an upper 5% point of chi-square with 
10 degrees of freedom. We may write this as a probability statement as follows: 


P(X? > xG05,10) = P(X* > 18.31) = 0.05 


Conversely, a lower 5% point of chi-square with 10 degrees of freedom would be 4.95.10 = 
3.94 (from Appendix A). Both of these percentage points are shown in Figure 8-9(b). 
The construction of the 100(1 — «)% CI for o? is straightforward. Because 
(n — 1)S? 


Xx? = —_ 
co 


is chi-square with n — 1 degrees of freedom, we may write 


2 2 2 = 
P(Xi-a/2n-1 =X" = Xo/ann) = 1-2 
so that 


(n — 1)S? 


2 2 = 
A tactigs = co = ant) =l-a 


This last equation can be rearranged as 
= 2 _ 2 
‘ (n — 1)S mete (n — 1)S ete 
2 2 
Xa/2,n-1 X1-a/2,n-1 


This leads to the following definition of the confidence interval for 0”. 


If s* is the sample variance from a random sample of n observations from a normal dis- 
tribution with unknown variance o7, then a 100(1 — «)% confidence interval on o7 is 


(n — 1)s? (n — 1)s? 


D 
= = 
2 SS 
Xo/2.n-1 X1-a/2.n-1 


(8-19) 


where x~ /2.n-1 and ees are the upper and lower 100a/2 percentage points of 
the chi-square distribution with n — 1 degrees of freedom, respectively. A confidence 
interval for o has lower and upper limits that are the square roots of the correspond- 
ing limits in Equation 8-19. 


It is also possible to find a 100(1 — «)% lower confidence bound or upper confidence bound on o”. 


The 100(1 — «)% lower and upper confidence bounds on o” are 


(n — 1)s? Ss 


2 
Xa,n-1 


2 (n — 1)s? 


5} (8-20) 
XI —a,n—-1 


respectively. 
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EXAMPLE 8-6 Detergent Filling 

An automatic filling machine is used to fill bottles with liquid 
detergent. A random sample of 20 bottles results in a sample 
variance of fill volume of s? = 0.0153 (fluid ounce)”. If the 
variance of fill volume is too large, an unacceptable proportion 
of bottles will be under- or overfilled. We will assume that the 
fill volume is approximately normally distributed. A 95% up- 
per confidence bound is found from Equation 8-26 as follows: 


— 1) 2 
ee (n : 1)s 
X0.95,19 
or 
> (19)0.0153 , 
= 40117. 0.0287 (fluid ounce) 
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This last expression may be converted into a confidence inter- 
val on the standard deviation o by taking the square root of 
both sides, resulting in 


o =0.17 


Practical Interpretation: Therefore, at the 95% level of confi- 
dence, the data indicate that the process standard deviation 
could be as large as 0.17 fluid ounce. The process engineer or 
manager now needs to determine if a standard deviation this 
large could lead to an operational problem with under-or over 
filled bottles. 


8-42. 

X6.05,10» Xb.005,15+ X0.01,12 X0.95,200 X6.99,18» X4,995,16 and X‘,00525+ 

8-43. Determine the yx’ percentile that is required to 

construct each of the following CIs: 

(a) Confidence level = 95%, degrees of freedom = 24, 
one-sided (upper) 

(b) Confidence level = 99%, degrees of freedom = 9, one- 
sided (lower) 

(c) Confidence level = 90%, degrees of freedom = 19, two- 
sided. 


8-44. A rivet is to be inserted into a hole. A random sample 
of n = 15 parts is selected, and the hole diameter is measured. 
The sample standard deviation of the hole diameter measure- 
ments is s = 0.008 millimeters. Construct a 99% lower confi- 
dence bound for o”. 


8-45. Consider the situation in Exercise 8-44. Find a 99% 
lower confidence bound on the standard deviation. 


Determine the values of the following percentiles: 


8-46. The sugar content of the syrup in canned peaches is 
normally distributed. A random sample of n = 10 cans yields 
a sample standard deviation of s = 4.8 milligrams. Calculate 
a 95% two-sided confidence interval for o. 


8-47. The percentage of titanium in an alloy used in aero- 
space castings is measured in 51 randomly selected parts. The 
sample standard deviation is s = 0.37. Construct a 95% two- 
sided confidence interval for o. 


8-48. An article in Medicine and Science in Sports and 
Exercise [“Electrostimulation Training Effects on the Physical 
Performance of Ice Hockey Players” (2005, Vol. 37, pp. 
455-—460)] considered the use of electromyostimulation 
(EMS) as a method to train healthy skeletal muscle. EMS ses- 
sions consisted of 30 contractions (4-second duration, 85 Hz) 
and were carried out three times per week for three weeks on 
17 ice hockey players. The 10-meter skating performance test 
showed a standard deviation of 0.09 seconds. Construct a 95% 


confidence interval of the standard deviation of the skating 
performance test. 


8-49. An article in Urban Ecosystems, “Urbanization and 
Warming of Phoenix (Arizona, USA): Impacts, Feedbacks and 
Mitigation” (2002, Vol. 6, pp. 183-203), mentions that Phoenix 
is ideal to study the effects of an urban heat island because it has 
grown from a population of 300,000 to approximately 3 million 
over the last 50 years and this is a period with a continuous, de- 
tailed climate record. The 50-year averages of the mean annual 
temperatures at eight sites in Phoenix are shown below. Check 
the assumption of normality in the population with a probabil- 
ity plot. Construct a 95% confidence interval for the standard 
deviation over the sites of the mean annual temperatures. 


Average Mean 


Site Temperature (°C) 
Sky Harbor Airport M33) 
Phoenix Greenway 21.7 
Phoenix Encanto 21.6 
Waddell 21.7 
Litchfield PALS} 
Laveen 20.7 
Maricopa 20.9 
Harlquahala 20.1 


8-50. An article in Cancer Research [“Analyses of Litter- 
Matched Time-to-Response Data, with Modifications for 
Recovery of Interlitter Information” (1977, Vol. 37, pp. 
3863-—3868)] tested the tumorigenesis of a drug. Rats were 
randomly selected from litters and given the drug. The times 
of tumor appearance were recorded as follows: 


101, 104, 104, 77, 89, 88, 104, 96, 82, 70, 89, 91, 39, 103, 93, 
85, 104, 104, 81, 67, 104, 104, 104, 87, 104, 89, 78, 104, 86, 
76, 103, 102, 80, 45, 94, 104, 104, 76, 80, 72, 73 
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Calculate a 95% confidence interval on the standard deviation 
of time until a tumor appearance. Check the assumption of 
normality of the population and comment on the assumptions 
for the confidence interval. 

8-51. Anarticle in Technometrics (1999, Vol. 41, pp. 202-211) 
studied the capability of a gauge by measuring the weight of pa- 
per. The data for repeated measurements of one sheet of paper are 
in the following table. Construct a 95% one-sided upper confi- 
dence interval for the standard deviation of these measurements. 
Check the assumption of normality of the data and comment on 
the assumptions for the confidence interval. 


Observations 
3.481 3.448 3.485 3.475 3.472 
3.477 3.472 3.464 3.472 3.470 
3.470 3.470 3.477 3.473 3.474 


8-52. An article in the Australian Journal of Agricultural 
Research [“Non-Starch Polysaccharides and _ Broiler 
Performance on Diets Containing Soyabean Meal as the 
Sole Protein Concentrate” (1993, Vol. 44, No. 8, pp. 
1483-1499)] determined that the essential amino acid 
(Lysine) composition level of soybean meals is as shown 
below (g/kg): 


22.2 24.7 20.9 26.0 27.0 
24.8 26.5 23.8 25.6 23.9 


(a) Construct a 99% two-sided confidence interval for 07. 
(b) Calculate a 99% lower confidence bound for 0”. 

(c) Calculate a 90% lower confidence bound for o. 

(d) Compare the intervals that you have computed. 


8-4 LARGE-SAMPLE CONFIDENCE INTERVAL FOR 


A POPULATION PROPORTION 


It is often necessary to construct confidence intervals on a population proportion. For exam- 
ple, suppose that a random sample of size n has been taken from a large (possibly infinite) 
population and that X(< n) observations in this sample belong to a class of interest. Then 
P=X/nisa point estimator of the proportion of the population p that belongs to this class. 
Note that n and p are the parameters of a binomial distribution. Furthermore, from Chapter 4 
we know that the sampling distribution of Pis approximately normal with mean p and vari- 
ance p(1 — p)/n, if p is not too close to either 0 or | and if 7 is relatively large. Typically, to 
apply this approximation we require that np and n(1 — p) be greater than or equal to 5. We 
will make use of the normal approximation in this section. 


Normal 
Approximation 
for a Binomial 
Proportion 


If n is large, the distribution of 


X — np 


Z 


~ Vap(l =p) (2 — p) 


is approximately standard normal. 


To construct the confidence interval on p, note that 


sO 


Approximate 
Confidence If p is the proportion of observations in a random sample of size n that belongs to a 
Interval ce class of interest, an approximate 100(1 — a«)% confidence interval on the proportion 
=e Einonal p of the population that belongs to this class is 
Proportion 
: P(I — p) . P(1 — BP) 
1H | es re ee) 
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This may be rearranged as 


rs [pQ — p - Ip] = p 
(Pex Ate ee Oe ies 


The quantity Vp(1 — p)/n in Equation 8-21 is called the standard error of the point estima- 
tor P. This was discussed in Chapter 7. Unfortunately, the upper and lower limits of the confi- 
dence interval obtained from Equation 8-21 contain the unknown parameter p. However, as 
suggested at the end of Section 8-1.5, a satisfactory solution is to replace p by P in the stan- 
dard error, which results in 


AU-B Bl — Py 
P\ P— Zuy2 SES pS PH 2572 A nr ag = 


(8-21) 


(8-22) 


This leads to the approximate 100(1 — «)% confidence interval on p. 


where Z,/ is the upper a /2 percentage point of the standard normal distribution. 


This procedure depends on the adequacy of the normal approximation to the binomial. 
To be reasonably conservative, this requires that np and n(1 — p) be greater than or equal 
to 5. In situations where this approximation is inappropriate, particularly in cases where n 
is small, other methods must be used. Tables of the binomial distribution could be used to 
obtain a confidence interval for p. However, we could also use numerical methods based on 
the binomial probability mass function that are implemented in computer programs. 


EXAMPLE 8-7 Crankshaft Bearings 


In arandom sample of 85 automobile engine crankshaft bear- 
ings, 10 have a surface finish that is rougher than the specifi- 
cations allow. Therefore, a point estimate of the proportion of 
bearings in the population that exceeds the roughness specifi- 
cation is p = x/n = 10/85 = 0.12. A 95% two-sided confi- 
dence interval for p is computed from Equation 8-23 as 


: [PCL — P) ' [PC — P) 

P— %025\/ yy =P =P + 0254) yr 

+96, (02088) 
; 85 


or 
0.12 — 1.96 <p<0.12 4 


0.12(0.88) 
85 


which simplifies to 
0.05 = p = 0.19 


Practical Interpretation: This is a wide CI. While the sample 
size does not appear to be small (n = 85), the value of p is 
fairly small, which leads to a large standard error for p con- 
tributing to the wide CI. 


272 


CHAPTER 8 STATISTICAL INTERVALS FOR A SINGLE SAMPLE 


Sample Size 
for a Specified 
Error on 

a Binomial 
Proportion 


Choice of Sample Size 
Since P is the point estimator of p, we can define the error in estimating p by P as 
E =|p — P|. Note that we are approximately 100(1 — «)% confident that this error is less 
than Z4/2 Vp(1 — p)/n. For instance, in Example 8-7, we are 95% confident that the sample 
proportion p = 0.12 differs from the true proportion p by an amount not exceeding 0.07. 

In situations where the sample size can be selected, we may choose n to be 100(1 — a)% 
confident that the error is less than some specified value E. If we set E = Zy. Vp(1 — p)/n 
and solve for n, the appropriate sample size is 


(8-24) 


An estimate of p is required to use Equation 8-24. If an estimate p from a previous sample 
is available, it can be substituted for p in Equation 8-24, or perhaps a subjective estimate can be 
made. If these alternatives are unsatisfactory, a preliminary sample can be taken, p computed, 
and then Equation 8-24 used to determine how many additional observations are required to es- 
timate p with the desired accuracy. Another approach to choosing n uses the fact that the sam- 
ple size from Equation 8-24 will always be a maximum for p = 0.5 [that is, p(1 — p) = 0.25 
with equality for p = 0.5], and this can be used to obtain an upper bound on n. In other words, 
we are at least 100(1 — «)% confident that the error in estimating p by # is less than E if the 
sample size is 


(8-25) 


EXAMPLE 8-8 Crankshaft Bearings 


Consider the situation in Example 8-7. How large a sample is 
required if we want to be 95% confident that the error in using 
Pp to estimate p is less than 0.05? Using 6 = 0.12 as an initial 
estimate of p, we find from Equation 8-24 that the required 
sample size is 


2 2 
20.025 1.96 
= p(1 — p)= .12(0. =1 
n ( . ) a¢ Pp) (772) 0.12(0.88) = 163 


If we wanted to be at least 95% confident that our estimate Pp 
of the true proportion p was within 0.05 regardless of the value 


One-Sided Confidence Bounds 


of p, we would use Equation 8-25 to find the sample size 


n= (22) (0.25) = (8) (0.25) = 385 


Practical Interpretation: Notice that if we have information 
concerning the value of p, either from a preliminary sample 
or from past experience, we could use a smaller sample 
while maintaining both the desired precision of estimation 
and the level of confidence. 


We may find approximate one-sided confidence bounds on p by a simple modification of 


Equation 8-23. 
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Approximate One- 
Sided Confidence 
Bounds on a 
Binomial Proportion 
P ~ %q n 


respectively. 


The approximate 100(1 — a)% lower and upper confidence bounds are 


. (PUB) 


PU — Pp) 


and P=pPt 2 (8-26) 


EXERCISES FOR SECTION 8-4 


8-53. The fraction of defective integrated circuits produced 

in a photolithography process is being studied. A random sam- 

ple of 300 circuits is tested, revealing 13 defectives. 

(a) Calculate a 95% two-sided CI on the fraction of defective 
circuits produced by this particular tool. 

(b) Calculate a 95% upper confidence bound on the fraction 
of defective circuits. 


8-54. An article in Knee Surgery, Sports Traumatology, 
Arthroscopy [Arthroscopic Meniscal Repair with an 
Absorbable Screw: Results and Surgical Technique” (2005, Vol. 
13, pp. 273-279)] showed that only 25 out of 37 tears (67.6%) 
located between 3 and 6 mm from the meniscus rim were healed. 
(a) Calculate a two-sided 95% confidence interval on the 
proportion of such tears that will heal. 
(b) Calculate a 95% lower confidence bound on the propor- 
tion of such tears that will heal. 


8-55. The 2004 presidential election exit polls from the crit- 
ical state of Ohio provided the following results. There were 
2020 respondents in the exit polls and 768 were college grad- 
uates. Of the college graduates, 412 voted for George Bush. 
(a) Calculate a 95% confidence interval for the proportion of 
college graduates in Ohio who voted for George Bush. 
(b) Calculate a 95% lower confidence bound for the proportion 
of college graduates in Ohio who voted for George Bush. 


8-56. Of 1000 randomly selected cases of lung cancer, 823 

resulted in death within 10 years. 

(a) Calculate a 95% two-sided confidence interval on the 
death rate from lung cancer. 

(b) Using the point estimate of p obtained from the prelimi- 
nary sample, what sample size is needed to be 95% confi- 
dent that the error in estimating the true value of p is less 
than 0.03? 

(c) How large must the sample be if we wish to be at least 
95% confident that the error in estimating p is less than 
0.03, regardless of the true value of p? 


8-57. An article in the Journal of the American Statistical 
Association (1990, Vol. 85, pp. 972-985) measured the weight 


of 30 rats under experiment controls. Suppose that there are 12 

underweight rats. 

(a) Calculate a 95% two-sided confidence interval on the true 
proportion of rats that would show underweight from the 
experiment. 

(b) Using the point estimate of p obtained from the prelimi- 
nary sample, what sample size is needed to be 95% confi- 
dent that the error in estimating the true value of p is less 
than 0.02? 

(c) How large must the sample be if we wish to be at least 
95% confident that the error in estimating p is less than 
0.02, regardless of the true value of p? 


8-58. A random sample of 50 suspension helmets used by 
motorcycle riders and automobile race-car drivers was sub- 
jected to an impact test, and on 18 of these helmets some dam- 
age was observed. 

(a) Find a 95% two-sided confidence interval on the true pro- 
portion of helmets of this type that would show damage 
from this test. 

(b) Using the point estimate of p obtained from the prelimi- 
nary sample of 50 helmets, how many helmets must be 
tested to be 95% confident that the error in estimating the 
true value of p is less than 0.02? 

(c) How large must the sample be if we wish to be at least 
95% confident that the error in estimating p is less than 
0.02, regardless of the true value of p? 


8-59. The Arizona Department of Transportation 
wishes to survey state residents to determine what propor- 
tion of the population would like to increase statewide 
highway speed limits to 75 mph from 65 mph. How many 
residents do they need to survey if they want to be at least 
99% confident that the sample proportion is within 0.05 of 
the true proportion? 


8-60. A study is to be conducted of the percentage of home- 
owners who own at least two television sets. How large a 
sample is required if we wish to be 99% confident that the 
error in estimating this quantity is less than 0.017? 


8-5 GUIDELINES FOR CONSTRUCTING CONFIDENCE INTERVALS 


The most difficult step in constructing a confidence interval is often the match of the appro- 
priate calculation to the objective of the study. Common cases are listed in Table 8-1 along with 
the reference to the section that covers the appropriate calculation for a confidence interval test. 
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Table 8-1 The Roadmap for Constracting Confidence Intervals and Performing Hypothesis Tests, 
One-Sample Case 


Parameter to Be 


Bounded by the 

Confidence Interval Confidence Hypothesis 

or Tested with a Interval Test 

Hypothesis? Symbol Other Parameters? Section Section Comments 

Mean of normal we Standard deviation 8-1 9-2 

distribution o known 

Mean of arbitrary we Sample size large 8-1.5 9-2.5 Large sample 

distribution with enough that central size is often 

large sample size limit theorem taken to be 
applies and o is n=40 
essentially known 

Mean of normal we Standard deviation 8-2 9-3 

distribution o unknown and 
estimated 

Variance (or stan- o Mean p unknown 8-3 9-4 

dard deviation) of and estimated 

normal distribution 

Population Pp None 8-4 9-5 

Proportion 


Table 8-1 provides a simple road map to help select the appropriate analysis. Two primary 
comments can help identify the analysis: 


1. Determine the parameter (and the distribution of the data) that will be bounded by the 
confidence interval or tested by the hypothesis. 


2. Check if other parameters are known or need to be estimated. 


In Chapter 9, we will study a procedure closely related to confidence intervals called 
hypothesis testing. Table 8-1 can be used for those procedures also. This road map will be 
extended to more cases in Chapter 10. 


8-6 TOLERANCE AND PREDICTION INTERVALS 
8-6.1 Prediction Interval for a Future Observation 


In some problem situations, we may be interested in predicting a future observation of a 
variable. This is a different problem than estimating the mean of that variable, so a confidence 
interval is not appropriate. In this section we show how to obtain a 100(1 — a)% prediction 
interval on a future value of a normal random variable. 

Suppose that X,, X,..., X,, is arandom sample from a normal population. We wish to 
predict the value X,,,,, a single future observation. A point prediction of X,,,, is X, the sam- 
ple mean. The prediction error is X,,,, — X. The expected value of the prediction error is 


E(Xn41 —X)=p—p=0 


and the variance of the prediction error is 


i) 


ee ee ee 
V(Xi41 ~ X) =O + e(1 +h) 


n 
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because the future observation X,,,; is independent of the mean of the current sample X.The 
prediction error X,,,, — X is normally distributed. Therefore, 


Ye 
las 


= Xntt =X 
= 


which has a ¢ distribution with n — 1 degrees of freedom. Manipulating T as we have done 
previously in the development of a CI leads to a prediction interval on the future observa- 
tion X,,44. 


Prediction 
Interval A 100(1 — «)% prediction interval (PI) on a single future observation from a 
normal distribution is given by 


ie 1 a 1 
Ao) ae Ly/2,n-15 1 + n = Xn+1 SS 3) oe to/2,n—18 1 SF n (8-27) 


The prediction interval for X,,,,; will always be longer than the confidence interval for 
because there is more variability associated with the prediction error than with the error of es- 
timation. This is easy to see because the prediction error is the difference between two random 
variables (X,,,, — X), and the estimation error in the CI is the difference between one random 
variable and a constant (X — w). As n gets larger (n > ~), the length of the CI decreases to 
zero, essentially becoming the single value w, but the length of the PI approaches 2z,/.0. So 
as n increases, the uncertainty in estimating wt goes to zero, although there will always be un- 
certainty about the future value X,,,; even when there is no need to estimate any of the distri- 
bution parameters. 


EXAMPLE 8-9 Alloy Adhesion 


Reconsider the tensile adhesion tests on specimens of 1 
U-700 alloy described in Example 8-5. The load at failure 13.71 — (2.080)3.554/ 1 + rT} = Xx, = 13.71 
for n = 22 specimens was observed, and we found that x = 
13.71 and s = 3.55. The 95% confidence interval on j. was 1 
+ (2.080)3.55,/1 + — 
12.14 = p = 15.28. We plan to test a twenty-third speci- PaO) ea 4 22 
men. A 95% prediction interval on the load at failure for this 6.16 = Xy3 S 21.26 


specimen is : . ; re ; 
Practical Interpretation: Notice that the prediction interval is 


considerably longer than the CI. This is because the CI an 


a | 1 = | 1 estimate of a parameter, while the PI is an interval estimate of 
x — ty/2,n-15 Lost nN = nt+1 Sxt ty/2,n-15 1+ nN %, - 2 
a single future observation. 
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8-6.2 Tolerance Interval for a Normal Distribution 


Tolerance 
Interval 


Consider a population of semiconductor processors. Suppose that the speed of these processors 
has a normal distribution with mean jz = 600 megahertz and standard deviation o = 30 mega- 
hertz. Then the interval from 600 — 1.96(30) = 541.2 to 600 + 1.96(30) = 658.8 megahertz 
captures the speed of 95% of the processors in this population because the interval from 
—1.96 to 1.96 captures 95% of the area under the standard normal curve. The interval from 
Me — Zq/20 to W + Zy/20 is called a tolerance interval. 

If w and o are unknown, we can use the data from a random sample of size n to compute 
x and s, and then form the interval (x — 1.96s, x + 1.965). However, because of sampling 
variability in x and s, it is likely that this interval will contain less than 95% of the values in 
the population. The solution to this problem is to replace 1.96 by some value that will make 
the proportion of the distribution contained in the interval 95% with some level of confidence. 
Fortunately, it is easy to do this. 


A tolerance interval for capturing at least y% of the values in a normal distribution 
with confidence level 10001 — a)% is 
x = [B, x+ks 


where k is a tolerance interval factor found in Appendix Table XII. Values are given 
for y = 90%, 95%, and 99% and for 90%, 95%, and 99% confidence. 


One-sided tolerance bounds can also be computed. The tolerance factors for these bounds are 
also given in Appendix Table XII. 


EXAMPLE 8-10 Alloy Adhesion 

Let’s reconsider the tensile adhesion tests originally described 
in Example 8-5. The load at failure for n = 22 specimens was 
observed, and we found that x = 13.71 and s = 3.55. We want 
to find a tolerance interval for the load at failure that includes 
90% of the values in the population with 95% confidence. 
From Appendix Table XII, the tolerance factor k for n = 22, 
y = 0.90, and 95% confidence is k = 2.264. The desired tol- 
erance interval is 


(x — ks, x + ks) 
or 
[13.71 — (2.264)3.55, 13.71 + (2.264)3.55] 
which reduces to (5.67, 21.74). 
Practical Interpretation: We can be 95% confident that at least 


90% of the values of load at failure for this particular alloy lie 
between 5.67 and 21.74 megapascals. 


From Appendix Table XII, we note that as n — ©, the value of k goes to the z-value associated 
with the desired level of containment for the normal distribution. For example, if we want 
90% of the population to fall in the two-sided tolerance interval, k approaches Zp 95 = 1.645 as 
n— ©, Note that asin ©, a 100(1 — a)% prediction interval on a future value approaches a 


tolerance interval that contains 100(1 — a)% of the distribution. 


EXERCISES FOR SECTION 8-6 


8-61. Consider the tire-testing data described in Exercise 
8-27. Compute a 95% prediction interval on the life of the next 
tire of this type tested under conditions that are similar to 
those employed in the original test. Compare the length of the 


prediction interval with the length of the 95% CI on the 
population mean. 

8-62. Consider the Izod impact test described in Exercise 8-28. 
Compute a 99% prediction interval on the impact strength of 


the next specimen of PVC pipe tested. Compare the length of 
the prediction interval with the length of the 99% CI on the 
population mean. 


8-63. Consider the syrup-dispensing measurements de- 
scribed in Exercise 8-29. Compute a 95% prediction interval 
on the syrup volume in the next beverage dispensed. Compare 
the length of the prediction interval with the length of the 95% 
CI on the population mean. 


8-64. Consider the natural frequency of beams described 
in Exercise 8-32. Compute a 90% prediction interval on the 
diameter of the natural frequency of the next beam of this 
type that will be tested. Compare the length of the prediction 
interval with the length of the 90% CI on the population 
mean. 


8-65. Consider the rainfall in Exercise 8-33. Compute a 
95% prediction interval on the rainfall for the next year. 
Compare the length of the prediction interval with the length 
of the 95% CI on the population mean. 


8-66. Consider the margarine test described in Exercise 8-36. 
Compute a 99% prediction interval on the polyunsaturated 
fatty acid in the next package of margarine that is tested. 
Compare the length of the prediction interval with the length 
of the 99% CI on the population mean. 


8-67. Consider the television tube brightness test described 
in Exercise 8-35. Compute a 99% prediction interval on the 
brightness of the next tube tested. Compare the length of the 
prediction interval with the length of the 99% CI on the popu- 
lation mean. 


8-68. Consider the suspension rod diameter measurements 
described in Exercise 8-38. Compute a 95% prediction inter- 
val on the diameter of the next rod tested. Compare the length 
of the prediction interval with the length of the 95% CI on the 
population mean. 


8-69. Consider the test on the compressive strength of con- 
crete described in Exercise 8-37. Compute a 90% prediction 
interval on the next specimen of concrete tested. 


8-70. Consider the bottle-wall thickness measurements 
described in Exercise 8-40. Compute a 90% prediction interval 
on the wall thickness of the next bottle tested. 


8-71. Consider the fuel rod enrichment data described 
in Exercise 8-41. Compute a 90% prediction interval on the 
enrichment of the next rod tested. Compare the length of 
the prediction interval with the length of the 99% CI on the 
population mean. 


8-72. How would you obtain a one-sided prediction bound 
on a future observation? Apply this procedure to obtain a 95% 
one-sided prediction bound on the wall thickness of the next 
bottle for the situation described in Exercise 8-40. 


8-73. Consider the tire-testing data in Exercise 8-27. 
Compute a 95% tolerance interval on the life of the tires that 
has confidence level 95%. Compare the length of the toler- 
ance interval with the length of the 95% CI on the population 
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mean. Which interval is shorter? Discuss the difference in 
interpretation of these two intervals. 


8-74. Consider the Izod impact test described in Exercise 
8-28. Compute a 99% tolerance interval on the impact 
strength of PVC pipe that has confidence level 90%. Compare 
the length of the tolerance interval with the length of the 99% 
CI on the population mean. Which interval is shorter? Discuss 
the difference in interpretation of these two intervals. 


8-75. Consider the syrup-volume data in Exercise 8-29. 
Compute a 95% tolerance interval on the syrup volume that 
has confidence level 90%. Compare the length of the tolerance 
interval with the length of the 95% CI on the population mean. 


8-76. Consider the margarine test described in Exercise 8-36. 
Compute a 99% tolerance interval on the polyunsaturated 
fatty acid in this particular type of margarine that has confi- 
dence level 95%. Compare the length of the tolerance inter- 
val with the length of the 99% CI on the population mean. 
Which interval is shorter? Discuss the difference in inter- 
pretation of these two intervals. 


8-77. Consider the rainfall data in Exercise 8-33. Compute a 
95% tolerance interval that has confidence level 95%. Compare 
the length of the tolerance interval with the length of the 95% CI 
on the population mean. Discuss the difference in interpretation 
of these two intervals. 


8-78. Consider the suspension rod diameter data in Exercise 
8-38. Compute a 95% tolerance interval on the diameter of the 
rods described that has 90% confidence. Compare the length 
of the tolerance interval with the length of the 95% CI on the 
population mean. Which interval is shorter? Discuss the dif- 
ference in interpretation of these two intervals. 


8-79. Consider the television tube brightness data in 
Exercise 8-35. Compute a 99% tolerance interval on the 
brightness of the television tubes that has confidence level 
95%. Compare the length of the tolerance interval with the 
length of the 99% CI on the population mean. Which interval 
is shorter? Discuss the difference in interpretation of these two 
intervals. 


8-80. Consider the strength-of-concrete data in Exercise 
8-37. Compute a 90% tolerance interval on the compressive 
strength of the concrete that has 90% confidence. 


8-81. Consider the fuel rod enrichment data described in 
Exercise 8-41. Compute a 99% tolerance interval on rod 
enrichment that has confidence level 95%. Compare the 
length of the tolerance interval with the length of the 95% CI 
on the population mean. 


8-82. Consider the bottle-wall thickness measurements 

described in Exercise 8-40. 

(a) Compute a 90% tolerance interval on bottle-wall thick- 
ness that has confidence level 90%. 

(b) Compute a 90% lower tolerance bound on bottle-wall 
thickness that has confidence level 90%. Why would a 
lower tolerance bound likely be of interest here? 
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Supplemental Exercises 


8-83. Consider the confidence interval for ~ with known 
standard deviation o: 


x- Zy,0/Vn = wsxt Zq,0/Vn 


where a, + a, = a. Let a = 0.05 and find the interval for 
Q, = a = a/2 = 0.025. Now find the interval for the case 
a, = 0.01 and a, = 0.04. Which interval is shorter? Is there 
any advantage to a “symmetric” confidence interval? 


8-84. A normal population has a known mean of 50 and 

unknown variance. 

(a) A random sample of n = 16 is selected from this popula- 
tion, and the sample results are x = 52 and s = 8. How 
unusual are these results? That is, what is the probability 
of observing a sample average as large as 52 (or larger) if 
the known, underlying mean is actually 50? 

(b) A random sample of n = 30 is selected from this popula- 
tion, and the sample results are x = 52 and s = 8. How 
unusual are these results? 

(c) Arandom sample of n = 100 is selected from this popula- 
tion, and the sample results are x = 52 and s = 8. How 
unusual are these results? 

(d) Compare your answers to parts (a)—(c) and explain why 
they are the same or different. 

8-85. A normal population has known mean p = 50 and 

variance o* = 5. What is the approximate probability that the 

sample variance is greater than or equal to 7.44? less than or 
equal to 2.56? For a random sample of size 


(a) n= 16 
(b) n = 30 
(c) n=71 


(d) Compare your answers to parts (a)—(c) for the approxi- 
mate probability that the sample variance is greater than 
or equal to 7.44. Explain why this tail probability is 
increasing or decreasing with increased sample size. 

(e) Compare your answers to parts (a)—(c) for the approxi- 
mate probability that the sample variance is less than or 
equal to 2.56. Explain why this tail probability is increas- 
ing or decreasing with increased sample size. 


8-86. Anarticle in the Journal of Sports Science (1987, Vol. 
5, pp. 261-271) presents the results of an investigation of the 
hemoglobin level of Canadian Olympic ice hockey players. 
The data reported are as follows (in g/dl): 

15.3 16.0 14.4 16.2 16.2 

14.9 15.7 15.3 14.6 15.7 

16.0 15.0 15.7 16.2 14.7 

14.8 14.6 15.6 14.5 15.2 


(a) Given the following probability plot of the data, what is a log- 
ical assumption about the underlying distribution of the data? 


Percentage 
ol 
oO 


13.5 140 145 15.0 15.5 160 16.5 17.0 
Hemoglobin Level 


(b) Explain why this check of the distribution underlying the 
sample data is important if we want to construct a confi- 
dence interval on the mean. 

(c) Based on this sample data, a 95% confidence interval for 
the mean is (15.04, 15.62). Is it reasonable to infer that the 
true mean could be 14.5? Explain your answer. 

(d) Explain why this check of the distribution underlying the 
sample data is important if we want to construct a confi- 
dence interval on the variance. 

(e) Based on these sample data, a 95% confidence interval 
for the variance is (0.22, 0.82). Is it reasonable to infer 
that the true variance could be 0.35? Explain your 
answer. 

(f) Is it reasonable to use these confidence intervals to draw 
an inference about the mean and variance of hemoglobin 
levels 

(i) of Canadian doctors? Explain your answer. 
(ii) of Canadian children ages 6-12? Explain your answer. 


8-87. The article “Mix Design for Optimal Strength 
Development of Fly Ash Concrete” (Cement and Concrete 
Research, 1989, Vol. 19, No. 4, pp. 634-640) investigates the 
compressive strength of concrete when mixed with fly ash (a 
mixture of silica, alumina, iron, magnesium oxide, and other 
ingredients). The compressive strength for nine samples in 
dry conditions on the twenty-eighth day are as follows (in 
megapascals): 


40.2 30.4 28.9 30.5 22.4 
25.8 18.4 14.2 15.3 


(a) Given the following probability plot of the data, what is a 
logical assumption about the underlying distribution of 
the data? 


Percentage 


(b) 


(c 


YV 


(d) 


(e 


Ym 


(f 


VY 


(g) 


50 


Strength 


Find a 99% lower one-sided confidence interval on mean 
compressive strength. Provide a practical interpretation of 
this interval. 

Find a 98% two-sided confidence interval on mean com- 
pressive strength. Provide a practical interpretation of this 
interval and explain why the lower end-point of the inter- 
val is or is not the same as in part (b). 

Find a 99% upper one-sided confidence interval on the 
variance of compressive strength. Provide a practical in- 
terpretation of this interval. 

Find a 98% two-sided confidence interval on the variance 
of compression strength. Provide a practical interpretation 
of this interval and explain why the upper end-point of the 
interval is or is not the same as in part (d). 

Suppose that it was discovered that the largest observation 
40.2 was misrecorded and should actually be 20.4. Now 
the sample mean x = 23 and the sample variance s* = 
39.8. Use these new values and repeat parts (c) and (e). 
Compare the original computed intervals and the newly 
computed intervals with the corrected observation value. 
How does this mistake affect the values of the sample 
mean, sample variance, and the width of the two-sided 
confidence intervals? 

Suppose, instead, that it was discovered that the largest 
observation 40.2 is correct, but that the observation 25.8 
is incorrect and should actually be 24.8. Now the sample 
mean x = 25 and the standard deviation s = 8.41. Use 
these new values and repeat parts (c) and (e). Compare 
the original computed intervals and the newly computed 
intervals with the corrected observation value. How does 
this mistake affect the values of the sample mean, the 
sample variance, and the width of the two-sided confi- 
dence intervals? 
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(h) Use the results from parts (f) and (g) to explain the effect 
of mistakenly recorded values on sample estimates. 
Comment on the effect when the mistaken values are near 
the sample mean and when they are not. 


8-88. Anoperating system for a personal computer has been 

studied extensively, and it is known that the standard deviation 

of the response time following a particular command is o = 8 

milliseconds. A new version of the operating system is 

installed, and we wish to estimate the mean response time for 
the new system to ensure that a 95% confidence interval for 
has a length of at most 5 milliseconds. 

(a) If we can assume that response time is normally distributed 
and that o = 8 for the new system, what sample size would 
you recommend? 

(b) Suppose that we are told by the vendor that the standard 
deviation of the response time of the new system is smaller, 
say, 0 = 6; give the sample size that you recommend and 
comment on the effect the smaller standard deviation has on 
this calculation. 


8-89. Consider the hemoglobin data in Exercise 8-86. Find 

the following: 

(a) An interval that contains 95% of the hemoglobin values 
with 90% confidence. 

(b) An interval that contains 99% of the hemoglobin values 
with 90% confidence. 


8-90. Consider the compressive strength of concrete data 
from Exercise 8-87. Find a 95% prediction interval on the 
next sample that will be tested. 


8-91. The maker of a shampoo knows that customers like 

this product to have a lot of foam. Ten sample bottles of the 

product are selected at random and the foam heights observed 

are as follows (in millimeters): 210, 215, 194, 195, 211, 201, 

198, 204, 208, and 196. 

(a) Is there evidence to support the assumption that foam 
height is normally distributed? 

(b) Find a 95% CI on the mean foam height. 

(c) Find a 95% prediction interval on the next bottle of sham- 
poo that will be tested. 

(d) Find an interval that contains 95% of the shampoo foam 
heights with 99% confidence. 

(e) Explain the difference in the intervals computed in parts 
(b), (c), and (d). 


8-92. During the 1999 and 2000 baseball seasons, there was 
much speculation that the unusually large number of home 
runs that were hit was due at least in part to a livelier ball. One 
way to test the “liveliness” of a baseball is to launch the ball at 
a vertical surface with a known velocity V, and measure the 
ratio of the outgoing velocity Vo of the ball to V,. The ratio 
R = VQ/V, is called the coefficient of restitution. Following 
are measurements of the coefficient of restitution for 40 
randomly selected baseballs. The balls were thrown from a 
pitching machine at an oak surface. 
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0.6248 0.6237 0.6118 0.6159 0.6298 0.6192 
0.6520 0.6368 0.6220 0.6151 0.6121 0.6548 
0.6226 0.6280 0.6096 0.6300 0.6107 0.6392 
0.6230 0.6131 0.6223 0.6297 0.6435 0.5978 
0.6351 0.6275 0.6261 0.6262 0.6262 0.6314 
0.6128 0.6403 0.6521 0.6049 0.6170 

0.6134 0.6310 0.6065 0.6214 0.6141 


(a) Is there evidence to support the assumption that the coef- 
ficient of restitution is normally distributed? 

(b) Find a 99% CI on the mean coefficient of restitution. 

(c) Find a 99% prediction interval on the coefficient of resti- 
tution for the next baseball that will be tested. 

(d) Find an interval that will contain 99% of the values of the 
coefficient of restitution with 95% confidence. 

(e) Explain the difference in the three intervals computed in 
parts (b), (c), and (d). 

8-93. Consider the baseball coefficient of restitution data in 

Exercise 8-92. Suppose that any baseball that has a coefficient 

of restitution that exceeds 0.635 is considered too lively. 

Based on the available data, what proportion of the baseballs 

in the sampled population are too lively? Find a 95% lower 

confidence bound on this proportion. 


8-94. An article in the ASCE Journal of Energy Engineering 
(“Overview of Reservoir Release Improvements at 20 TVA 
Dams” (Vol. 125, April 1999, pp. 1-17)] presents data on 
dissolved oxygen concentrations in streams below 20 dams in 
the Tennessee Valley Authority system. The observations are (in 
milligrams per liter): 5.0, 3.4, 3.9, 1.3, 0.2, 0.9, 2.7, 3.7, 3.8, 4.1, 
1.0, 1.0, 0.8, 0.4, 3.8, 4.5, 5.3, 6.1, 6.9, and 6.5. 
(a) Is there evidence to support the assumption that the 
dissolved oxygen concentration is normally distributed? 
(b) Find a 95% CI on the mean dissolved oxygen concentration. 
(c) Find a 95% prediction interval on the dissolved oxygen con- 
centration for the next stream in the system that will be tested. 
(d) Find an interval that will contain 95% of the values of the 
dissolved oxygen concentration with 99% confidence. 
(e) Explain the difference in the three intervals computed in 
parts (b), (c), and (d). 


8-95. The tar content in 30 samples of cigar tobacco follows: 
1.542 1.585 1.532 1.466 1.499 1.611 
1.622 1.466 1.546 1.494 1.548 1.626 
1.440 1.608 1.520 1.478 1.542 1.511 
1.459 1.533 1.532 1.523 1.397 1.487 
1.598 1.498 1.600 1.504 1.545 1.558 


(a) Is there evidence to support the assumption that the tar 
content is normally distributed? 

(b) Find a 99% CI on the mean tar content. 

(c) Find a 99% prediction interval on the tar content for the 
next observation that will be taken on this particular type 
of tobacco. 


(d) Find an interval that will contain 99% of the values of the 
tar content with 95% confidence. 

(e) Explain the difference in the three intervals computed in 
parts (b), (c), and (d). 

8-96. A manufacturer of electronic calculators takes a 

random sample of 1200 calculators and finds that there are 

eight defective units. 

(a) Construct a 95% confidence interval on the population 
proportion. 

(b) Is there evidence to support a claim that the fraction of 
defective units produced is 1% or less? 


8-97. An article in The Engineer (“Redesign for Suspect 

Wiring,” June 1990) reported the results of an investigation 

into wiring errors on commercial transport aircraft that may 

produce faulty information to the flight crew. Such a wiring 
error may have been responsible for the crash of a British 

Midland Airways aircraft in January 1989 by causing the pilot 

to shut down the wrong engine. Of 1600 randomly selected 

aircraft, eight were found to have wiring errors that could 
display incorrect information to the flight crew. 

(a) Find a 99% confidence interval on the proportion of air- 
craft that have such wiring errors. 

(b) Suppose we use the information in this example to 
provide a preliminary estimate of p. How large a sample 
would be required to produce an estimate of p that we are 
99% confident differs from the true value by at most 0.008? 

(c) Suppose we did not have a preliminary estimate of p. How 
large a sample would be required if we wanted to be at 
least 99% confident that the sample proportion differs 
from the true proportion by at most 0.008 regardless of the 
true value of p? 

(d) Comment on the usefulness of preliminary information in 
computing the needed sample size. 

8-98. An article in Engineering Horizons (Spring 1990, 

p. 26) reported that 117 of 484 new engineering graduates were 

planning to continue studying for an advanced degree. 

Consider this as a random sample of the 1990 graduating class. 

(a) Find a 90% confidence interval on the proportion of such 
graduates planning to continue their education. 

(b) Find a 95% confidence interval on the proportion of such 
graduates planning to continue their education. 

(c) Compare your answers to parts (a) and (b) and explain 
why they are the same or different. 

(d) Could you use either of these confidence intervals to 
determine whether the proportion is actually 0.25? 
Explain your answer. Hint: Use the normal approximation 
to the binomial. 

8-99. An article in the Journal of Applied Physiology 

(“Humidity Does Not Affect Central Nervous System Oxygen 

Toxicity” (2001, Vol. 91, pp. 1327—1333)] reported that central 

nervous system (CNS) oxygen toxicity can appear in humans 

on exposure to oxygen pressures >180 kPa. CNS oxygen toxi- 
city can occur as convulsions (similar to epileptic seizures, 
grand mal) and loss of consciousness without any warning 


symptoms. CNS oxygen toxicity is a risk encountered in sev- 
eral fields of human activity, such as combat diving with 
closed-circuit breathing apparatus and diving with mixtures of 
nitrogen and oxygen (nitrox) or nitrogen, oxygen, and helium 
(trimix) in sport and professional diving to depths >30 m. The 
risk of oxygen toxicity is always considered when deep diving 
is planned. The data shown below demonstrate shortened 
latencies in a dry atmosphere (<10% humidity) in 11 rats at O, 
of 507 kPa. The data collected are as follows: 


22 26 19 27 37 27 
14 19 23 18 18 


99 


Percent 
a 
fo) 


Latency 
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(a) Given the above probability plot of the data, what is a logical 
assumption about the underlying distribution of the data? 

(b) Explain why this check of the distribution underlying the 
sample data is important if we want to construct a confi- 
dence interval on the mean. 

(c) Find the 95% confidence interval for the mean. 

(d) Explain why this check of the distribution underlying the 
sample data is important if we want to construct a confi- 
dence interval on the variance. 

(e) Find the 95% confidence interval for the variance. 

8-100. An article in the Journal of Human Nutrition and 

Dietetics (“The Validation of Energy and Protein Intakes by 

Doubly Labeled Water and 24-Hour Urinary Nitrogen 

Excretion in Post-Obese Subjects” (1995, Vol. 8, pp. 51-64)] 

showed the energy intake expressed as a basal metabolic rate, 

BMR (MJ). 


5.40 
5.70 


5.67 
6.08 


5.79 
5.48 


6.85 
5.44 


6.92 
5.51 


(a) Use a normal probability plot to check the normality 
assumption. 

(b) Find a 99% two-sided confidence interval on the mean 
BMR. 


MIND-EXPANDING EXERCISES 


8-101. An electrical component has a time-to-failure 
(or lifetime) distribution that is exponential with param- 
eter \, so the mean lifetime is ~ = 1/d. Suppose that a 
sample of n of these components is put on test, and let 
X; be the observed lifetime of component i. The test con- 
tinues only until the rth unit fails, where r < n. This re- 
sults in a censored life test. Let X, denote the time at 
which the first failure occurred, X, denote the time at 
which the second failure occurred, and so on. Then the 
total lifetime that has been accumulated at test termina- 
tion is 
: 
Lk an 
i=l 

We have previously shown in Exercise 7-74 that T,/r is 
an unbiased estimator for jw. 
(a) Itcan be shown that 2X7, has a chi-square distribution 

with 27 degrees of freedom. Use this fact to develop a 


10011 — «)% confidence interval for mean lifetime 
w= 1/r. 

(b) Suppose 20 units were put on test, and the test 
terminated after 10 failures occurred. The failure 
times (in hours) are 15, 18, 19, 20, 21, 21, 22, 27, 
28, and 29. Find a 95% confidence interval on mean 
lifetime. 


8-102. Consider a two-sided confidence interval for 
the mean p when o is known: 


= Zq,0/Vn S psxt Za, 0/Vn 


where a, + a, = a. Ifa; = a) = a/2, we have the usual 
10011 — «)% confidence interval for p. In the above, 
when a, # Q), the interval is not symmetric about p. 
The length of the interval is L = o(Z, + Z,)/Vn. 
Prove that the length of the interval L is minimized when 
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MIND-EXPANDING EXERCISES 


Q,; = Q = a/2. Hint: Remember that ®(z,) = 1 — a, 
so ®'(1 — a) = zy, and the relationship between the 
derivative of a function y = f(x) and the inverse 
x = f'(y) is (d/dy)f-"(y) = 1/{(d/dx) f(x). 

8-103. It is possible to construct a nonparametric tol- 
erance interval that is based on the extreme values in a 
random sample of size n from any continuous population. 
If p is the minimum proportion of the population con- 
tained between the smallest and largest sample observa- 
tions with confidence 1 — a, it can be shown that 


np"! —(n— p"= a 


and n is approximately 


i, fire eu 


(a) In order to be 95% confident that at least 90% of the 
population will be included between the extreme val- 
ues of the sample, what sample size will be required? 

(b) A random sample of 10 transistors gave the follow- 
ing measurements on saturation current (in mil- 
liamps): 10.25, 10.41, 10.30, 10.26, 10.19, 10.37, 
10.29, 10.34, 10.23, 10.38. Find the limits that con- 
tain a proportion p of the saturation current meas- 
urements at 95% confidence. What is the proportion 
p contained by these limits? 


8-104. Suppose that X,, X,,..., X, is a random 
sample from a continuous probability distribution with 
median L. 

(a) Show that 


P {min(X;) < ji < max(X)} 


Hint: The complement of the event [min(X;) < fi 

< max(X;)] is [max(X;) = fi] U [min(X;) = ji], but 

max(X;) = ji if and only if X; = ji for all i]. 

(b) Write down a 100(1 — «)% confidence interval for 
the median ji, where 


8-105. Students in the industrial statistics lab at ASU 
calculate a lot of confidence intervals on jw. Suppose all 
these CIs are independent of each other. Consider the 
next one thousand 95% confidence intervals that will be 
calculated. How many of these CIs do you expect to 
capture the true value of j1? What is the probability that 
between 930 and 970 of these intervals contain the true 
value of 1? 


IMPORTANT TERMS AND CONCEPTS 


Chi-squared Confidence interval for Error in estimation Prediction interval 
distribution the variance of a Large sample confidence —_‘ Tolerance interval 
Confidence coefficient normal distribution interval Two-sided confidence 

Confidence interval Confidence intervals One-sided confidence interval 
Confidence interval for the mean of a bounds t distribution 


normal distribution 
Confidence level 


Precision of parameter 
estimation 


for a population 
proportion 
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Tests of Hypotheses for a Single Sample 


INTRODUCTION 


In the previous two chapters we showed how a parameter of a population can be estimated 
from sample data, using either a point estimate (Chapter 7) or an interval of likely values 
called a confidence interval (Chapter 8). In many situations a different type of problem is 
of interest; there are two competing claims about the value of a parameter, and the engi- 
neer must determine which claim is correct. For example, suppose that an engineer is 
designing an air crew escape system that consists of an ejection seat and a rocket motor 
that powers the seat. The rocket motor contains a propellant, and in order for the ejection 
seat to function properly, the propellant should have a mean burning rate of 50 cm/sec. If 
the burning rate is too low, the ejection seat may not function properly, leading to an 
unsafe ejection and possible injury of the pilot. Higher burning rates may imply instabil- 
ity in the propellant or an ejection seat that is too powerful, again leading to possible 
pilot injury. So the practical engineering question that must be answered is: Does the 
mean burning rate of the propellant equal 50 cm/sec, or is it some other value (either 
higher or lower)? This type of question can be answered using a statistical technique 
called hypothesis testing. This chapter focuses on the basic principles of hypothesis 
testing and provides techniques for solving the most common types of hypothesis testing 
problems involving a single sample of data. 
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CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 
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Sample Size 9-8 CONTINGENCY TABLE TESTS 
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9-9.2. The Wilcoxon Signed-Rank Test 
9-4.1 Hypothesis Tests on the 
Variance 9-9.3, Comparison to the t-test 
LEARNING OBJECTIVES 


After careful study of this chapter, you should be able to do the following: 


. Structure engineering decision-making problems as hypothesis tests 


. Test hypotheses on the mean of a normal distribution using either a Z-test or a t-test procedure 


. Test hypotheses on the variance or standard deviation of a normal distribution 


. Use the P-value approach for making decisions in hypotheses tests 


1 
2 
3 
4. Test hypotheses on a population proportion 
5 
6 


. Compute power and type II error probability, and make sample size selection decisions for tests 
on means, variances, and proportions 


7. Explain and use the relationship between confidence intervals and hypothesis tests 


8. Use the chi-square goodness of fit test to check distributional assumptions 


9. Use contingency table tests 


9-1 HYPOTHESIS TESTING 


9-1.1 Statistical Hypotheses 


In the previous chapter we illustrated how to construct a confidence interval estimate of a 
parameter from sample data. However, many problems in engineering require that we decide 
which of two competing claim or statements about some parameter is true. The statements are 
called hypotheses, and the decision-making procedure is called hypothesis testing. This is 
one of the most useful aspects of statistical inference, since many types of decision-making 
problems, tests, or experiments in the engineering world can be formulated as hypothesis- 
testing problems. Furthermore, as we will see, there is a very close connection between 
hypothesis testing and confidence intervals. 


Statistical 
Hypothesis 
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Statistical hypothesis testing and confidence interval estimation of parameters are the 
fundamental methods used at the data analysis stage of a comparative experiment, in which 
the engineer is interested, for example, in comparing the mean of a population to a specified 
value. These simple comparative experiments are frequently encountered in practice and 
provide a good foundation for the more complex experimental design problems that we will 
discuss in Chapters 13 and 14. In this chapter we discuss comparative experiments involving 
a single population, and our focus is on testing hypotheses concerning the parameters of the 
population. 

We now give a formal definition of a statistical hypothesis. 


A statistical hypothesis is a statement about the parameters of one or more populations. 


Since we use probability distributions to represent populations, a statistical hypothesis 
may also be thought of as a statement about the probability distribution of a random variable. 
The hypothesis will usually involve one or more parameters of this distribution. 

For example, consider the aircrew escape system described in the introduction. Suppose 
that we are interested in the burning rate of the solid propellant. Now, burning rate is a random 
variable that can be described by a probability distribution. Suppose that our interest focuses 
on the mean burning rate (a parameter of this distribution). Specifically, we are interested in 
deciding whether or not the mean burning rate is 50 centimeters per second. We may express 
this formally as 


Ho: w = 50 centimeters per second 


H: w # 50 centimeters per second (9-1) 


The statement Hp: wp = 50 centimeters per second in Equation 9-1 is called the null 
hypothesis, and the statement H,: ~ # 50 centimeters per second is called the alternative 
hypothesis. Since the alternative hypothesis specifies values of w that could be either greater 
or less than 50 centimeters per second, it is called a two-sided alternative hypothesis. In some 
situations, we may wish to formulate a one-sided alternative hypothesis, as in 


Ho: w = 50 centimeters per second Ho: » = 50 centimeters per second 
or (9-2) 


HA: w < 50 centimeters per second HA: w > 50 centimeters per second 


It is important to remember that hypotheses are always statements about the population or 
distribution under study, not statements about the sample. The value of the population param- 
eter specified in the null hypothesis (50 centimeters per second in the above example) is usu- 
ally determined in one of three ways. First, it may result from past experience or knowledge 
of the process, or even from previous tests or experiments. The objective of hypothesis testing, 
then, is usually to determine whether the parameter value has changed. Second, this value may 
be determined from some theory or model regarding the process under study. Here the objec- 
tive of hypothesis testing is to verify the theory or model. A third situation arises when the 
value of the population parameter results from external considerations, such as design or en- 
gineering specifications, or from contractual obligations. In this situation, the usual objective 
of hypothesis testing is conformance testing. 
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A procedure leading to a decision about a particular hypothesis is called a test of a 
hypothesis. Hypothesis-testing procedures rely on using the information in a random sample 
from the population of interest. If this information is consistent with the hypothesis, we will not 
reject the hypothesis; however, if this information is inconsistent with the hypothesis, we will 
conclude that the hypothesis is false. We emphasize that the truth or falsity of a particular hy- 
pothesis can never be known with certainty, unless we can examine the entire population. This 
is usually impossible in most practical situations. Therefore, a hypothesis-testing procedure 
should be developed with the probability of reaching a wrong conclusion in mind. In our treat- 
ment of hypothesis testing, the null hypothesis will always be stated so that it specifies an exact 
value of the parameter (as in the statement Hp: 1 = 50 centimeters per second in Equation 
9-1). Testing the hypothesis involves taking a random sample, computing a test statistic from 
the sample data, and then using the test statistic to make a decision about the null hypothesis. 


9-1.2 Tests of Statistical Hypotheses 


To illustrate the general concepts, consider the propellant burning rate problem introduced 
earlier. The null hypothesis is that the mean burning rate is 50 centimeters per second, and the 
alternate is that it is not equal to 50 centimeters per second. That is, we wish to test 


Ho: » = 50 centimeters per second 


H,: » # 50 centimeters per second 


Suppose that a sample of n = 10 specimens is tested and that the sample mean burning 
rate x is observed. The sample mean is an estimate of the true population mean w. A value of 
the sample mean xX that falls close to the hypothesized value of 1 = 50 centimeters per second 
does not conflict with the null hypothesis that the true mean w is really 50 centimeters per 
second. On the other hand, a sample mean that is considerably different from 50 centimeters 
per second is evidence in support of the alternative hypothesis H,. Thus, the sample mean is 
the test statistic in this case. 

The sample mean can take on many different values. Suppose that if 48.5 < x <= 51.5, we 
will not reject the null hypothesis Ho: w = 50, and if either x < 48.5 or x > 51.5, we will 
reject the null hypothesis in favor of the alternative hypothesis H;: w # 50. This is illustrated 
in Fig. 9-1. The values of x that are less than 48.5 and greater than 51.5 constitute the critical 
region for the test, while all values that are in the interval 48.5 = x = 51.5 form a region for 
which we will fail to reject the null hypothesis. By convention, this is usually called the 
acceptance region. The boundaries between the critical regions and the acceptance region are 
called the critical values. In our example the critical values are 48.5 and 51.5. It is customary 
to state conclusions relative to the null hypothesis Hp. Therefore, we reject Hy in favor of H, 
if the test statistic falls in the critical region, and fail to reject Hy otherwise. 


Reject Ho Fail to Reject Ho Reject Ho 
uz 50 cm/s p= 50 cm/s uz 50 cm/s 
48.5 50 51.5 x 


Figure 9-1 Decision criteria for testing Hp: p = 
50 centimeters per second versus H,: 1 # 50 centime- 
ters per second. 


Type I Error 


Type I Error 


Probability of 
Type I Error 
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This decision procedure can lead to either of two wrong conclusions. For example, the 
true mean burning rate of the propellant could be equal to 50 centimeters per second. 
However, for the randomly selected propellant specimens that are tested, we could observe a 
value of the test statistic x that falls into the critical region. We would then reject the null 
hypothesis H) in favor of the alternate H, when, in fact, Hp is really true. This type of wrong 
conclusion is called a type I error. 


Rejecting the null hypothesis Hp when it is true is defined as a type I error. 


Now suppose that the true mean burning rate is different from 50 centimeters per second, yet 
the sample mean x falls in the acceptance region. In this case we would fail to reject Hj when 
it is false. This type of wrong conclusion is called a type II error. 


Failing to reject the null hypothesis when it is false is defined as a type II error. 


Thus, in testing any statistical hypothesis, four different situations determine whether the final 
decision is correct or in error. These situations are presented in Table 9-1. 

Because our decision is based on random variables, probabilities can be associated with 
the type I and type II errors in Table 9-1. The probability of making a type I error is denoted 
by the Greek letter a. 


a = P(typel error) = P(reject Hy when H, is true) (9-3) 


Sometimes the type I error probability is called the significance level, or the o-error, or the 
size of the test. In the propellant burning rate example, a type I error will occur when either 
x > 51.5 or x < 48.5 when the true mean burning rate really is ~ = 50 centimeters per sec- 
ond. Suppose that the standard deviation of burning rate is o = 2.5 centimeters per second 
and that the burning rate has a distribution for which the conditions of the central limit theo- 
rem apply, so the distribution of the sample mean is approximately normal with mean p = 50 
and standard deviation o/Vn = 2.5/V/10 = 0.79. The probability of making a type I error 
(or the significance level of our test) is equal to the sum of the areas that have been shaded in 
the tails of the normal distribution in Fig. 9-2. We may find this probability as 


a = P(X < 48.5 when p = 50) + P(X > 51.5 when p = 50) 


a/2 = 0.0287 a/2 = 0.0287 Table 9-1 Decisions in Hypothesis Testing 


Figure 9-2. The critical region for Hy: p = 50 
versus H,: w # 50 andn = 10. 


Decision Hy Is True Hi Is False 


48.5 w=50 51.5 X 
Fail to reject Hy no error type II error 


Reject Hy type I error no error 
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The z-values that correspond to the critical values 48.5 and 51.5 are 


_ 485-50 a : SI ga 
1 "079. si 2 O19. 


Therefore, 
a = P(Z < —1.90) + P(Z > 1.90) = 0.0287 + 0.0287 = 0.0574 


This is the type I error probability. This implies that 5.74% of all random samples would lead 
to rejection of the hypothesis Hg: 1 = 50 centimeters per second when the true mean burning 
rate is really 50 centimeters per second. 

From inspection of Fig. 9-2, notice that we can reduce a by widening the acceptance 
region. For example, if we make the critical values 48 and 52, the value of a is 


48 — 50 52 — 50 
= <——— } + > —_— = < —2.53) + > 2, 
o p(z aE ) p(z Ga6 ) P(Z < —2.53) + P(Z > 2.53) 


= 0.0057 + 0.0057 = 0.0114 


We could also reduce a by increasing the sample size. If n = 16, o/Vn = 2.5/V/16 = 
0.625, and using the original critical region from Fig. 9-1, we find 


_ 48.5 — 50 


_ 51.5 — 50 
se 0.625 


= —240 and Z = 0.625 = 2.40 


Therefore, 
a = P(Z < —2.40) + P(Z > 2.40) = 0.0082 + 0.0082 = 0.0164 


In evaluating a hypothesis-testing procedure, it is also important to examine the proba- 
bility of a type II error, which we will denote by B. That is, 


8 = P(type Il error) = P(fail to reject Hy when H) is false) (9-4) 


To calculate B (sometimes called the $-error), we must have a specific alternative hypothe- 
sis; that is, we must have a particular value of 1. For example, suppose that it is important to 
reject the null hypothesis Hy: ~ = 50 whenever the mean burning rate ws is greater than 52 
centimeters per second or less than 48 centimeters per second. We could calculate the proba- 
bility of a type II error B for the values p = 52 and p = 48 and use this result to tell us some- 
thing about how the test procedure would perform. Specifically, how will the test procedure 
work if we wish to detect, that is, reject Ho, for a mean value of p = 52 or wp = 48? Because 
of symmetry, it is necessary only to evaluate one of the two cases—-say, find the probability of 
accepting the null hypothesis Ho: 1 = 50 centimeters per second when the true mean is p = 
52 centimeters per second. 


Computing the 


Type II Error Ho: w 


Figure 9-3 will help us calculate the probability of type II error B. The normal distribution 
Probability of on the left in Fig. 9-3 is the distribution of the test statistic X when the null hypothesis 
= 50 is true (this is what is meant by the expression “under Hy: = 50”), and the nor- 
mal distribution on the right is the distribution of X when the alternative hypothesis is true and 
the value of the mean is 52 (or “under H,: w = 52”). Now a type II error will be committed if 
the sample mean X falls between 48.5 and 51.5 (the critical region boundaries) when yp. = 52. 
As seen in Fig. 9-3, this is just the probability that 48.5 < X < 51.5 when the true mean is 
w. = 52, or the shaded area under the normal distribution centered at ~ = 52. Therefore, 
referring to Fig. 9-3, we find that 
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B = P(48.5 = X < 51.5 when p = 52) 


_ 48.5 — 52 _ 


4.4 
0.79 


ZI 


Therefore, 


B = P(—4.43 < Z < —0.63) = 
= 0.2643 — 0.0000 = 0.2643 


Thus, if we are testing Hy: w = 50 against H,: w # 50 with n = 10, and the true value of the 
mean is p = 52, the probability that we will fail to reject the false null hypothesis is 0.2643. By 
symmetry, if the true value of the mean is = 48, the value of B will also be 0.2643. 
The probability of making a type II error B increases rapidly as the true value of w 
approaches the hypothesized value. For example, see Fig. 9-4, where the true value of the 
mean is = 50.5 and the hypothesized value is Hp: tp = 50. The true value of ws is very close 
to 50, and the value for B is 


The z-values corresponding to 48.5 and 51.5 when w = 52 are 


_ 51.5 — 52 


079. 0°63 


£2 


P(Z <= —0.63) — P(Z < —4.43) 


8 = P(48.5 = X = 51.5 when p = 50.5) 
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Figure 9-3 The probability of type II error 
when pw = 52 andn = 10. 
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Figure 9-4 The probability of type II error 
when pw = 50.5 andn = 10. 
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Effect of Sample 
Size on B 


Figure 9-5 The 
probability of type I 
error when p = 52 
andn = 16. 


As shown in Fig. 9-4, the z-values corresponding to 48.5 and 51.5 when p = 50.5 are 


_ 485-505 _ 5, F SSIS 508 at 
a =“Gag a 2-079 


Therefore, 


B = P(-2.53 = Z = 1.27) = P(Z = 1.27) — P(Z = —2.53) 
= 0.8980 — 0.0057 = 0.8923 


Thus, the type II error probability is much higher for the case where the true mean is 50.5 
centimeters per second than for the case where the mean is 52 centimeters per second. Of 
course, in many practical situations we would not be as concerned with making a type II error if 
the mean were “close” to the hypothesized value. We would be much more interested in 
detecting large differences between the true mean and the value specified in the null hypothesis. 

The type II error probability also depends on the sample size n. Suppose that the null 
hypothesis is Ho: » = 50 centimeters per second and that the true value of the mean is 
w. = 52. If the sample size is increased from n = 10 ton = 16, the situation of Fig. 9-5 results. 
The normal distribution on the left is the distribution of X when the mean p. = 50, and the 
normal distribution on the right is the distribution of X when = 52. As shown in Fig. 9-5, 
the type II error probability is 


B = P(48.5 < X <= 51.5 when p = 52) 


When n = 16, the standard deviation of X is o/Vn = 2.5/V/16 = 0.625, and the z-values 
corresponding to 48.5 and 51.5 when pp = 52 are 


48.5 — 52 515-52 
= = -5, d a= = -0, 
a1 0.625 Se eT YT 0.80 
Therefore, 
8 = P(-5.60 < Z < -0.80) = P(Z < —0.80) — P(Z = —5.60) 
= 0.2119 — 0.0000 = 0.2119 
0.8 
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Recall that when n = 10 and w = 52, we found that B = 0.2643; therefore, increasing the 
sample size results in a decrease in the probability of type II error. 

The results from this section and a few other similar calculations are summarized in the 
following table. The critical values are adjusted to maintain equal a forn = 10 andn = 16. 
This type of calculation is discussed later in the chapter. 


Acceptance Sample 
Region Size Qa Bat p = 52 B at w = 50.5 
48.5 <x < 51.5 10 0.0576 0.2643 0.8923 
48 <x<52 10 0.0114 0.5000 0.9705 
48.81 <x < 51.19 16 0.0576 0.0966 0.8606 
48.42 <x < 51.58 16 0.0114 0.2515 0.9578 


The results in boxes were not calculated in the text but can easily be verified by the 
reader. This display and the discussion above reveal four important points: 


1. The size of the critical region, and consequently the probability of a type I error a, 
can always be reduced by appropriate selection of the critical values. 


2. Type Land type II errors are related. A decrease in the probability of one type of error 
always results in an increase in the probability of the other, provided that the sample 
size n does not change. 


3. An increase in sample size reduces B, provided that a is held constant. 


4. When the null hypothesis is false, B increases as the true value of the parameter 
approaches the value hypothesized in the null hypothesis. The value of 8 decreases 
as the difference between the true mean and the hypothesized value increases. 


Generally, the analyst controls the type I error probability « when he or she selects the 
critical values. Thus, it is usually easy for the analyst to set the type I error probability at 
(or near) any desired value. Since the analyst can directly control the probability of 
wrongly rejecting Hy, we always think of rejection of the null hypothesis Hp as a strong 
conclusion. 

Because we can control the probability of making a type I error (or significance level), a 
logical question is what value should be used. The type I error probability is a measure of risk, 
specifically, the risk of concluding that the null hypothesis is false when it really isn’t. So, the 
value of a should be chosen to reflect the consequences (economic, social, etc.) of incorrectly 
rejecting the null hypothesis. Smaller values of « would reflect more serious consequences 
and larger values of « would be consistent with less severe consequences. This is often hard to 
do, and what has evolved in much of scientific and engineering practice is to use the value 
a = 0.05 in most situations, unless there is information available that indicates that this is an 
inappropriate choice. In the rocket propellant problem with n = 10, this would correspond to 
critical values of 48.45 and 51.55. 


A widely used procedure in hypothesis testing is to use a type | error or significance 
level of a = 0.05. This value has evolved through experience, and may not be appro- 
priate for all situations. 
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On the other hand, the probability of type II error B is not a constant, but depends on 
the true value of the parameter. It also depends on the sample size that we have selected. 
Because the type II error probability B is a function of both the sample size and the extent to 
which the null hypothesis H) is false, it is customary to think of the decision to accept Hy as a 
weak conclusion, unless we know that B is acceptably small. Therefore, rather than saying we 
“accept Ho,” we prefer the terminology “fail to reject Ho.” Failing to reject Hp implies that we 
have not found sufficient evidence to reject Hp, that is, to make a strong statement. Failing to 
reject Hy does not necessarily mean that there is a high probability that Ho is true. It may 
simply mean that more data are required to reach a strong conclusion. This can have impor- 
tant implications for the formulation of hypotheses. 

There is a useful analog between hypothesis testing and a jury trial. In a trial the defend- 
ant is assumed innocent (this is like assuming the null hypothesis to be true). If strong evi- 
dence is found to the contrary, the defendant is declared to be guilty (we reject the null hy- 
pothesis). If there is insufficiant evidence the defendant is declared to be not guilty. This is 
not the same as proving the defendant innocent and so, like failing to reject the null hypoth- 
esis, it is a weak conclusion. 

An important concept that we will make use of is the power of a statistical test. 


The power of a statistical test is the probability of rejecting the null hypothesis H 
when the alternative hypothesis is true. 


The power is computed as | — £, and power can be interpreted as the probability of 
correctly rejecting a false null hypothesis. We often compare statistical tests by comparing 
their power properties. For example, consider the propellant burning rate problem when we 
are testing Hp: w = 50 centimeters per second against H,: w # 50 centimeters per second. 
Suppose that the true value of the mean is p = 52. Whenn = 10, we found that B = 0.2643, 
so the power of this testis 1 — B = 1 — 0.2643 = 0.7357 when wp = 52. 

Power is a very descriptive and concise measure of the sensitivity of a statistical test, 
where by sensitivity we mean the ability of the test to detect differences. In this case, the 
sensitivity of the test for detecting the difference between a mean burning rate of 50 centime- 
ters per second and 52 centimeters per second is 0.7357. That is, if the true mean is really 
52 centimeters per second, this test will correctly reject Hp: w = 50 and “detect” this differ- 
ence 73.57% of the time. If this value of power is judged to be too low, the analyst can increase 
either a or the sample size n. 


9-1.3 One-Sided and Two-Sided Hypotheses 


In constructing hypotheses, we will always state the null hypothesis as an equality so that the 
probability of type I error a can be controlled at a specific value. The alternative hypothesis 
might be either one-sided or two-sided, depending on the conclusion to be drawn if Ho is 
rejected. If the objective is to make a claim involving statements such as greater than, less 
than, superior to, exceeds, at least, and so forth, a one-sided alternative is appropriate. If no 
direction is implied by the claim, or if the claim “not equal to” is to be made, a two-sided 
alternative should be used. 


EXAMPLE 9-1 Propellant Burning Rate 

Consider the propellant burning rate problem. Suppose that if 
the burning rate is less than 50 centimeters per second, we 
wish to show this with a strong conclusion. The hypotheses 
should be stated as 


Ho: w = 50 centimeters per second 


Hy: w < 50 centimeters per second 


Here the critical region lies in the lower tail of the distribution 
of X. Since the rejection of Hp is always a strong conclusion, 
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this statement of the hypotheses will produce the desired out- 
come if Hp is rejected. Notice that, although the null hypothe- 
sis is stated with an equals sign, it is understood to include any 
value of jw not specified by the alternative hypothesis. 
Therefore, failing to reject Hy does not mean that . = 50 cen- 
timeters per second exactly, but only that we do not have 
strong evidence in support of 1. 


In some real-world problems where one-sided test procedures are indicated, it is 
occasionally difficult to choose an appropriate formulation of the alternative hypothesis. 
For example, suppose that a soft-drink beverage bottler purchases 10-ounce bottles from a 
glass company. The bottler wants to be sure that the bottles meet the specification on mean 
internal pressure or bursting strength, which for 10-ounce bottles is a minimum strength 
of 200 psi. The bottler has decided to formulate the decision procedure for a specific lot 
of bottles as a hypothesis testing problem. There are two possible formulations for this 


problem, either 


or 


Formulating 
One-Sided 
Hypothesis 


Ho: w = 200 psi 
A: w > 200 psi (9-5) 
Ho: w = 200 psi 
Ay: w < 200 psi (9-6) 


Consider the formulation in Equation 9-5. If the null hypothesis is rejected, the bottles will be 
judged satisfactory; if Hp is not rejected, the implication is that the bottles do not conform to 
specifications and should not be used. Because rejecting Hp is a strong conclusion, this for- 


mulation forces the bottle manufacturer to “demonstrate” that the mean bursting strength of 
the bottles exceeds the specification. Now consider the formulation in Equation 9-6. In this 
situation, the bottles will be judged satisfactory unless H, is rejected. That is, we conclude that 
the bottles are satisfactory unless there is strong evidence to the contrary. 

Which formulation is correct, the one of Equation 9-5 or Equation 9-6? The answer is that 
it depends on the objective of the analysis. For Equation 9-5, there is some probability that H 
will not be rejected (i.e., we would decide that the bottles are not satisfactory), even though the 
true mean is slightly greater than 200 psi. This formulation implies that we want the bottle 
manufacturer to demonstrate that the product meets or exceeds our specifications. Such a for- 
mulation could be appropriate if the manufacturer has experienced difficulty in meeting spec- 
ifications in the past or if product safety considerations force us to hold tightly to the 200-psi 
specification. On the other hand, for the formulation of Equation 9-6 there is some probability 
that Hy will be accepted and the bottles judged satisfactory, even though the true mean is 
slightly less than 200 psi. We would conclude that the bottles are unsatisfactory only when 
there is strong evidence that the mean does not exceed 200 psi, that is, when Hp: pp = 200 psi 
is rejected. This formulation assumes that we are relatively happy with the bottle manufac- 
turer’s past performance and that small deviations from the specification of w = 200 psi are 


not harmful. 
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In formulating one-sided alternative hypotheses, we should remember that rejecting 
H) is always a strong conclusion. Consequently, we should put the statement about 
which it is important to make a strong conclusion in the alternative hypothesis. In 
real-world problems, this will often depend on our point of view and experience with 
the situation. 


9-1.4 P-Values in Hypothesis Tests 


P-Value 


One way to report the results of a hypothesis test is to state that the null hypothesis was or was 
not rejected at a specified a-value or level of significance. This is called fixed significance 
level testing. 

The fixed significance level approach to hypothesis testing is very nice because it leads 
directly to the concepts of type II error and power, which are of considerable value in deter- 
mining the appropriate sample sizes to use in hypothesis testing. But the fixed significance 
level approach does have some disadvantages. 

For example, in the propellant problem above, we can say that Ho: w = 50 was rejected at 
the 0.05 level of significance. This statement of conclusions may be often inadequate because 
it gives the decision maker no idea about whether the computed value of the test statistic was 
just barely in the rejection region or whether it was very far into this region. Furthermore, stat- 
ing the results this way imposes the predefined level of significance on other users of the in- 
formation. This approach may be unsatisfactory because some decision makers might be un- 
comfortable with the risks implied by a = 0.05. 

To avoid these difficulties, the P-value approach has been adopted widely in practice. 
The P-value is the probability that the test statistic will take on a value that is at least as 
extreme as the observed value of the statistic when the null hypothesis Hy is true. Thus, a 
P-value conveys much information about the weight of evidence against Ho, and so a decision 
maker can draw a conclusion at any specified level of significance. We now give a formal def- 
inition of a P-value. 


The P-value is the smallest level of significance that would lead to rejection of the 
null hypothesis Hy with the given data. 


It is customary to call the test statistic (and the data) significant when the null hypothesis 
H, is rejected; therefore, we may think of the P-value as the smallest level a at which the data 
are significant. In other words, the P-value is the observed significance level. Once the 
P-value is known, the decision maker can determine how significant the data are without the 
data analyst formally imposing a preselected level of significance. 

Consider the two-sided hypothesis test for burning rate 


Ao: w = 50 Ay: wp #50 


with n = 16 and o = 2.5. Suppose that the observed sample mean is x = 51.3 centimeters 
per second. Figure 9-6 shows a critical region for this test with the value of x = 51.3 and the 
symmetric value 48.7. The P-value of the test is the probability above 51.3 plus the 


Figure 9-6 P-value 
is the area of the 
shaded region when 
x = 51.3. 
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probability below 48.7. The P-value is easy to compute after the test statistic is observed. In 
this example, 


P-value = 1 — P(48.7 < X < 51.3) 
ay (#2 750, 513 =) 
2.5/V16 2.5/V16 
1 — P(-2.08 < Z < 2.08) 
= 1 — 0.962 = 0.038 


The P-value tells us that if the null hypothesis Hj) = 50 is true, the probability of obtain- 
ing a random sample whose mean is at least as far from 50 as 51.3 (or 48.7) is 0.038. 
Therefore, an observed sample mean of 51.3 is a fairly rare event if the null hypothesis Hy = 
50 is really true. Compared to the “standard” level of significance 0.05, our observed P-value 
is smaller, so if we were using a fixed significance level of 0.05, the null hypothesis would be 
rejected. In fact, the null hypothesis Hy) = 50 would be rejected at any level of significance 
greater than or equal to 0.038. This illustrates the boxed definition above; the P-value is the 
smallest level of significance that would lead to rejection of Hy = 50. 

Operationally, once a P-value is computed, we typically compare it to a predefined sig- 
nificance level to make a decision. Often this predefined significance level is 0.05. However, 
in presenting results and conclusions, it is standard practice to report the observed P-value 
along with the decision that is made regarding the null hypothesis. 

Clearly, the P-value provides a measure of the credibility of the null hypothesis. Specifically, 
it is the risk that you have made an incorrect decision if you reject the null hypothesis Ho. The 
P-value is not the probability that the null hypothesis is false, nor is 1 — P the probability that 
the null hypothesis is true. The null hypothesis is either true or false (there is no probability as- 
sociated with this), and so the proper interpretation of the P-value is in terms of the risk of 
wrongly rejecting the null hypothesis Hp. 

It is not always easy to compute the exact P-value for a statistical test. However, most 
modern statistics software packages report the results of hypothesis testing problems in terms 
of P-values. We will use the P-value approach extensively. 


9-1.5 Connection between Hypothesis Tests and Confidence Intervals 


There is a close relationship between the test of a hypothesis about any parameter, say 0, and 
the confidence interval for 0. If [/, u] is a 100(1 — a)% confidence interval for the parameter 
0, the test of size a of the hypothesis 

Ao: 8 = 0 

H 1: 0# 8 
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will lead to rejection of Ho if and only if 6) is not in the 100(1 — a)% CI [/, u]. As an illus- 
tration, consider the escape system propellant problem with x = 51.3, 0 = 2.5, andn = 16. 
The null hypothesis Hp: ~ = 50 was rejected, using a = 0.05. The 95% two-sided CI 
on wp can be calculated using Equation 8-7. This CI is 51.3 + 1.96(2.5/V/16) and this is 
50.075 = p = 52.525. Because the value 4) = 50 is not included in this interval, the null 
hypothesis Ho: pp = 50 is rejected. 

Although hypothesis tests and CIs are equivalent procedures insofar as decision mak- 
ing or inference about w is concerned, each provides somewhat different insights. For 
instance, the confidence interval provides a range of likely values for w at a stated confi- 
dence level, whereas hypothesis testing is an easy framework for displaying the risk levels 
such as the P-value associated with a specific decision. We will continue to illustrate the 
connection between the two procedures throughout the text. 


9-1.6 General Procedure for Hypothesis Tests 


This chapter develops hypothesis-testing procedures for many practical problems. Use of the 
following sequence of steps in applying hypothesis-testing methodology is recommended. 

1. Parameter of interest: From the problem context, identify the parameter of interest. 
Null hypothesis, Hy): State the null hypothesis, Hp. 
Alternative hypothesis, H,: Specify an appropriate alternative hypothesis, H,. 
Test statistic: Determine an appropriate test statistic. 
Reject Hy if: State the rejection criteria for the null hypothesis. 


A we} 


Computations: Compute any necessary sample quantities, substitute these into the 
equation for the test statistic, and compute that value. 


7. Draw conclusions: Decide whether or not Hp should be rejected and report that in 
the problem context. 


Steps 1-4 should be completed prior to examination of the sample data. This sequence of 
steps will be illustrated in subsequent sections. 

In practice, such a formal and (seemingly) rigid procedure is not always necessary. 
Generally, once the experimenter (or decision maker) has decided on the question of interest 
and has determined the design of the experiment (that is, how the data are to be collected, 
how the measurements are to be made, and how many observations are required), only three 
steps are really required: 


1. Specify the test statistic to be used (such as Zo). 
Specify the location of the critical region (two-tailed, upper-tailed, or lower-tailed). 


3. Specify the criteria for rejection (typically, the value of «, or the P-value at which 
rejection should occur). 


These steps are often completed almost simultaneously in solving real-world problems, 
although we emphasize that it is important to think carefully about each step. That is why we 
present and use the eight-step process; it seems to reinforce the essentials of the correct 
approach. While you may not use it every time in solving real problems, it is a helpful frame- 
work when you are first learning about hypothesis testing. 


Statistical versus Practical Significance 
We noted previously that reporting the results of a hypothesis test in terms of a P-value is very 
useful because it conveys more information than just the simple statement “reject Hy” or “fail 
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to reject Hp.” That is, rejection of Hp at the 0.05 level of significance is much more meaning- 
ful if the value of the test statistic is well into the critical region, greatly exceeding the 5% crit- 
ical value, than if it barely exceeds that value. 

Even a very small P-value can be difficult to interpret from a practical viewpoint when 
we are making decisions because, while a small P-value indicates statistical significance in 
the sense that H, should be rejected in favor of H,, the actual departure from Hp that has been 
detected may have little (if any) practical significance (engineers like to say “engineering 
significance”). This is particularly true when the sample size n is large. 

For example, consider the propellant burning rate problem of Example 9-1 where we are 
testing Hy: ~ = 50 centimeters per second versus H,:  ~ 50 centimeters per second with 
o = 2.5. If we suppose that the mean rate is really 50.5 centimeters per second, this is not a se- 
rious departure from Ho: ~ = 50 centimeters per second in the sense that if the mean really is 
50.5 centimeters per second there is no practical observable effect on the performance of the 
air crew escape system. In other words, concluding that . = 50 centimeters per second when 
it is really 50.5 centimeters per second is an inexpensive error and has no practical signifi- 
cance. For a reasonably large sample size, a true value of jt = 50.5 will lead to a sample x that 
is close to 50.5 centimeters per second, and we would not want this value of x from the sam- 
ple to result in rejection of Ho. The following display shows the P-value for testing Ho: w = 50 
when we observe x = 50.5 centimeters per second and the power of the test at a = 0.05 when 
the true mean is 50.5 for various sample sizes n: 


Sample Size P-value Power (at a = 0.05) 
n When x = 50.5 When True p = 50.5 

10 0.527 0.097 

25 0.317 0.170 

50 0.157 0.293 

100 0.046 0.516 

400 6.3 X 10° 0.979 

1000 2.5 x 107!° 1.000 


The P-value column in this display indicates that for large sample sizes, the observed 
sample value of x = 50.5 would strongly suggest that Hy: ~ = 50 should be rejected, even 
though the observed sample results imply that from a practical viewpoint the true mean does 
not differ much at all from the hypothesized value fry = 50. The power column indicates that 
if we test a hypothesis at a fixed significance level a, and even if there is little practical differ- 
ence between the true mean and the hypothesized value, a large sample size will almost 
always lead to rejection of Hy. The moral of this demonstration is clear: 


Be careful when interpreting the results from hypothesis testing when the sample size 
is large, because any small departure from the hypothesized value 1p will probably be 
detected, even when the difference is of little or no practical significance. 


EXERCISES FOR SECTION 9-1 


9-1. Ineach of the following situations, state whether it is a (c) Ho: x = 50, Hi: x # 50 
correctly stated hypothesis testing problem and why. (d) Ap: p = 0.1, Hy: p = 0.5 
(@) Ho: w = 25, Hy: pw # 25 (e) Ho: s = 30, Ay: s > 30 
(b) Hy: o > 10, Hy: 0 = 10 
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9-2. A semiconductor manufacturer collects data from a 
new tool and conducts a hypothesis test with the null hypothe- 
sis that a critical dimension mean width equals 100 nm. The 
conclusion is to not reject the null hypothesis. Does this result 
provide strong evidence that the critical dimension mean 
equals 100 nm? Explain. 

9-3. The standard deviation of critical dimension thickness 

in semiconductor manufacturing is o = 20 nm. 

(a) State the null and alternative hypotheses used to demon- 
strate that the standard deviation is reduced. 

(b) Assume that the previous test does not reject the null 
hypothesis. Does this result provide strong evidence that 
the standard deviation has not been reduced? Explain. 

9-4. The mean pull-off force of a connector depends on cure 

time. 

(a) State the null and alternative hypotheses used to demon- 
strate that the pull-off force is below 25 newtons. 

(b) Assume that the previous test does not reject the null hypoth- 
esis. Does this result provide strong evidence that the pull-off 
force is greater than or equal to 25 newtons? Explain. 


9-5. A textile fiber manufacturer is investigating a new drap- 

ery yarn, which the company claims has a mean thread elonga- 

tion of 12 kilograms with a standard deviation of 0.5 kilo- 

grams. The company wishes to test the hypothesis Ho: w = 12 

against H,: ~ < 12, using arandom sample of four specimens. 

(a) What is the type I error probability if the critical region is 
defined as x < 11.5 kilograms? 

(b) Find 8 for the case where the true mean elongation is 
11.25 kilograms. 

(c) Find B for the case where the true mean is 11.5 kilograms. 


9-6. Repeat Exercise 9-5 using a sample size of n = 16 and 
the same critical region. 


9-7. In Exercise 9-5, find the boundary of the critical region 

if the type I error probability is 

(a) a = 0.01 andn = 4 (c) a = 0.01 andn = 16 

(b) a = 0.05 andn = 4 (d) a = 0.05 and n = 16 

9-8. In Exercise 9-5, calculate the probability of a type II 

error if the true mean elongation is 11.5 kilograms and 

(a) a = 0.05 andn = 4 

(b) a = 0.05 and n = 16 

(c) Compare the values of B calculated in the previous parts. 
What conclusion can you draw? 

9-9. In Exercise 9-5, calculate the P-value if the observed 

statistic is 

(a) x= 11.25 (b) x=11.0 (c) x= 11.75 

9-10. The heat evolved in calories per gram of a cement 

mixture is approximately normally distributed. The mean is 

thought to be 100 and the standard deviation is 2. We wish to 

test Hp: » = 100 versus H,: 1 # 100 with a sample of n = 9 

specimens. 

(a) If the acceptance region is defined as 98.5 = x = 101.5, 
find the type I error probability a. 

(b) Find 8 for the case where the true mean heat evolved is 103. 


(c) Find B for the case where the true mean heat evolved is 
105. This value of 8 is smaller than the one found in part 
(b) above. Why? 

9-11. Repeat Exercise 9-10 using a sample size of n = 5 

and the same acceptance region. 

9-12. In Exercise 9-10, find the boundary of the critical 

region if the type I error probability is 

(a) a = 0.01 andn = 9 (c) a = 0.01 andn = 5 

(b) a = 0.05 and n = 9 (d) a = 0.05 andn = 5 

9-13. In Exercise 9-10, calculate the probability of a type II 

error if the true mean heat evolved is 103 and 

(a) a = 0.05 andn = 9 

(b) a = 0.05 andn = 5 

(c) Compare the values of B calculated in the previous parts. 
What conclusion can you draw? 


9-14. In Exercise 9-10, calculate the P-value if the observed 
statistic is 

(a) x= 98 (b) x= 101 (c) x= 102 

9-15. A consumer products company is formulating a new 


shampoo and is interested in foam height (in millimeters). 

Foam height is approximately normally distributed and has a 

standard deviation of 20 millimeters. The company wishes to 

test Hy: » = 175 millimeters versus H,: ~ > 175 millime- 

ters, using the results of n = 10 samples. 

(a) Find the type I error probability a if the critical region is 
x > 185. 

(b) What is the probability of type II error if the true mean 
foam height is 185 millimeters? 

(c) Find 8 for the true mean of 195 millimeters. 

9-16. Repeat Exercise 9-15 assuming that the sample size is 

n = 16 and the boundary of the critical region is the same. 

9-17. In Exercise 9-15, find the boundary of the critical 

region if the type I error probability is 

(a) a = 0.01 andn = 10 (c) a = 0.01 andn = 16 

(b) a = 0.05 and n = 10 (d) a = 0.05 and n = 16 

9-18. In Exercise 9-15, calculate the probability of a type II 

error if the true mean foam height is 185 millimeters and 

(a) a = 0.05 andn = 10 

(b) a = 0.05 andn = 16 

(c) Compare the values of B calculated in the previous parts. 
What conclusion can you draw? 


9-19. InExercise 9-15, calculate the P-value if the observed 
statistic is 

(a) x = 180 (b) x =190 (c) x =170 

9-20. A manufacturer is interested in the output voltage of a 


power supply used in a PC. Output voltage is assumed to be 

normally distributed, with standard deviation 0.25 volt, and 

the manufacturer wishes to test Hj: = 5 volts against 

Hi: # 5 volts, using n = 8 units. 

(a) The acceptance region is 4.85 = x = 5.15. Find the value 
of a. 

(b) Find the power of the test for detecting a true mean output 
voltage of 5.1 volts. 
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9-21. Rework Exercise 9-20 when the sample size is 16 and 
the boundaries of the acceptance region do not change. What 
impact does the change in sample size have on the results of 
parts (a) and (b)? 

9-22. In Exercise 9-20, find the boundary of the critical re- 
gion if the type I error probability is 

(a) a = 0.01 andn = 8 (c) a = 0.01 andn = 16 

(b) a = 0.05 andn = 8 (d) a = 0.05 and n = 16 


9-23. InExercise 9-20, calculate the P-value if the observed 
statistic is 

(a) x=5.2 (b) x=4.7 (c) x=5.1 

9-24. In Exercise 9-20, calculate the probability of a type II 


error if the true mean output is 5.05 volts and 

(a) a = 0.05 andn = 10 

(b) a = 0.05 and n = 16 

(c) Compare the values of B calculated in the previous parts. 
What conclusion can you draw? 


9-25. The proportion of adults living in Tempe, Arizona, 

who are college graduates is estimated to be p = 0.4. To test 

this hypothesis, a random sample of 15 Tempe adults is 

selected. If the number of college graduates is between 4 and 8, 

the hypothesis will be accepted; otherwise, we will conclude 

that p # 0.4. 

(a) Find the type I error probability for this procedure, assum- 
ing that p = 0.4. 

(b) Find the probability of committing a type II error if the 
true proportion is really p = 0.2. 

9-26. The proportion of residents in Phoenix favoring the 

building of toll roads to complete the freeway system is 

believed to be p = 0.3. If a random sample of 10 residents 
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shows that 1 or fewer favor this proposal, we will conclude 

that p < 0.3. 

(a) Find the probability of type I error if the true proportion is 
p = 0.3. 

(b) Find the probability of committing a type II error with this 
procedure if p = 0.2. 

(c) What is the power of this procedure if the true proportion 
is p = 0.2? 

9-27. Arandom sample of 500 registered voters in Phoenix 

is asked if they favor the use of oxygenated fuels year-round 

to reduce air pollution. If more than 400 voters respond posi- 
tively, we will conclude that more than 60% of the voters favor 
the use of these fuels. 

(a) Find the probability of type I error if exactly 60% of the 
voters favor the use of these fuels. 

(b) What is the type II error probability 8 if 75% of the voters 
favor this action? 

Hint: use the normal approximation to the binomial. 
9-28. If we plot the probability of accepting Hp:~ = po 
versus various values of w and connect the points with a 
smooth curve, we obtain the operating characteristic curve 
(or the OC curve) of the test procedure. These curves are used 
extensively in industrial applications of hypothesis testing to 
display the sensitivity and relative performance of the test. 
When the true mean is really equal to jt, the probability of 
accepting H)is 1 — a. 

(a) Construct an OC curve for Exercise 9-15, using values 
of the true mean p of 178, 181, 184, 187, 190, 193, 196, 
and 199. 

(b) Convert the OC curve into a plot of the power function of 
the test. 


9-2 TESTS ON THE MEAN OF A NORMAL DISTRIBUTION, 


VARIANCE KNOWN 


In this section, we consider hypothesis testing about the mean p of a single normal population 
where the variance of the population o? is known. We will assume that a random sample X,, 
X>,..., X, has been taken from the population. Based on our previous discussion, the sample 
mean X is an unbiased point estimator of with variance o7/n. 


9-2.1 Hypothesis Tests on the Mean 


Suppose that we wish to test the hypotheses 


Ao: & = Bo 


(9-7) 
Ay: w # po 


where jo is a specified constant. We have a random sample X, X3,..., X,, from a normal pop- 
ulation. Since X has a normal distribution (i.e., the sampling distribution of X is normal) 
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Test Statistic 


Zo 


with mean 1 and standard deviation o/Vn if the null hypothesis is true, we could calculate a 
P-value or construct a critical region based on the computed value of the sample mean X, as 
in Section 9-1.2. 

It is usually more convenient to standardize the sample mean and use a test statistic 
based on the standard normal distribution. That is, the test procedure for Hy: 1. = po uses the 
test statistic: 


Gen 


AN IE (0-8) 


If the null hypothesis Ho: ~ = wo is true, E(X) = po, and it follows that the distribution of Z, 
is the standard normal distribution [denoted N(0, 1)]. 

The hypothesis testing procedure is as follows. Take a random sample of size n and com- 
pute the value of the sample mean x. To test the null hypothesis using the P-value approach, 
we would find the probability of observing a value of the sample mean that is at least as ex- 
treme as x, given that the null hypothesis is true. The standard normal z-value that corresponds 
to x is found from the test statistic in Equation 9-8: 


_ X— Mo 


z= 
si ol Vn 


In terms of the standard normal cumulative distribution function (cdf ), the probability we are 
seeking is 1 — ®(|zp|). The reason that the argument of the standard normal cdf is |zo| is that the 
value of z) could be either positive or negative, depending on the observed sample mean. 
Because this is a two-tailed test, this is only one-half of the P-value. Therefore, for the two- 
sided alternative hypothesis, the P-value is 


P= [1 — ®(kzo))] (9-9) 
This is illustrated in Fig. 9-7(a) 
Now let’s consider the one-sided alternatives. Suppose that we are testing 
Ho: b= Bo 
Ay: & > Lo (9-10) 


Once again, suppose that we have a random sample of size n and that the sample mean is X. 
We compute the test statistic from Equation 9-8 and obtain z,. Because the test is an upper-tailed 
test, only values of x that are greater than 9 are consistent with the alternative hypothesis. 


Two-tailed test Upper-tailed test Lower-tailed test 


(a) 


N(O,1) N(O,1) 


P-value = 2[1 - ®(lz9])] P-value = 1 - ®(z0) P-value = ®(z9) 


20 (e) 20 0) (e} 20 


(b) (c) 


Figure 9-7 The P-value for a z-test. (a) The two-sided alternative H,:  # po. (b) The one-sided alternative Hy: p> Wo. 
(c) The one-sided alternative H,: w~ < po. 
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Therefore, the P-value would be the probability that the standard normal random variable is 
greater than the value of the test statistic z). This P-value is computed as 


P =1- ®&z) (9-11) 


This P-value is shown in Fig. 9-7(b) 
The lower-tailed test involves the hypotheses 


Ao: & = Yo 
Ay: & < Mo (9-12) 


Suppose that we have a random sample of size n and that the sample mean is x. We compute 
the test statistic from equation 9-8 and obtain zp). Because the test is a lower-tailed test, only 
values of x that are less than jt) are consistent with the alternative hypothesis. Therefore, the 
P-value would be the probability that the standard normal random variable is less than the 
value of the test statistic z). This P-value is computed as 


P = ®(Z) (9-13) 


and shown in Fig. 9-7(c) 

The reference distribution for this test is the standard normal distribution. The test is 
usually called a z-test. 

We can also use the fixed significance level approach with the z-test. All we have to do is 
determine where to place the critical regions for the two-sided and one-sided alternative hy- 
potheses. First consider the two-sided alternative in Equation 9-10. Now if Ho: w = wo is true, 
the probability is 1 — o that the test statistic Z, falls between —z,,> and z,/2, where z,/2, is the 
100c/2 percentage point of the standard normal distribution. The regions associated with z,,> 
and —z,/2 are illustrated in Fig. 9-8(a). Note that the probability is a that the test statistic Zp will 
fall in the region Zp > z,2 Or Zp < —Zq2 When Ho: KW = [Wo is true. Clearly, a sample producing 
a value of the test statistic that falls in the tails of the distribution of Z) would be unusual if Hp: 
LL = Wo is true; therefore, it is an indication that Hy is false. Thus, we should reject H, if either 


2 > Zo/2 (9-14) 
or 
2 < —Za2 (9-15) 
and we should fail to reject Ho if 
~Za/2 = 20 = Za/2 (9-16) 
Two-tailed test Upper-tailed test Lower-tailed test 


N(O,1) 


Critical region Critical region Critical region 
aN ye es N 


—Za/2 (e) Za/2 Zo (0) Za za (e) Zo 
(a) (b) (c) 


Figure 9-8 The distribution of Z) when Ho: w = po is true, with critical region for (a) the two-sided alternative H,: p # jo. 
(b) The one-sided alternative H,: 1 > po. (c) the one-sided alternative H,: w~ < po. 


Acceptance 
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Summary of 
Tests on the 
Mean, Variance 
Known 


Equations 9-14 and 9-15 define the critical region or rejection region for the test. The type I 
error probability for this test procedure is a. 

We may also develop fixed significance level testing procedures for the one-sided alter- 
natives. Consider the upper-tailed case in Equation 9-10. 

In defining the critical region for this test, we observe that a negative value of the test 
statistic Z, would never lead us to conclude that Hp: w = po is false. Therefore, we would 
place the critical region in the upper tail of the standard normal distribution and reject Hy 
if the computed value Z, is too large. Refer to Fig. 9-8(b). That is, we would reject Ho if 


Z > Zy (9-17) 
Similarly, to test the lower-tailed case in Equation 9-12, we would calculate the test sta- 


tistic Z) and reject H, if the value of Z) is too small. That is, the critical region is in the lower 
tail of the standard normal distribution as in Fig. 9-8(c), and we reject Hy if 


Z9 < —Zq (9-18) 


Testing Hypotheses on the Mean, Variance Known (Z-Tests) 


Null hypothesis: Ho: & = Bo 
Bere ee Ko 
Test statistic: LZ = 
est statistic > SIs 
Rejection Criterion 
Alternative Hypotheses P-Value for Fixed-Level Tests 
Hy: pw # wo Probability above |z,| and 2 > Zu/2 OF  < —Zap 


probability below —|zo|, 
P21 — @((\26\)1 


Hy: 2 > wo Probability above Zp, dy 2 ee, 
ee) 
Ay: w < wo Probability below Zp, hy SK, 
PCG) 


The P-values and critical regions for these situations are shown in Figs. 9-7 and 9-8. 


It is easier to understand the critical region and the test procedure, in general, when the test 
statistic is Z) rather than X. However, the same critical region can always be written in terms of 
the computed value of the sample mean x. A procedure identical to the fixed significance level 
test above is as follows: 

Reject Ho: w = wo if either x > aorx <b 


where 


a = Vo + Zq0/Vn and b = Uo — Z20/Vn 
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EXAMPLE 9-2 Propellant Burning Rate 
Air crew escape systems are powered by a solid propellant. The 
burning rate of this propellant is an important product charac- 
teristic. Specifications require that the mean burning rate must 
be 50 centimeters per second. We know that the standard devi- 
ation of burning rate is o = 2 centimeters per second. The ex- 
perimenter decides to specify a type I error probability or 
significance level of a = 0.05 and selects a random sample 
of n = 25 and obtains a sample average burning rate of 
xX = 51.3 centimeters per second. What conclusions should be 
drawn? 

We may solve this problem by following the seven-step 
procedure outlined in Section 9-1.6. This results in 


1. Parameter of interest: The parameter of interest 
is p, the mean burning rate. 


2. Null hypothesis, Hy: Ho: w~ = 50 centimeters per 
second 


3. Alternative hypothesis, H,: H,: w # 50 centime- 
ters per second 


4. Test statistic: The test statistic is 


: _*— Bo 
0 o/Vn 


5. Reject H, if: Reject H, if the P-value is less than 
0.05. To use a fixed significance level test, the 
boundaries of the critical region would be Zo 95 = 
1.96 and —Zp925 = —1.96. 


6. Computations: Since x = 51.3 ando = 2, 
_ 51.3 — 50 
%0 2/25 
7. Conclusion: Since zy = 3.25 the p-value is 
P-value = 2[1 — ®(3.25)] = 0.0012 


= 3.25 


we reject Ho: = 50 at the 0.05 level of significance. 

Practical Interpretation: We conclude that the mean burn- 

ing rate differs from 50 centimeters per second, based on a sam- 
ple of 25 measurements. In fact, there is strong evidence that the 


mean burning rate exceeds 50 centimeters per second. 


9-2.2 Type II Error and Choice of Sample Size 


In testing hypotheses, the analyst directly selects the type I error probability. However, the 
probability of type II error B depends on the choice of sample size. In this section, we will 
show how to calculate the probability of type II error 8. We will also show how to select the 
sample size to obtain a specified value of B. 


Finding the Probability of Type II Error B 
Consider the two-sided hypotheses 


Ho: & = Bo 
Ay: w # Yo 


Suppose that the null hypothesis is false and that the true value of the mean is wp = wo + 4, 
say, where 5 > 0. The test statistic Zp is 


Z X= wo _ X= (Ho + 8) | 5Vn 
° a/Va o/Vn o 
Therefore, the distribution of Z) when H, is true is 
dVn 
Zy~ v( — 1) (9-19) 


The distribution of the test statistic Z) under both the null hypothesis Hy and the alternate 
hypothesis H, is shown in Fig. 9-9. From examining this figure, we note that if H, is true, a 
type II error will be made only if —z,/7 = Zp) S Za. where Zy ~ N(8Vn/o, 1). That is, the 
probability of the type II error 6 is the probability that Zp falls between —z,/ and Zy/. given 
that H, is true. This probability is shown as the shaded portion of Fig. 9-9. Expressed mathe- 
matically, this probability is 
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Figure 9-9 The 
distribution of Zo 
under Hy and H,. 


Probability of a 
Type II Error for 
a Two-Sided Test 
on the Mean, 
Variance Known 


Sample Size for 

a Two-Sided 

Test on the Mean, 
Variance Known 


Under Hy: = Uo Under H: H#Uy 


PS (cap iz yt) oa ®(-ey2 a ue) (9-20) 


where (z) denotes the probability to the left of z in the standard normal distribution. Note 
that Equation 9-20 was obtained by evaluating the probability that Z) falls in the interval 
[—Zo/2» Za/2] When H, is true. Furthermore, note that Equation 9-20 also holds if 8 < 0, due 
to the symmetry of the normal distribution. It is also possible to derive an equation similar to 
Equation 9-20 for a one-sided alternative hypothesis. 


Sample Size Formulas 
One may easily obtain formulas that determine the appropriate sample size to obtain a partic- 
ular value of B for a given 5 and a. For the two-sided alternative hypothesis, we know from 


Equation 9-20 that 
dVn dVn 
p= 0(e2- o )-@(-aa- o ) 


or, if 5 > 0, 


oun) (9-21) 


B= (can ars 


since ®(—z./. — 8Vn/o) = 0 when 8 is positive. Let zg be the 1008 upper percentile of the 
standard normal distribution. Then, 8 = ®(—zg). From Equation 9-21, 


8Vn 


~ 2p ™ 20/2 ~ 


or 


where 6 = Bo (9-22) 
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If n is not an integer, the convention is to round the sample size up to the next integer. This 
approximation is good when ®(—z,,. — 38Vn/o) is small compared to B. For either of the 
one-sided alternative hypotheses, the sample size required to produce a specified type II error 
with probability B given 6 and a is 


Sample Size for 
a One-Sided Test 


on the Mean, eS 


Variance Known 


where 


5 = — Yo (9-23) 


EXAMPLE 9-3 Propellant Burning Rate Type II Error 


Consider the rocket propellant problem of Example 9-2. 
Suppose that the true burning rate is 49 centimeters per sec- 
ond. What is 8 for the two-sided test with a = 0.05, o = 2, 
and n = 25? 

Here 6 = 1| and z,/. = 1.96. From Equation 9-20, 


= @(—0.54) — ©(—4.46) = 0.295 

The probability is about 0.3 that this difference from 50 cen- 
timeters per second will not be detected. That is, the proba- 
bility is about 0.3 that the test will fail to reject the null hy- 
pothesis when the true burning rate is 49 centimeters per 


Suppose that the analyst wishes to design the test so that 
if the true mean burning rate differs from 50 centimeters per 
second by as much as | centimeter per second, the test will de- 
tect this (i.e., reject Hy: w = 50) with a high probability, say, 
0.90. Now, we note that o = 2,6 = 51 — 50 =1, a = 0.05, and 
B = 0.10. Since zy. = Zo.925 = 1.96 and zg = 2.19 = 1.28, 
the sample size required to detect this departure from Ho: 1 = 50 
is found by Equation 9-22 as 


(Za + Zp)? 0? (1.96 + 1.28)72? 
n= 3 = ae ~ 42 


The approximation is good here, since ®(—z,. — 8Vn/o) 
= ®(—1.96 — (1)V42/2) = ®(—5.20) = 0, which is small 
relative to B. 


second. 


Practical Interpretation: A sample size of n = 25 results in 
reasonable, but not great power = 1 


Practical Interpretation: To achieve a much higher power 
of 0.90 you will need a considerably large sample size, n = 42 
instead of n = 25. 


p=1-0.3=0.70. 


Using Operating Characteristic Curves 

When performing sample size or type II error calculations, it is sometimes more conven- 
ient to use the operating characteristic (OC) curves in Appendix Charts Vla and VID. 
These curves plot B as calculated from Equation 9-20 against a parameter d for various 
sample sizes n. Curves are provided for both a = 0.05 and a = 0.01. The parameter d is 
defined as 


_ |e wol _ [I 


oO oO 


d (9-24) 


so one set of operating characteristic curves can be used for all problems regardless of the 
values of |1) and o. From examining the operating characteristic curves or from Equation 9-20 
and Fig. 9-9, we note that 


1. The further the true value of the mean wp is from po, the smaller the probability of 
type I error B for a given n and a. That is, we see that for a specified sample size and 
a, large differences in the mean are easier to detect than small ones. 
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2. Fora given 6 and a, the probability of type II error B decreases as n increases. That 
is, to detect a specified difference 5 in the mean, we may make the test more power- 
ful by increasing the sample size. 


EXAMPLE 9-4 Propellant Burning Rate Type II Error from OC Curve 


Consider the propellant problem in Example 9-2. Suppose that 
the analyst is concerned about the probability of type II error 
if the true mean burning rate is ~ = 51 centimeters per sec- 
ond. We may use the operating characteristic curves to find B. 
Note that 8 = 51 — 50 = 1, n = 25, o = 2, and a = 0.05. 
Then, using Equation 9-24 gives 


Ik — Bol — [8] 1 
o o 2 


d= 


and from Appendix Chart VIIa, with n = 25, we find that B = 
0.30. That is, if the true mean burning rate is w = 51 centime- 
ters per second, there is approximately a 30% chance that this 
will not be detected by the test with n = 25. 


EXAMPLE 9-5 Propellant Burning Rate Sample Size from OC Curve 


Once again, consider the propellant problem in Example 9-2. 
Suppose that the analyst would like to design the test so that if 
the true mean burning rate differs from 50 centimeters per sec- 
ond by as much as | centimeter per second, the test will detect 
this (i.e., reject Hy: p = 50) with a high probability, say, 0.90. 
This is exactly the same requirement as in Example 9-3, where 


we used Equation 9-22 to find the required sample size to be 
n = 42. The operating characteristic curves can also be used to 
find the sample size for this test. Since d = | — pol/o = 1/2, 
a = 0.05, and B = 0.10, we find from Appendix Chart VIIa that 
the required sample size is approximately n = 40. This closely 
agrees with the sample size calculated from Equation 9-22. 


In general, the operating characteristic curves involve three parameters: 8, d, and n. 
Given any two of these parameters, the value of the third can be determined. There are two 
typical applications of these curves: 


For a given n and d, find B (as illustrated in Example 9-4). This kind of problem is 
often encountered when the analyst is concerned about the sensitivity of an exper- 
iment already performed, or when sample size is restricted by economic or other 
factors. 


Use of OC 1. 
Curves 


2. Fora given B and d, find n. This was illustrated in Example 9-5. This kind of problem 
is usually encountered when the analyst has the opportunity to select the sample size 
at the outset of the experiment. 


Operating characteristic curves are given in Appendix Charts VIIc and VIld for the one- 
sided alternatives. If the alternative hypothesis is either H,: jp > 9 or Hy: pw < po, the abscissa 
scale on these charts is 


d= |e = Pol (9-25) 


oO 


Using the Computer 

Many statistics software packages will calculate sample sizes and type II error probabili- 
ties. To illustrate, here are some computations from Minitab for the propellant burning rate 
problem: 
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Minitab Computations 


Power and Sample Size 


1-Sample Z Test 

Testing mean = null (versus not = null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 2 


Sample Target Actual 
Difference Size Power Power 
1 43 0.9000 0.9064 


Power and Sample Size 


1-Sample Z Test 

Testing mean = null (versus not = null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 2 


Sample Target Actual 
Difference Size Power Power 
1 28 0.7500 0.7536 


Power and Sample Size 


1-Sample Z Test 

Testing mean = null (versus not = null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 2 


Sample 
Difference Size Power 
1 25 0.7054 


In the first part of the boxed display, we asked Minitab to work Example 9-3, that is, to find 
the sample size n that would allow detection of a difference from 1) = 50 of 1 centimeter per 
second with power of 0.9 and a = 0.05. The answer, n = 43, agrees closely with the calcu- 
lated value from Equation 9-22 in Example 9-3, which was n = 42. The difference is due to 
Minitab using a value of z, that has more than two decimal places. The second part of the com- 
puter output relaxes the power requirement to 0.75. Note that the effect is to reduce the 
required sample size to n = 28. The third part of the output is the solution to Example 9-4, 
where we wish to determine the type II error probability of (8) or the power = 1 — 8 for the 
sample size n = 25. Note that Minitab computes the power to be 0.7054, which agrees closely 
with the answer obtained from the OC curve in Example 9-4. Generally, however, the com- 
puter calculations will be more accurate than visually reading values from an OC curve. 


9-2.3 Large-Sample Test 


We have developed the test procedure for the null hypothesis Ho: 1 = po assuming that the 
population is normally distributed and that o* is known. In many if not most practical 
situations, 0 will be unknown. Furthermore, we may not be certain that the population is well 
modeled by a normal distribution. In these situations, if 1 is large (say, n > 40) the sample 
standard deviation s can be substituted for o in the test procedures with little effect. Thus, 
while we have given a test for the mean of a normal distribution with known o”, it can be easily 
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converted into a large-sample test procedure for unknown o” that is valid regardless of the 
form of the distribution of the population. This large-sample test relies on the central limit the- 
orem just as the large-sample confidence interval on wp that was presented in the previous 
chapter did. Exact treatment of the case where the population is normal, o? is unknown, and n 
is small involves use of the ¢ distribution and will be deferred until Section 9-3. 


EXERCISES FOR SECTION 9-2 


9-29. State the null and alternative hypothesis in each case. 

(a) A hypothesis test will be used to potentially provide evi- 
dence that the population mean is greater than 10. 

(b) A hypothesis test will be used to potentially provide evi- 
dence that the population mean is not equal to 7. 

(c) A hypothesis test will be used to potentially provide evi- 
dence that the population mean is less than 5. 


9-30. A hypothesis will be used to test that a population mean 
equals 7 against the alternative that the population mean does 
not equal 7 with known variance o. What are the critical values 
for the test statistic Z) for the following significance levels? 

(a) 0.01 (b) 0.05 (c) 0.10 

9-31. A hypothesis will be used to test that a population 
mean equals 10 against the alternative that the population mean 
is greater than 10 with known variance o. What is the critical 
value for the test statistic Z) for the following significance levels? 
(a) 0.01 (b) 0.05 (c) 0.10 

9-32. A hypothesis will be used to test that a population 
mean equals 5 against the alternative that the population mean 
is less than 5 with known variance o. What is the critical value 
for the test statistic Z) for the following significance levels? 
(a) 0.01 (b) 0.05 (c) 0.10 

9-33. For the hypothesis test Hp: p = 7 against H): w #7 
and variance known, calculate the P-value for each of the 
following test statistics. 

(a) Z = 2.05 (b) 2% = —1.84 (c) z = 0.4 
9-34. For the hypothesis test Hp: » = 10 against Hy: p > 10 
and variance known, calculate the P-value for each of the 
following test statistics. 

(a) Z = 2.05 (b) 2% = —1.84 (c) z = 0.4 
9-35. For the hypothesis test Ho: p = 5 against Hy: w <5 
and variance known, calculate the P-value for each of the 
following test statistics. 

(a) Z = 2.05 (b) 2% = —1.84 (c) 7% = 0.4 
9-36. Output from a software package is given below: 


One-Sample Z: 
Test of mu = 35 = vs not = 35 
The assumed standard deviation = 1.8 


Variable N StDev SEMean Z 
x 25 1.475 4 ? ? 


Mean 
35.710 


(a) Fillin the missing items. What conclusions would you draw? 
(b) Is this a one-sided or a two-sided test? 


(c) Use the normal table and the above data to construct a 
95% two-sided CI on the mean. 

(d) What would the P-value be if the alternative hypothesis is 
H,: w > 35? 


9-37. Output from a software package is given below: 


One-Sample Z: 


Test of mu = 20 vs > 20 
The assumed standard deviation = 0.75 


Variable N StDev 
x 10 =19.889 ? 


Mean SE Mean Z P 


0.237 ? ? 


(a) Fill in the missing items. What conclusions would you 
draw? 

(b) Is this a one-sided or a two-sided test? 

(c) Use the normal table and the above data to construct a 
95% two-sided CI on the mean. 

(d) What would the P-value be if the alternative hypothesis is 
Ay: p # 20? 


9-38. Output from a software package is given below: 


One-Sample Z: 


Test of mu = 14.5 vs > 14.5 

The assumed standard deviation = 1.1 

Variable N StDev SEMean Z 

x 16 1.015 ? ? ? 


Mean 
15.016 


(a) Fillin the missing items. What conclusions would you draw? 

(b) Is this a one-sided or a two-sided test? 

(c) Use the normal table and the above data to construct a 
95% lower bound on the mean. 

(d) What would the P-value be if the alternative hypothesis is 
Ay: p # 14.5? 


9-39. Output from a software package is given below: 


One-Sample Z: 


Test of mu = 99 vs > 99 

The assumed standard deviation = 2.5 

Variable N StDev SE Mean Z P 
x 12 2.365 ? 1.44 0.075 


Mean 
100.039 


(a) Fillin the missing items. What conclusions would you draw? 
(b) Is this a one-sided or a two-sided test? 
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(c) If the hypothesis had been Hy: w = 98 versus H,: p> 98, 
would you reject the null hypothesis at the 0.05 level of 
significance? Can you answer this without referring to the 
normal table? 

(d) Use the normal table and the above data to construct a 
95% lower bound on the mean. 

(ec) What would the P-value be if the alternative hypothesis is 
Ay: pw #99? 

9-40. The mean water temperature downstream from a 
power plant cooling tower discharge pipe should be no more 
than 100°F Past experience has indicated that the standard 
deviation of temperature is 2°K The water temperature is 
measured on nine randomly chosen days, and the average 
temperature is found to be 98°F. 

(a) Is there evidence that the water temperature is acceptable 
at a = 0.05? 

(b) What is the P-value for this test? 

(c) What is the probability of accepting the null hypothesis 
at a = 0.05 if the water has a true mean temperature of 
104°F? 

9-41. A manufacturer produces crankshafts for an automo- 

bile engine. The wear of the crankshaft after 100,000 miles 

(0.0001 inch) is of interest because it is likely to have an 

impact on warranty claims. A random sample of n = 15 shafts 

is tested and x = 2.78. It is known that o = 0.9 and that wear 
is normally distributed. 

(a) Test Hy: wp = 3 versus H,: p #3 using a = 0.05. 

(b) What is the power of this test if p = 3.25? 

(c) What sample size would be required to detect a true mean 
of 3.75 if we wanted the power to be at least 0.9? 

9-42. A melting point test of n = 10 samples of a binder 

used in manufacturing a rocket propellant resulted in 

x = 154.2°F. Assume that the melting point is normally dis- 

tributed with o = 1.5°F. 

(a) Test Hp: w = 155 versus Hy: w # 155 using a = 0.01. 

(b) What is the P-value for this test? 

(c) What is the B-error if the true mean is p = 150? 

(d) What value of n would be required if we want B < 0.1 
when pw = 150? Assume that a = 0.01. 

9-43. The life in hours of a battery is known to be approxi- 

mately normally distributed, with standard deviation 0 = 1.25 

hours. A random sample of 10 batteries has a mean life of 

x = 40.5 hours. 

(a) Is there evidence to support the claim that battery life 
exceeds 40 hours? Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) What is the B-error for the test in part (a) if the true mean 
life is 42 hours? 

(d) What sample size would be required to ensure that 8 does 
not exceed 0.10 if the true mean life is 44 hours? 

(e) Explain how you could answer the question in part (a) 
by calculating an appropriate confidence bound on life. 

9-44. An engineer who is studying the tensile strength of a 

steel alloy intended for use in golf club shafts knows that 
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tensile strength is approximately normally distributed with 

o = 60 psi. A random sample of 12 specimens has a mean 

tensile strength of x = 3450 psi. 

(a) Test the hypothesis that mean strength is 3500 psi. Use 
a = 0.01. 

(b) What is the smallest level of significance at which you 
would be willing to reject the null hypothesis? 

(c) What is the B-error for the test in part (a) if the true mean 
is 3470? 

(d) Suppose that we wanted to reject the null hypothesis with 
probability at least 0.8 if mean strength 1 = 3500. What 
sample size should be used? 

(e) Explain how you could answer the question in part (a) with 
a two-sided confidence interval on mean tensile strength. 


9-45. Supercavitation is a propulsion technology for undersea 
vehicles that can greatly increase their speed. It occurs above ap- 
proximately 50 meters per second, when pressure drops suffi- 
ciently to allow the water to dissociate into water vapor, forming 
a gas bubble behind the vehicle. When the gas bubble completely 
encloses the vehicle, supercavitation is said to occur. Eight tests 
were conducted on a scale model of an undersea vehicle in a tow- 
ing basin with the average observed speed x = 102.2 meters per 
second. Assume that speed is normally distributed with known 
standard deviation o = 4 meters per second. 

(a) Test the hypothesis Hy: w = 100 versus A: p < 100 using 
a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) Compute the power of the test if the true mean speed is as 
low as 95 meters per second. 

(d) What sample size would be required to detect a true mean 
speed as low as 95 meters per second if we wanted the 
power of the test to be at least 0.85? 

(e) Explain how the question in part (a) could be answered by 
constructing a one-sided confidence bound on the mean 
speed. 


9-46. A bearing used in an automotive application is sup- 

posed to have a nominal inside diameter of 1.5 inches. A ran- 

dom sample of 25 bearings is selected and the average inside 

diameter of these bearings is 1.4975 inches. Bearing diameter 

is known to be normally distributed with standard deviation 

o = 0.01 inch. 

(a) Test the hypothesis Hp: p = 1.5 versus H,: p # 1.5 using 
a = 0.01. 

(b) What is the P-value for the test in part (a)? 

(c) Compute the power of the test if the true mean diameter is 
1.495 inches. 

(d) What sample size would be required to detect a true mean 
diameter as low as 1.495 inches if we wanted the power of 
the test to be at least 0.9? 

(e) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
diameter. 


9-47. Medical researchers have developed a new artificial 
heart constructed primarily of titanium and plastic. The heart 
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will last and operate almost indefinitely once it is implanted in 
the patient’s body, but the battery pack needs to be recharged 
about every four hours. A random sample of 50 battery packs 
is selected and subjected to a life test. The average life of these 
batteries is 4.05 hours. Assume that battery life is normally 
distributed with standard deviation o = 0.2 hour. 

(a) Is there evidence to support the claim that mean battery 

life exceeds 4 hours? Use a = 0.05. 
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(c) Compute the power of the test if the true mean battery life 
is 4.5 hours. 

(d) What sample size would be required to detect a true mean 
battery life of 4.5 hours if we wanted the power of the test 
to be at least 0.9? 

(e) Explain how the question in part (a) could be answered by 
constructing a one-sided confidence bound on the mean 
life. 


(b) What is the P-value for the test in part (a)? 


9-3 TESTS ON THE MEAN OF A NORMAL DISTRIBUTION, 
VARIANCE UNKNOWN 


9-3.1 Hypothesis Tests on the Mean 


Test Statistic 


We now consider the case of hypothesis testing on the mean of a population with unknown 
variance o”. The situation is analogous to Section 8-2, where we considered a confidence 
interval on the mean for the same situation. As in that section, the validity of the test proce- 
dure we will describe rests on the assumption that the population distribution is at least 
approximately normal. The important result upon which the test procedure relies is that if 
X,, X>,...,X,, is arandom sample from a normal distribution with mean wp and variance o”, the 
random variable 


_X7h 
~ S/Vn 


T 


has a ¢ distribution with n — 1 degrees of freedom. Recall that we used this result in Section 
8-2 to devise the ¢-confidence interval for .. Now consider testing the hypotheses 


Ay: & = bo 
Ay: w # Yo 


We will use the test statistic: 


(9-26) 


If the null hypothesis is true, Ty has a ¢ distribution with n — 1 degrees of freedom. When we 
know the distribution of the test statistic when Hp is true (this is often called the reference 
distribution or the null distribution), we can calculate the P-value from this distribution, or, 
if we use a fixed significance level approach, we can locate the critical region to control the type 
I error probability at the desired level. 

To test Hp: w = po against the two-sided alternative H, :,. # po, the value of the test statis- 
tic tf) in Equation 9-26 is calculated, and the P-value is found from the ¢ distribution with n — 1 
degrees of freedom. Because the test is two-tailed, the P-value is the sum of the probabilities in 
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Two-tailed test One-tailed test One-tailed test 


bo t 
i P-value = 
probability in 


both tails —_ ~ 
to to 


—to fe) to 0 


n-1 n= 


6) 
(a) (b) (ce) 
Figure 9-10 Calculating the P-value for a t-test: (a) H): w # wo: (b) Ay: w > po; (c) Ay: w < po. 


the two tails of the ¢ distribution. Refer to Fig. 9-10(a). The P-value is the probability above |t,| 
plus the probability below —|f,|. Because the r distribution is symmetric around zero, a simple way 
to write this is 

P = 2P(T,—1 > |fol) (9-27) 


A small P-value is evidence against Ho, so if P is of sufficiently small value (typically < 0.05), 


reject the null hypothesis. 
For the one-sided alternative hypotheses 


Ao: & = Bo 
Ay: & > Yo (9-28) 


we calculate the test statistic 4) from Equation 9-26 and calculate the P-value as 


P= P(T,—-1 > to) (9-29) 
For the other one-sided alternative 
Ao: & = bo 
Hy: < po (9-30) 
we calculate the P-value as 
P = P(T,-| < to) (9-31) 


Figure 9-10(b) and (c) show how these P-values are calculated. 

Statistics software packages calculate and display P-values. However, in working problems 
by hand, it is useful to be able to find the P-value for a t-test. Because the t-table in Appendix A 
Table II contains only 10 critical values for each ¢ distribution, determining the exact P-value 
from this table is usually impossible. Fortunately, it’s easy to find lower and upper bounds on the 
P-value by using this table. 

To illustrate, suppose that we are conducting an upper-tailed f-test (so Hy): > Wo) with 
14 degrees of freedom. The relevant critical values from Appendix A Table II are as fol- 


lows: 
Critical Value: 0.258 0.692 1.345 1.761 2.145 2.624 2.977 3.326 3.787 4.140 
Tail Area: 0.40 0.25 0.10 0.05 0.025 0.01 0.005 0.0025 0.001 0.0005 


After calculating the test statistic, we find that fp = 2.8. Now, fo = 2.8 is between two tabulated 
values, 2.624 and 2.977. Therefore, the P-value must be between 0.01 and 0.005. Refer to 
Fig. 9-11. These are effectively the upper and lower bounds on the P-value. 

This illustrates the procedure for an upper-tailed test. If the test is lower-tailed, just change 
the sign on the lower and upper bounds for f) and proceed as above. Remember that for a 
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t distribution 
with 14 degrees 
of freedom 


P(T 4 > 2.624) = 0.01 


P(T\4> 2.977) = 0.005 


Figure 9-11 P-value for 0 A 

ty) = 2.8; an upper-tailed >on 
test is shown to be between to= al 
0.005 and 0.01. 2.977 


(a) 


two-tailed test, the level of significance associated with a particular critical value is twice the cor- 
responding tail area in the column heading. This consideration must be taken into account when 
we compute the bound on the P-value. For example, suppose that f, = 2.8 for a two-tailed al- 
ternative based on 14 degrees of freedom. The value of the test statistic tg > 2.624 (correspond- 
ing toa = 2 X 0.01 = 0.02) and ft) < 2.977 (corresponding to « = 2 X 0.005 = 0.01), so the 
lower and upper bounds on the P-value would be 0.01 < P < 0.02 for this case. 

Some statistics software packages can help you calculate P-values. For example, Minitab 
has the capability to find cumulative probabilities from many standard probability distributions, 
including the f distribution. Simply enter the value of the test statistic f) along with the appro- 
priate number of degrees of freedom. Minitab will display the probability P(7,, < to) where v is 
the degrees of freedom for the test statistic fo. From the cumulative probability, the P-value can 
be determined. 

The single-sample t-test we have just described can also be conducted using the fixed sig- 
nificance level approach. Consider the two-sided alternative hypothesis. The null hypothesis 
would be rejected if the value of the test statistic fy falls in the critical region defined by the 
lower and upper a/2 percentage points of the t distribution with n — 1 degrees of freedom. That 
is, reject Hp if 


to > tyi2n-1 OT fg S ~be2n=1 


For the one-tailed tests, the location of the critical region is determined by the direction that the 
inequality in the alternative hypothesis “points.” So if the alternative is H,: w > jo, reject Hy if 


to > lon—1 
and if the alternative is H): w < po, reject Ho if 
to < ~toen-1 


Figure 9-12 shows the locations of these critical regions. 


a/2 a a 


ty/2, n- dl 0 to n-—1 =be: n-1 ) To 
(b) (c) 


Figure 9-12 The distribution of Tj when Hy: ~ = pois true, with critical region for (a) Hy: ~~ po, (b) Ay: pw > po, 


and (c) Hy: w < po. 
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Summary for 
the One- Testing Hypotheses on the Mean of a Normal Distribution, Variance Unknown 
Sample ¢-Test ag eee gm a mere ea ae ae 
Null hypothesis: Ho: & = Wo 
X = po 
Test statistic: DE ere 
o s/Vn 
Rejection Criterion 
Alternative Hypotheses P-Value for Fixed-Level Tests 
A: be Fx bo Probability above Ito| and to > ty/2,n—1 OF to = Slojn—1 
probability below —|to| 
Hy: 2 > wo Probability above fo i) 2 tngcil 
Ay: 2 < po Probability below ft iy Saal 
The calculations of the P-values and the locations of the critical regions for these 
situations are shown in Figs. 9-10 and 9-12, respectively. 


EXAMPLE 9-6 Golf Club Design 

The increased availability of light materials with high 
strength has revolutionized the design and manufacture of 
golf clubs, particularly drivers. Clubs with hollow heads and 
very thin faces can result in much longer tee shots, especially 
for players of modest skills. This is due partly to the “spring- 
like effect” that the thin face imparts to the ball. Firing a golf 
ball at the head of the club and measuring the ratio of the out- 
going velocity of the ball to the incoming velocity can quan- 
tify this spring-like effect. The ratio of velocities is called the 
coefficient of restitution of the club. An experiment was per- 
formed in which 15 drivers produced by a particular club 
maker were selected at random and their coefficients of resti- 
tution measured. In the experiment the golf balls were 
fired from an air cannon so that the incoming velocity and 
spin rate of the ball could be precisely controlled. It is of in- 
terest to determine if there is evidence (with a = 0.05) to sup- 
port a claim that the mean coefficient of restitution exceeds 
0.82. The observations follow: 


0.8411 0.8191 0.8182 0.8125 0.8750 
0.8580 0.8532 0.8483 0.8276 0.7983 
0.8042 0.8730 0.8282 0.8359 0.8660 


The sample mean and sample standard deviation are 
X = 0.83725 and s = 0.02456. The normal probability plot of 
the data in Fig. 9-13 supports the assumption that the coeffi- 
cient of restitution is normally distributed. Since the objective 
of the experimenter is to demonstrate that the mean coefficient 
of restitution exceeds 0.82, a one-sided alternative hypothesis 
is appropriate. 


The solution using the seven-step procedure for hypothe- 
sis testing is as follows: 


1. Parameter of interest: The parameter of interest is 
the mean coefficient of restitution, w. 


Null hypothesis: Hy: w = 0.82 


Alternative hypothesis: .H,: 1 > 0.82 We want to 
reject Hy if the mean coefficient of restitution exceeds 
0.82. 


4. Test Statistic: The test statistic is 


y 


pe X= Mo 
Oo s/n 
5. Reject Hy if: Reject H, if the P-value is less than 


0.05. 
6. Computations: Since x = 0.83725, s = 0.02456, 
Jy = 0.82, and n = 15, we have 


4, = 0:83725 = 0.82 _ 
° 0.02456/V15 


2.72 


7. Conclusions: From Appendix A Table II we find, for a 
t distribution with 14 degrees of freedom, that fy = 2.72 
falls between two values: 2.624, for which a = 0.01, 
and 2.977, for which a = 0.005. Because this is a 
one-tailed test, we know that the P-value is between 
those two values, that is, 0.005 < P < 0.01. 
Therefore, since P < 0.05, we reject Hy and conclude 
that the mean coefficient of restitution exceeds 0.82. 
To use Minitab to compute the P-value, use the Calc 
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Percentage 
ol 
fo} 


0.78 0.83 0.88 
Coefficient of restitution 


Figure 9-13. Normal probability plot of the coefficient of 
restitution data from Example 9-6. 


menu and select the probability distribution option. = 2.72) = 0.991703. The P-value is P(T,, 2.72) or P = 1 — 
Then, for the f distribution, enter 14 degrees of free- P(Ty4 = 2.72) = 1 — 0.991703 = 0.008297. 

dom and the value of the test statistic fj = 2.72 as the Practical Interpretation: There is strong evidence to con- 
input constant. Minitab returns the probability P (T,, clude that the mean coefficient of restitution exceeds 0.82. 


Minitab will conduct the one-sample f-test. The output from this software package is in the 
following display: 


Minitab Computations 
One-Sample T: COR 


Test of mu = 0.82 vs mu > 0.82 


Variable N Mean StDev SE Mean 
COR 15 0.83725 0.02456 0.00634 
Variable 95.0% Lower Bound T P 

COR 0.82608 2.72 0.008 


Notice that Minitab computes both the test statistic Ty and a 95% lower confidence bound for 
the coefficient of restitution. The reported P-value is 0.008. Because the 95% lower confidence 
bound exceeds 0.82, we would reject the hypothesis that Hp: = 0.82 and conclude that the 
alternative hypothesis H,: w > 0.82 is true. 


9-3.2 Type II Error and Choice of Sample Size 


The type IJ error probability for the t-test depends on the distribution of the test statistic in 
Equation 9-26 when the null hypothesis Ho: . = py is false. When the true value of the mean 
is ~ = Wo + 4, the distribution for Tp is called the noncentral ¢ distribution with n — | 


9-3 TESTS ON THE MEAN OF A NORMAL DISTRIBUTION, VARIANCE UNKNOWN 315 


degrees of freedom and noncentrality parameter 8Vn/o. Note that if 8 = 0, the noncentral t 
distribution reduces to the usual central ¢ distribution. Therefore, the type II error of the two- 
sided alternative (for example) would be 


B = Pi tapas = To = ly/2,n-1 | oF oO} 
= PY tpg = To ta/2n—1t 


A 


where 7% denotes the noncentral tf random variable. Finding the type II error probability B 
for the t-test involves finding the probability contained between two points of the noncentral 
t distribution. Because the noncentral f-random variable has a messy density function, this 
integration must be done numerically. 

Fortunately, this ugly task has already been done, and the results are summarized in a se- 
ries of O.C. curves in Appendix Charts VIIe, VILf, VIlg, and VII/ that plot B for the f-test 
against a parameter d for various sample sizes n. Curves are provided for two-sided alterna- 
tives on Charts VIle and VIIf. The abscissa scale factor d on these charts is defined as 


[mH — Bol _ [8] 
d= = = ay (9-32) 
For the one-sided alternative fp > po or pp < Wo, we use charts VIG and VIH with 
[IM — Bol _ || 
d= (9-33) 


We note that d depends on the unknown parameter o”. We can avoid this difficulty in 
several ways. In some cases, we may use the results of a previous experiment or prior 
information to make a rough initial estimate of o”. If we are interested in evaluating test per- 
formance after the data have been collected, we could use the sample variance s? to estimate 
o”. If there is no previous experience on which to draw in estimating o”, we then define the 
difference in the mean d that we wish to detect relative to o. For example, if we wish to 
detect a small difference in the mean, we might use a value of d = |6|/o =< 1 (for example), 
whereas if we are interested in detecting only moderately large differences in the mean, we 
might select d = |8|/o = 2 (for example). That is, it is the value of the ratio |8|/o that is 
important in determining sample size, and if it is possible to specify the relative size of the 
difference in means that we are interested in detecting, then a proper value of d can usually 
be selected. 


EXAMPLE 9-7 Golf Club Design Sample Size 
Consider the golf club testing problem from Example 9-6. If 
the mean coefficient of restitution exceeds 0.82 by as much as 
0.02, is the sample size n = 15 adequate to ensure that Hp: 
p. = 0.82 will be rejected with probability at least 0.8? 

To solve this problem, we will use the sample standard 
deviation s = 0.02456 to estimate o. Then d = |8|/o = 
0.02/0.02456 = 0.81. By referring to the operating charac- 


teristic curves in Appendix Chart VIIg (for a = 0.05) with 
d = 0.81 and n = 15, we find that B = 0.10, approximately. 
Thus, the probability of rejecting Hy: ~ = 0.82 if the true 
mean exceeds this by 0.02 is approximately 1 — B = 1 — 
0.10 = 0.90, and we conclude that a sample size of n = 15 is 
adequate to provide the desired sensitivity. 


Minitab will also perform power and sample size computations for the one-sample f-test. 
Below are several calculations based on the golf club testing problem: 
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Minitab Computations 


Power and Sample Size 


1-Sample ¢ Test 


Testing mean = null (versus > null) 


Calculating power for mean = null + difference 


Alpha = 0.05 Sigma = 0.02456 


Sample 
Difference Size Power 
0.02 15 0.9117 


Power and Sample Size 


1-Sample ¢ Test 
Testing mean = null (versus > null) 


Calculating power for mean = null + difference 


Alpha = 0.05 Sigma = 0.02456 


Sample 
Difference Size Power 
0.01 15 0.4425 


Power and Sample Size 


1-Sample ¢ Test 
Testing mean = null (versus > null) 


Calculating power for mean = null + difference 


Alpha = 0.05 Sigma = 0.02456 


Sample Target 
Difference Size Power 
0.01 39 0.8000 


Actual 
Power 
0.8029 


In the first portion of the computer output, Minitab reproduces the solution to Example 9-7, 
verifying that a sample size of n = 15 is adequate to give power of at least 0.8 if the mean co- 
efficient of restitution exceeds 0.82 by at least 0.02. In the middle section of the output, we 
used Minitab to compute the power to detect a difference between w and po = 0.82 of 
0.01. Notice that with n = 15, the power drops considerably to 0.4425. The final portion of 
the output is the sample size required for a power of at least 0.8 if the difference between 
w. and wy of interest is actually 0.01. A much larger n is required to detect this smaller 


difference. 


EXERCISES FOR SECTION 9-3 


9-48. A hypothesis will be used to test that a population 
mean equals 7 against the alternative that the population mean 
does not equal 7 with unknown variance o. What are the criti- 
cal values for the test statistic Ty for the following significance 
levels and sample sizes? 

(a) a = 0.01 andn = 20 

(b) a = 0.05 and n = 12 

(c) a = 0.10 andn = 15 


9-49. A hypothesis will be used to test that a population 
mean equals 10 against the alternative that the population 
mean is greater than 10 with known variance o. What is the 
critical value for the test statistic Z) for the following signifi- 
cance levels? 

(a) a = 0.01 andn = 20 

(b) a = 0.05 and n = 12 

(c) a = 0.10 andn = 15 
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9-50. A hypothesis will be used to test that a population 
mean equals 5 against the alternative that the population 
mean is less than 5 with known variance o. What is the criti- 
cal value for the test statistic Z) for the following signifi- 
cance levels? 

(a) a = 0.01 and n = 20 

(b) a = 0.05 and n = 12 

(c) a = 0.10 andn = 15 


9-51. For the hypothesis test Hy: w = 7 against H,: wp #7 
with variance unknown and n = 20, approximate the P-value 
for each of the following test statistics. 

(a) t = 2.05 (b) to = —1.84 (c) f = 0.4 
9-52. For the hypothesis test Hp: » = 10 against H,: p > 10 
with variance unknown and n = 15, approximate the P-value 
for each of the following test statistics. 

(a) t = 2.05 (b) to = —1.84 (c) tf = 04 
9-53. For the hypothesis test Hp: wy = 5 against Hy: w <5 
with variance unknown and n = 12, approximate the P-value 
for each of the following test statistics. 

(a) t = 2.05 (b) to = —1.84 (c) f = 0.4 


9-54. Consider the computer output below. 


ll 


One-Sample T: 


Test of mu = 91 vs > 91 


95% Lower 
Variable N Mean StDev SE Mean Bound T P 
x 20 92.379 0.717 ? ? yo? 


(a) Fill in the missing values. You may calculate bounds on 
the P-value. What conclusions would you draw? 

(b) Is this a one-sided or a two-sided test? 

(c) If the hypothesis had been Hy: w = 90 versus H,: p > 90, 
would your conclusions change? 


9-55. Consider the computer output below. 


One-Sample T: 


Test of mu = 12 vs not = 12 


Variable N 
x 10 


StDev 
12.564 ? 


SE Mean T P 
0.296 uf ? 


Mean 


(a) How many degrees of freedom are there on the f-test 
statistic? 

(b) Fill in the missing values. You may calculate bounds on 
the P-value. What conclusions would you draw? 

(c) Is this a one-sided or a two-sided test? 

(d) Construct a 95% two-sided CI on the mean. 

(e) If the hypothesis had been Ap: p = 12 versus H,: pw > 12, 
would your conclusions change? 

(f) If the hypothesis had been Hp: wp = 11.5, versus 
Hy: w # 11.5, would your conclusions change? Answer 
this question by using the CI computed in part (d). 


9-56. Consider the computer output below. 


One-Sample T: 


Test of mu = 34 vs not = 34 


Variable N Mean StDev SE Mean 
x 16 35.274 1.783 ? 


95% CI T P 
(34.324, 36.224) ? 0.012 


(a) How many degrees of freedom are there on the ¢-test statistic? 

(b) Fill in the missing quantities. 

(c) At what level of significance can the null hypothesis be 
rejected? 

(d) If the hypothesis had been Ho: w = 34 versus H,: wp > 34, 
would the P-value have been larger or smaller? 

(e) If the hypothesis had been Hp: w = 34.5 versus 
Hy: w # 34.5, would you have rejected the null hypothesis 
at the 0.05 level? 


9-57. Anarticle in Growth: A Journal Devoted to Problems 
of Normal and Abnormal Growth [Comparison of Measured 
and Estimated Fat-Free Weight, Fat, Potassium and Nitrogen 
of Growing Guinea Pigs” (Vol. 46, No. 4, 1982, pp. 306-321)] 
reported the results of a study that measured the body weight 
(in grams) for guinea pigs at birth. 


421.0 452.6 456.1 494.6 373.8 

90.5 110.7 96.4 81.7 102.4 
241.0 296.0 317.0 290.9 256.5 
447.8 687.6 705.7 879.0 88.8 
296.0 273.0 268.0 227.5 279.3 
258.5 296.0 


(a) Test the hypothesis that mean body weight is 300 grams. 
Use a = 0.05. 

(b) What is the smallest level of significance at which you 
would be willing to reject the null hypothesis? 

(c) Explain how you could answer the question in part (a) with 
a two-sided confidence interval on mean body weight. 


9-58. An article in the ASCE Journal of Energy 

Engineering (1999, Vol. 125, pp. 59-75) describes a study 

of the thermal inertia properties of autoclaved aerated con- 

crete used as a building material. Five samples of the mate- 

rial were tested in a structure, and the average interior tem- 

peratures (°C) reported were as follows: 23.01, 22.22, 

22.04, 22.62, and 22.59. 

(a) Test the hypotheses Hp: w = 22.5 versus Hy: w # 22.5, 
using a = 0.05. Find the P-value. 

(b) Check the assumption that interior temperature is nor- 
mally distributed. 

(c) Compute the power of the test if the true mean interior 
temperature is as high as 22.75. 

(d) What sample size would be required to detect a true mean 
interior temperature as high as 22.75 if we wanted the 
power of the test to be at least 0.9? 
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(e) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
interior temperature. 


9-59. A 1992 article in the Journal of the American Medical 
Association (“A Critical Appraisal of 98.6 Degrees FE the Upper 
Limit of the Normal Body Temperature, and Other Legacies of 
Carl Reinhold August Wunderlich”) reported body temperature, 
gender, and heart rate for a number of subjects. The body tem- 
peratures for 25 female subjects follow: 97.8, 97.2, 97.4, 97.6, 
97.8, 97.9, 98.0, 98.0, 98.0, 98.1, 98.2, 98.3, 98.3, 98.4, 98.4, 
98.4, 98.5, 98.6, 98.6, 98.7, 98.8, 98.8, 98.9, 98.9, and 99.0. 

(a) Test the hypothesis Hy: w = 98.6 versus H): wp # 98.6, 
using a = 0.05. Find the P-value. 

(b) Check the assumption that female body temperature is 
normally distributed. 

(c) Compute the power of the test if the true mean female 
body temperature is as low as 98.0. 

(d) What sample size would be required to detect a true mean 
female body temperature as low as 98.2 if we wanted the 
power of the test to be at least 0.9? 

(e) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
female body temperature. 


9-60. Cloud seeding has been studied for many decades as 

a weather modification procedure (for an interesting study of 

this subject, see the article in Technometrics, “A Bayesian 

Analysis of a Multiplicative Treatment Effect in Weather 

Modification,” Vol. 17, pp. 161-166). The rainfall in acre-feet 

from 20 clouds that were selected at random and seeded with 

silver nitrate follows: 18.0, 30.7, 19.8, 27.1, 22.3, 18.8, 31.8, 

23.4, 21.2, 27.9, 31.9, 27.1, 25.0, 24.7, 26.9, 21.8, 29.2, 34.8, 

26.7, and 31.6. 

(a) Can you support a claim that mean rainfall from seeded 
clouds exceeds 25 acre-feet? Use a = 0.01. Find the P-value. 

(b) Check that rainfall is normally distributed. 

(c) Compute the power of the test if the true mean rainfall is 
27 acre-feet. 

(d) What sample size would be required to detect a true mean 
rainfall of 27.5 acre-feet if we wanted the power of the test 
to be at least 0.9? 

(e) Explain how the question in part (a) could be answered by 
constructing a one-sided confidence bound on the mean 
diameter. 


9-61. The sodium content of twenty 300-gram boxes of 

organic cornflakes was determined. The data (in milligrams) are 

as follows: 131.15, 130.69, 130.91, 129.54, 129.64, 128.77, 

130.72, 128.33, 128.24, 129.65, 130.14, 129.29, 128.71, 129.00, 

129.39, 130.42, 129.53, 130.12, 129.78, 130.92. 

(a) Can you support a claim that mean sodium content of this 
brand of cornflakes differs from 130 milligrams? Use 
a = 0.05. Find the P-value. 

(b) Check that sodium content is normally distributed. 

(c) Compute the power of the test if the true mean sodium 
content is 130.5 milligrams. 
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(d) What sample size would be required to detect a true mean 
sodium content of 130.1 milligrams if we wanted the 
power of the test to be at least 0.75? 

(e) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
sodium content. 


9-62. Consider the baseball coefficient of restitution data 

first presented in Exercise 8-92. 

(a) Do the data support the claim that the mean coefficient of 
restitution of baseballs exceeds 0.635? Use a = 0.05. Find 
the P-value. 

(b) Check the normality assumption. 

(c) Compute the power of the test if the true mean coefficient 
of restitution is as high as 0.64. 

(d) What sample size would be required to detect a true mean 
coefficient of restitution as high as 0.64 if we wanted the 
power of the test to be at least 0.75? 

(e) Explain how the question in part (a) could be answered 
with a confidence interval. 


9-63. Consider the dissolved oxygen concentration at TVA 

dams first presented in Exercise 8-94. 

(a) Test the hypothesis Hp: » = 4 versus H,: wp # 4. Use 
a = 0.01. Find the P-value. 

(b) Check the normality assumption. 

(c) Compute the power of the test if the true mean dissolved 
oxygen concentration is as low as 3. 

(d) What sample size would be required to detect a true mean 
dissolved oxygen concentration as low as 2.5 if we 
wanted the power of the test to be at least 0.9? 

(e) Explain how the question in part (a) could be answered 
with a confidence interval. 


9-64. Reconsider the data from Medicine and Science in 

Sports and Exercise described in Exercise 8-30. The sample 

size was seven and the sample mean and sample standard 

deviation were 315 watts and 16 watts, respectively. 

(a) Is there evidence that leg strength exceeds 300 watts at 
significance level 0.05? Find the P-value. 

(b) Compute the power of the test if the true strength is 
305 watts. 

(c) What sample size would be required to detect a true mean 
of 305 watts if the power of the test should be at least 0.90? 

(d) Explain how the question in part (a) could be answered 
with a confidence interval. 


9-65. Reconsider the tire testing experiment described in 

Exercise 8-27. 

(a) The engineer would like to demonstrate that the mean life 
of this new tire is in excess of 60,000 kilometers. Formu- 
late and test appropriate hypotheses, and draw conclu- 
sions using a = 0.05. 

(b) Suppose that if the mean life is as long as 61,000 kilome- 
ters, the engineer would like to detect this difference with 
probability at least 0.90. Was the sample size n = 16 used 
in part (a) adequate? 
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9-66. Reconsider the Izod impact test on PVC pipe described 
in Exercise 8-28. Suppose that you want to use the data from 
this experiment to support a claim that the mean impact strength 
exceeds the ASTM standard (one foot-pound per inch). 
Formulate and test the appropriate hypotheses using a = 0.05. 


9-67. Reconsider the television tube brightness experiment 
in Exercise 8-35. Suppose that the design engineer claims that 
this tube will require at least 300 microamps of current to pro- 
duce the desired brightness level. Formulate and test an appro- 
priate hypothesis to confirm this claim using a = 0.05. Find the 
P-value for this test. State any necessary assumptions about the 
underlying distribution of the data. 


9-68. Exercise 6-30 gave data on the heights of female 

engineering students at ASU. 

(a) Can you support a claim that the mean height of female 
engineering students at ASU is at least 65 inches? Use 
a = 0.05. Find the P-value. 

(b) Check the normality assumption. 

(c) Compute the power of the test if the true mean height is 
62 inches. 

(d) What sample size would be required to detect a true mean 
height of 64 inches if we wanted the power of the test to be 
at least 0.8? 


9-69. Exercise 6-33 describes testing golf balls for an over- 

all distance standard. 

(a) Can you support a claim that mean distance achieved by 
this particular golf ball exceeds 280 yards? Use a = 0.05. 
Find the P-value. 

(b) Check the normality assumption. 

(c) Compute the power of the test if the true mean distance is 
290 yards. 

(d) What sample size would be required to detect a true mean 
distance of 290 yards if we wanted the power of the test to 
be at least 0.8? 


9-70. Exercise 6-32 presented data on the concentration of 

suspended solids in lake water. 

(a) Test the hypothesis Hy: w = 55 versus Hy: wp # 55; use 
a = 0.05. Find the P-value. 

(b) Check the normality assumption. 

(c) Compute the power of the test if the true mean concentra- 
tion is as low as 50. 

(d) What sample size would be required to detect a true mean 
concentration as low as 50 if we wanted the power of the 
test to be at least 0.9? 


9-4 TESTS ON THE VARIANCE AND STANDARD DEVIATION OF 


A NORMAL DISTRIBUTION 


Sometimes hypothesis tests on the population variance or standard deviation are needed. 
When the population is modeled by a normal distribution, the tests and intervals described in 


this section are applicable. 


9-4.1 Hypothesis Tests on the Variance 


Suppose that we wish to test the hypothesis that the variance of a normal population o equals 
a specified value, say o@, or equivalently, that the standard deviation o is equal to op. Let X,, 
X>,..., X, be a random sample of n observations from this population. To test 


we will use the test statistic: 


Test Statistic 


HA: = < 


(9-34) 
A: on # oH 


x3 = ~+_,— (9-35) 
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Figure 9-14 Reference distribution for the test of Hp: o” = 0} with critical region values for (a) Hy: 07 # 0, 
(b) Hy: 0? > of, and (c) Hy: 0? < 0%. 


Tests on the 
Variance of 
a Normal 
Distribution 


If the null hypothesis Ho: 0 = 09 is true, the test statistic x defined in Equation 9-35 
follows the chi-square distribution with n — 1 degrees of freedom. This is the reference 
distribution for this test procedure. To perform a fixed significance level test, we would take a 
random sample from the population of interest, calculate x, the value of the test statistic X}, 
and the null hypothesis Hy: 0” = 0 would be rejected if 


2 2 P 2 2 
X0 > Xay2n-1 OFif XO < X7-e/2n-1 


where Xa/2n-l and xj_» 2.n-1 are the upper and lower 100a/2 percentage points of the chi- 
square distribution with n — 1 degrees of freedom, respectively. Figure 9-14(a) shows the 
critical region. 

The same test statistic is used for one-sided alternative hypotheses. For the one-sided 
hypotheses 


Hy: o* = of 
: ° (9-33) 
A: os oO 
we would reject Ho if x6 > oe whereas for the other one-sided hypotheses 
Ho: 0? = 0% 
ees (9-34) 
A: ao =< oO 


we would reject Ho if x6 < Nisasaik The one-sided critical regions are shown in Fig. 
9-14(b) and (c). 


Null hypothesis: 0. 1G 


2 
Ae n—1)S 
Test statistic: x= (e— Ds" 
2 
So 
Alternative hypothesis Rejection criteria 
pw 2 D) 2 2 2 
Hy: 0” # 09 X0 > Xa/2n—-1 OF XO S —Xav2,n-1 
eee) 2 2 2 
A: G2 a) Xo > Xan-1 


2 2 2 2 
Hy: 0 < 09 X0 S FXan—1 
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EXAMPLE 9-8 Automated Filling 


An automated filling machine is used to fill bottles with liquid 4. Test statistic: The test statistic is 
detergent. A random sample of 20 bottles results in a sample 
variance of fill volume of s” = 0.0153 (fluid ounces)”. If the '. (n — 1)s? 
variance of fill volume exceeds 0.01 (fluid ounces)’, an unac- Xo ~ ok 
ceptable proportion of bottles will be underfilled or overfilled. 
Is there evidence in the sample data to suggest that the man- 5. Reject Hy: Use a = 0.05, and reject Hp if 
ufacturer has a problem with underfilled or overfilled bottles? Xi > 6.05.19 = 30.14. 
Use a = 0.05, and assume that fill volume has a normal dis- 6. Computations: 
tribution. 
Using the seven-step procedure results in the following: > _ 19(0.0153) _ 
Xo — 001 = 29.07 
1. Parameter of Interest: The parameter of interest is 
the population variance o”. 7. Conclusions: Since Xb = 29.07 < x5.05,19 = 30.14, 
2. Null hypothesis: Ho: 0? = 0.01 we conclude that there is no strong evidence that the 
. . Z variance of fill volume exceeds 0.01 (fluid ounces)*. So 
3. Alternative hypothesis: H,: 0~ > 0.01 


there is no strong evidence of a problem with incor- 
rectly filled bottles. 


We can also use the P-value approach. Using Appendix Table III, it is easy to place bounds on 
the P-value of a chi-square test. From inspection of the table, we find that X6.10,19 = 27.20 and 
X6.05,19 = 30.14. Since 27.20 < 29.07 < 30.14, we conclude that the P-value for the test in 
Example 9-8 is in the interval 0.05 < P-value < 0.10. The actual P-value can be computed 
from Minitab. For 19 degrees of freedom, Minitab calculates the cumulative chi-square proba- 
bility that is less than or equal to the value of the test statistic as 0.935108 (use the cumulative 
distribution function in the Calc menu). This is the probability to the left of (or below) 29.07, 
and the P-value is the probability above or beyond 29.07, or P = 1 — 0.935108 = 0.064892. 

The P-value for a lower-tail test would be found as the area (probability) in the lower tail 
of the chi-square distribution to the left of (or below) the computed value of the test statistic 
x3. For the two-sided alternative, find the tail area associated with the computed value of the 
test statistic and double it to obtain the P-value. 

Minitab will perform the test on a variance of a normal distribution described in this sec- 
tion. The output for Example 9-8 is as follows: 


Test and CI for One Variance 


Method 
Null hypothesis Sigma-squared = 0.01 
Alternative hypothesis Sigma-squared > 0.01 
Statistics 
N StDev Variance 
20 0.124 0.0153 
95% One-Sided Confidence Intervals 

Lower Bound Lower Bound 
Method for StDev for Variance 
Standard 0.098 0.0096 
Tests 
Method Chi-Square DF  P-Value 
Standard 29.07 19 0.065 


The standard method that is referred to is the method described in this section. Minitab 
also has an adjusted method that can be employed with continuous nonnormal distributions. 
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9-4.2 Type II Error and Choice of Sample Size 


Operating characteristic curves for the chi-square tests in Section 9-4.1 are provided in 
Appendix Charts VIi through VIn for a = 0.05 and a = 0.01. For the two-sided alternative 
hypothesis of Equation 9-34, Charts VIIi and VIIj plot B against an abscissa parameter 


A =— (9-38) 


for various sample sizes n, where o denotes the true value of the standard deviation. Charts 
VIk and VI/ are for the one-sided alternative H,: 0* > 0, while Charts VIIm and VIIn are for 
the other one-sided alternative H,: 0? < 09. In using these charts, we think of o as the value 
of the standard deviation that we want to detect. 

These curves can be used to evaluate the B-error (or power) associated with a particu- 
lar test. Alternatively, they can be used to design a test—that is, to determine what sample 
size is necessary to detect a particular value of o that differs from the hypothesized 


value Oo. 


EXAMPLE 9-9 Automated Filling Sample Size 
Consider the bottle-filling problem from Example 9-8. If the 
variance of the filling process exceeds 0.01 (fluid ounces)’, too 
many bottles will be underfilled. Thus, the hypothesized value 
of the standard deviation is oy = 0.10. Suppose that if the true 
standard deviation of the filling process exceeds this value by 
25%, we would like to detect this with probability at least 0.8. 
Is the sample size of n = 20 adequate? 
To solve this problem, note that we require 


12 
as = = 168 
00 0.10 


EXERCISES FOR SECTION 9-4 


This is the abscissa parameter for Chart VIIk. From this chart, 
with n = 20 and \ = 1.25, we find that B = 0.6. Therefore, 
there is only about a 40% chance that the null hypothesis will 
be rejected if the true standard deviation is really as large as o 
= 0.125 fluid ounce. 

To reduce the B-error, a larger sample size must be used. 
From the operating characteristic curve with 8 = 0.20 and A 
= 1.25, we find that n = 75, approximately. Thus, if we want 
the test to perform as required above, the sample size must be 
at least 75 bottles. 


9-71. Consider the test of Hp: 0? = 7 against H,: 0? # 7. 
What are the critical values for the test statistic Xj for the 
following significance levels and sample sizes? 

(a) a = 0.01 andn = 20 

(b) a = 0.05 and n = 12 

(c) a = 0.10 andn = 15 

9-72. Consider the test of Hp: o? = 10 against H;: 07 > 10. 
What are the critical values for the test statistic X} for the 
following significance levels and sample sizes? 

(a) a = 0.01 and n = 20 

(b) a = 0.05 and n = 12 

(c) a = 0.10 andn = 15 


9-73. Consider the test of Hp: o? = 5 against Hy: 0? < 5. 
What are the critical values for the test statistic X} for the 
following significance levels and sample sizes? 

(a) a = 0.01 andn = 20 

(b) a = 0.05 and n = 12 

(c) a = 0.10 andn = 15 


9-74. Consider the hypothesis test of Ho: 0” = 7 against 
H,: o* # 7. Approximate the P-value for each of the following 
test statistics. 

(a) x4 = 25.2andn = 20 (b) x6 = 15.2andn = 12 

(c) x6 = 23.0 andn = 15 


9-75. Consider the test of Ho: 0? = 5 against H,: 0? <5. 
Approximate the P-value for each of the following test statistics. 
(a) x4 = 25.2 andn = 20 

(b) x = 15.2andn = 12 

(c) x = 4.2 andn = 15 

9-76. Consider the hypothesis test of Ho: 07 = 10 against 

H,: 0? > 10. Approximate the P-value for each of the follow- 

ing test statistics. 

(a) x% = 25.2 andn = 20 

(b) x = 15.2andn = 12 

(c) x6 = 4.2 andn = 15 

9-77. The data from Medicine and Science in Sports and 

Exercise described in Exercise 8-48 considered ice hockey 

player performance after electrostimulation training. In 

summary, there were 17 players and the sample standard devi- 
ation of performance was 0.09 seconds. 

(a) Is there strong evidence to conclude that the standard devi- 
ation of performance time exceeds the historical value of 
0.75 seconds? Use a = 0.05. Find the P-value for this test. 

(b) Discuss how part (a) could be answered by constructing a 
95% one-sided confidence interval for o. 


9-78. The data from Technometrics described in Exercise 

8-51 considered the variability in repeated measurements of 

the weight of a sheet of paper. In summary, the sample stan- 

dard deviation from 15 measurements was 0.0083 grams. 

(a) Does the measurement standard deviation differ from 0.01 
grams at w = 0.05? Find the P-value for this test. 

(b) Discuss how part (a) could be answered by constructing a 
confidence interval for o. 


9-79. Reconsider the percentage of titanium in an alloy used 
in aerospace castings from Exercise 8-47. Recall that s = 0.37 
andn = 51. 

(a) Test the hypothesis Hy: o = 0.25 versus H;: o # 0.25 
using a = 0.05. State any necessary assumptions about 
the underlying distribution of the data. Find the P-value. 

(b) Explain how you could answer the question in part (a) by 
constructing a 95% two-sided confidence interval for o. 


9-80. Data from an Izod impact test was described in 

Exercise 8-28. The sample standard deviation was 0.25 and 

n = 20 specimens were tested. 

(a) Test the hypothesis that o = 0.10 against an alternative 
specifying that o # 0.10, using a = 0.01, and draw a 
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conclusion. State any necessary assumptions about the 
underlying distribution of the data. 

(b) What is the P-value for this test? 

(c) Could the question in part (a) have been answered by 
constructing a 99% two-sided confidence interval for 0”? 


9-81. Data for tire life was described in Exercise 8-27. The 

sample standard deviation was 3645.94 kilometers and n = 16. 

(a) Can you conclude, using a = 0.05, that the standard devia- 
tion of tire life is less than 4000 kilometers? State any nec- 
essary assumptions about the underlying distribution of the 
data. Find the P-value for this test. 

(b) Explain how you could answer the question in part (a) by 
constructing a 95% one-sided confidence interval for o. 


9-82. If the standard deviation of hole diameter exceeds 

0.01 millimeters, there is an unacceptably high probability that 

the rivet will not fit. Suppose that n = 15 and s = 0.008 mil- 

limeter. 

(a) Is there strong evidence to indicate that the standard devi- 
ation of hole diameter exceeds 0.01 millimeter? Use a = 
0.01. State any necessary assumptions about the underly- 
ing distribution of the data. Find the P-value for this test. 

(b) Suppose that the actual standard deviation of hole diame- 
ter exceeds the hypothesized value by 50%. What is the 
probability that this difference will be detected by the test 
described in part (a)? 

(c) If ois really as large as 0.0125 millimeters, what sample size 
will be required to detect this with power of at least 0.8? 


9-83. Recall the sugar content of the syrup in canned 

peaches from Exercise 8-46. Suppose that the variance is 

thought to be o? = 18 (milligrams)’. Recall that a random 
sample of n = 10 cans yields a sample standard deviation of 

s = 4.8 milligrams. 

(a) Test the hypothesis Hy: o? = 18 versus H,: o* # 18 using 
a = 0.05. Find the P-value for this test. 

(b) Suppose that the actual standard deviation is twice as 
large as the hypothesized value. What is the probability 
that this difference will be detected by the test described in 
part (a)? 

(c) Suppose that the true variance is o~ = 40. How large a 
sample would be required to detect this difference with 
probability at least 0.90? 


2, 


9-5 TESTS ON A POPULATION PROPORTION 


It is often necessary to test hypotheses on a population proportion. For example, suppose that 
a random sample of size n has been taken from a large (possibly infinite) population and that 
X(<= n) observations in this sample belong to a class of interest. Then P = X/nisa point 
estimator of the proportion of the population p that belongs to this class. Note that n and p are 
the parameters of a binomial distribution. Furthermore, from Chapter 7 we know that the 
sampling distribution of Pis approximately normal with mean p and variance p(1 — p)/n, if 
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Pp is not too close to either 0 or 1 and if 7 is relatively large. Typically, to apply this approxi- 
mation we require that np and n(1 — p) be greater than or equal to 5. We will give a large- 
sample test that makes use of the normal approximation to the binomial distribution. 


9-5.1 Large-Sample Tests on a Proportion 


Test Statistic 


In many engineering problems, we are concerned with a random variable that follows the 
binomial distribution. For example, consider a production process that manufactures items 
that are classified as either acceptable or defective. It is usually reasonable to model the 
occurrence of defectives with the binomial distribution, where the binomial parameter p rep- 
resents the proportion of defective items produced. Consequently, many engineering decision 
problems involve hypothesis testing about p. 

We will consider testing 


Ay: P = Po (9-39) 
Ay: p F po 


An approximate test based on the normal approximation to the binomial will be given. As 
noted above, this approximate procedure will be valid as long as p is not extremely close to 
zero or one, and if the sample size is relatively large. Let X be the number of observations in 
a random sample of size n that belongs to the class associated with p. Then, if the null 
hypothesis Hp: p = po is true, we have X ~ N[npo, npo(1 — Po)], approximately. To test 
Ho: p = Po, calculate the test statistic 


~ Vapo(l = po) (9-40) 


and determine the P-value. Because the test statistic follows a standard normal distribution if 
Hi is true, the P-value is calculated exactly like the P-value for the z-tests in Section 9-2. So 
for the two-sided alternative hypothesis, the P-value is the sum of the probability in the stan- 
dard normal distribution above |zo| and the probability below the negative value —|zo|, or 


P= 2{1 — (zo))] 


For the one-sided alternative hypothesis Ho: p > po, the P-value is the probability above Zo, or 
P=1- P(zZ) 


and for the one-sided alternative hypothesis Ho: p < po, the P-value is the probability below Zp, 
or 


P = ®(Z) 


We can also perform a fixed-significance-level test. For the two-sided alternative hypothesis, 
we would reject Hp: p # po if 


Zo > Zu2 OF ZS —Zqs2 


Critical regions for the one-sided alternative hypotheses would be constructed in the usual 
manner. 
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Summary of 
Approximate 


Testing Hypotheses on a Binomial Proportion 


Tests ona 
Binomial 
Proportion 


Null hypotheses: 


Alternative Hypotheses 


Hy: Pp = Po 


X = mpo 


Test statistic: Ze 
npo(1 — Po) 


Rejection Criterion 


P-Value for Fixed-Level Tests 


Ay: p # po 


A: p > Po 


A: p < Po 


Probability above |zo| and 
probability below —|zp| 


29 > Zaz OF Zp < —Zoyp 


P = 21 — ®(z) 


Probability above Zo, Zo > Zo 
P= i= P(z) 
Probability below Zp, Hy S = Gan 
P= P(z) 


EXAMPLE 9-10 Automobile Engine Controller 
A semiconductor manufacturer produces controllers used in 
automobile engine applications. The customer requires that the 
process fallout or fraction defective at a critical manufacturing 
step not exceed 0.05 and that the manufacturer demonstrate 
process capability at this level of quality using a = 0.05. The 
semiconductor manufacturer takes a random sample of 200 
devices and finds that four of them are defective. Can the man- 
ufacturer demonstrate process capability for the customer? 
We may solve this problem using the seven-step hypoth- 
esis-testing procedure as follows: 


1. Parameter of Interest: The parameter of interest is 
the process fraction defective p. 


Null hypothesis: Hy: p = 0.05 
Alternative hypothesis: H,: p < 0.05 
This formulation of the problem will allow the man- 


ufacturer to make a strong claim about process capa 
bility if the null hypothesis Hp: p = 0.05 is rejected. 


4. The test statistic is (from Equation 9-40) 
xX — Npo 
2 = == 
Vinpo(l — Po) 
where x = 4, n = 200, and py = 0.05. 


5. Reject Hy if: Reject Hp: p = 0.05 if the p-value is 
less than 0.05. 


6. Computations: The test statistic is 


4 = 200(0.05) _ 1a 
“0 ~ \/200(0.05)(0.95) 
7. Conclusions: Since zy = —1.95, the P-value is 


@(— 1.95) = 0.0256, so we reject Hy and conclude 
that the process fraction defective p is less than 0.05. 


Practical Interpretation: We conclude that the process is 
capable. 


Another form of the test statistic Z) in Equation 9-40 is occasionally encountered. Note 
that if X is the number of observations in a random sample of size n that belongs to a class of 
interest, then P = X/n is the sample proportion that belongs to that class. Now divide both 
numerator and denominator of Z) in Equation 9-40 by n, giving 


or 


2 X/n — po 
V Pol 1 = Po)/n 


(9-41) 
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This presents the test statistic in terms of the sample proportion instead of the number of items 
X in the sample that belongs to the class of interest. 

Minitab can be used to perform the test on a binomial proportion. The following Minitab 
output shows the results for Example 9-10. 


Test and CI for One Proportion 

Test of p = 0.05 vs p < 0.05 

Sample Xx N Sample p 95% Upper Bound Z-Value P-Value 
1 4 200 0.020000 0.036283 —1.95 0.026 


* Note * The normal approximation may be inaccurate for small samples. 


This output also shows a 95% one-sided upper-confidence bound on P. In Section 8-4 we 
showed how CIs on a binomial proportion are computed. This Minitab display shows the re- 
sult of using the normal approximation for tests and CIs. When the sample size is small, this 
may be inappropriate. 


Small Sample Tests on a Binomial Proportion 
Tests on a proportion when the sample size n is small are based on the binomial distribution, not 
the normal approximation to the binomial. To illustrate, suppose we wish to test Ho: p < po. Let 
X be the number of successes in the sample. The P-value for this test would be found from the 
lower tail of a binomial distribution with parameters n and po. Specifically, the P-value would be 
the probability that a binomial random variable with parameters n and py is less than or equal to X. 
P-values for the upper-tail one-sided test and the two-sided alternative are computed similarly. 
Minitab will calculate the exact P-value for a binomial test. The output below contains the 
exact P-value results for Example 9-10. 


Test of p = 0.05 vs p < 0.05 


Sample x N Sample p 95% Upper Bound Exact P-Value 
1 4 200 0.020000 0.045180 0.026 


The P-value is the same as that reported for the normal approximation, because the sample 
size is fairly large. Notice that the CI is different from the one found using the normal ap- 
proximation. 


9-5.2 Type II Error and Choice of Sample Size 


It is possible to obtain closed-form equations for the approximate B-error for the tests in 
Section 9-5.1. Suppose that p is the true value of the population proportion. The approximate 
8-error for the two-sided alternative H,: p # po is 


i o(™ — Pt Zaj2V Pol = Po) o(™ — P= 2ay2V Pol — pa (0-42) 
Vp(l — p)/n Vp(l — p)/n 
If the alternative is H,: p < po, 
_ Po ~ P~ ZeiPa— Po 
a ay — 
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whereas if the alternative is H,: p > po, 


Po — P + Za Vpo(l - pa) 


Vp(1 — p)/n oc 


These equations can be solved to find the approximate sample size n that gives a test of level 
« that has a specified B risk. The sample size equations are 


Approximate 
Sample Size for 
a Two-Sided Test 


Za2V Poll — po) + 2gV pC — p) |? 


on a Binomial 
Proportion 


a (9-45) 


for a two-sided alternative and 


Approximate 
Sample Size for ZaVpo(l — po) + zp /p(1 — p)]2 
a One-Sided Test n= = (9-46) 
on a Binomial p26 
Proportion 


for a one-sided alternative. 


EXAMPLE 9-11 Automobile Engine Controller Type II Error 


Consider the semiconductor manufacturer from Example 
9-10. Suppose that its process fallout is really p = 0.03. What 
is the B-error for a test of process capability that uses n = 200 
and a = 0.05? 

The B-error can be computed using Equation 9-43 as 
follows: 


0.05 — 0.03 — (1.645)V0.05(0.95)/200 
B=1-©® 
V0.03(1 — 0.03)/200 
1 — (—0.44) = 0.67 


Thus, the probability is about 0.7 that the semiconductor 
manufacturer will fail to conclude that the process is capable 
if the true process fraction defective is p = 0.03 (3%). That is, 
the power of the test against this particular alternative is only 
about 0.3. This appears to be a large B-error (or small power), 
but the difference between p = 0.05 and p = 0.03 is fairly 
small, and the sample size n = 200 is not particularly large. 


Suppose that the semiconductor manufacturer was will- 
ing to accept a B-error as large as 0.10 if the true value of the 
process fraction defective was p = 0.03. If the manufacturer 
continues to use a = 0.05, what sample size would be re- 
quired? 

The required sample size can be computed from Equation 
9-46 as follows: 


1.645-V0.05(0.95) + 1.28V0.03(0.97) 12 
0.03 — 0.05 


a= 
= 832 


where we have used p = 0.03 in Equation 9-46. 


Conclusion: Note that n = 832 is a very large sample 
size. However, we are trying to detect a fairly small deviation 
from the null value py = 0.05. 


Minitab will also perform power and sample size calculations for the one-sample Z-test on a 
proportion. Output from Minitab for the engine controllers tested in Example 9-10 follows. 
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Power and Sample Size 


Test for One Proportion 
Testing proportion = 0.05 (versus < 0.05) 


Alpha = 0.05 

Alternative Sample 

Proportion Size Power 
3.00E-02 200 0.3287 


Power and Sample Size 


Test for One Proportion 
Testing proportion = 0.05 (versus < 0.05) 


Alpha = 0.05 

Alternative Sample Target Actual 
Proportion Size Power Power 
3.00E-02 833 0.9000 0.9001 


Power and Sample Size 


Test for One Proportion 
Testing proportion = 0.05 (versus < 0.05) 


Alpha = 0.05 

Alternative Sample Target Actual 

Proportion Size Power Power 
3.00E-02 561 0.7500 0.7503 


The first part of the output shows the power calculation based on the situation described in 
Example 9-11, where the true proportion is really 0.03. The power calculation from Minitab 
agrees with the results from Equation 9-43 in Example 9-11. The second part of the output 
computes the sample size necessary for a power of 0.9 (8 = 0.1) if p = 0.03. Again, the 
results agree closely with those obtained from Equation 9-46. The final portion of the display 
shows the sample size that would be required if p = 0.03 and the power requirement is 
relaxed to 0.75. Notice that the sample size of n = 561 is still quite large because the differ- 
ence between p = 0.05 and p = 0.03 is fairly small. 


EXERCISES FOR SECTION 9-5 


9-84. Consider the computer output below. 9-85. Consider the computer output below. 


Test and Cl for One Proportion Test and Cl for One Proportion 
Test of p = 0.4 vs p not = 0.4 Test of p = 0.6 vs p < 0.6 
xX N_ Sample p 95% CI Z-Value P-Value x N Sample p 95% Upper Bound Z-Value P-Value 
98 275 ? (0.299759, 0.412968) ? 2 287 500 ? ? ? ? 
Using the normal approximation. (a) Is this a one-sided or a two-sided test? 
(a) Is this a one-sided or a two-sided test? (b) Is this a test based on the normal approximation? Is that 
(b) Complete the missing items. appropriate? 
(c) The normal approximation was used in the problem. Was (c) Complete the missing items. 

that appropriate? (d) Suppose that the alternative hypothesis was two-sided. 


What is the P-value for this situation? 


9-6 SUMMARY TABLE OF INFERENCE PROCEDURES FOR A SINGLE SAMPLE 329 


9-86. Suppose that 1000 customers are surveyed and 850 

are satisfied or very satisfied with a corporation’s products and 

services. 

(a) Test the hypothesis Ho: p = 0.9 against H,: p # 0.9 at 
a = 0.05. Find the P-value. 

(b) Explain how the question in part (a) could be answered by 
constructing a 95% two-sided confidence interval for p. 


9-87. Suppose that 500 parts are tested in manufacturing 

and 10 are rejected. 

(a) Test the hypothesis Ho: p = 0.03 against H,: p < 0.03 at 
a = 0.05. Find the P-value. 

(b) Explain how the question in part (a) could be answered by 
constructing a 95% one-sided confidence interval for p. 


9-88. A random sample of 300 circuits generated 13 

defectives. 

(a) Use the data to test Hy: p = 0.05 versus H,: p # 0.05. Use 
a = 0.05. Find the P-value for the test. 

(b) Explain how the question in part (a) could be answered 
with a confidence interval. 

9-89. An article in the British Medical Journal 

(“Comparison of Treatment of Renal Calculi by Operative 

Surgery, Percutaneous Nephrolithotomy, and Extra-Corporeal 

Shock Wave Lithotrips,” (1986, Vol. 292, pp. 879-882)] found 

that percutaneous nephrolithotomy (PN) had a success rate in 

removing kidney stones of 289 out of 350 patients. The tradi- 

tional method was 78% effective. 

(a) Is there evidence that the success rate for PN is greater 
than the historical success rate? Find the P-value. 

(b) Explain how the question in part (a) could be answered 
with a confidence interval. 


9-90. A manufacturer of interocular lenses is qualifying a 

new grinding machine and will qualify the machine if there is 

evidence that the percentage of polished lenses that contain 

surface defects does not exceed 2%. A random sample of 250 

lenses contains six defective lenses. 

(a) Formulate and test an appropriate set of hypotheses to de- 
termine if the machine can be qualified. Use a = 0.05. 
Find the P-value. 

(b) Explain how the question in part (a) could be answered 
with a confidence interval. 

9-91. A researcher claims that at least 10% of all football 

helmets have manufacturing flaws that could potentially cause 

injury to the wearer. A sample of 200 helmets revealed that 16 

helmets contained such defects. 


(a) Does this finding support the researcher’s claim? Use 
a = 0.01. Find the P-value. 

(b) Explain how the question in part (a) could be answered 
with a confidence interval. 


9-92. An article in Fortune (September 21, 1992) claimed 
that nearly one-half of all engineers continue academic studies 
beyond the B.S. degree, ultimately receiving either an M.S. or 
a Ph.D. degree. Data from an article in Engineering Horizons 

(Spring 1990) indicated that 117 of 484 new engineering 

graduates were planning graduate study. 

(a) Are the data from Engineering Horizons consistent with 
the claim reported by Fortune? Use a = 0.05 in reaching 
your conclusions. Find the P-value for this test. 

(b) Discuss how you could have answered the question in part 
(a) by constructing a two-sided confidence interval on p. 


9-93. The advertised claim for batteries for cell phones is 
set at 48 operating hours, with proper charging procedures. A 
study of 5000 batteries is carried out and 15 stop operating 
prior to 48 hours. Do these experimental results support the 
claim that less than 0.2 percent of the company’s batteries 
will fail during the advertised time period, with proper 
charging procedures? Use a hypothesis-testing procedure 
with a = 0.01. 


9-94. Arandom sample of 500 registered voters in Phoenix 

is asked if they favor the use of oxygenated fuels year-round 

to reduce air pollution. If more than 315 voters respond posi- 

tively, we will conclude that at least 60% of the voters favor 

the use of these fuels. 

(a) Find the probability of type I error if exactly 60% of the 
voters favor the use of these fuels. 

(b) What is the type II error probability 8 if 75% of the voters 
favor this action? 


9-95. In arandom sample of 85 automobile engine crank- 

shaft bearings, 10 have a surface finish roughness that exceeds 

the specifications. Does this data present strong evidence that 

the proportion of crankshaft bearings exhibiting excess 

surface roughness exceeds 0.10? 

(a) State and test the appropriate hypotheses using a = 0.05. 

(b) If it is really the situation that p = 0.15, how likely is it 
that the test procedure in part (a) will not reject the null 
hypothesis? 

(c) If p = 0.15, how large would the sample size have to be 
for us to have a probability of correctly rejecting the null 
hypothesis of 0.9? 


9-6 SUMMARY TABLE OF INFERENCE PROCEDURES 


FOR A SINGLE SAMPLE 


The table in the end papers of this book (inside back cover) presents a summary of all the 
single-sample inference procedures from Chapters 8 and 9. The table contains the null 
hypothesis statement, the test statistic, the various alternative hypotheses and the criteria 
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for rejecting Hy, and the formulas for constructing the 100(1 — a)% two-sided confidence 
interval. It would also be helpful to refer to the roadmap table in Chapter 8 that provides 
guidance to match the problem type to the information inside the back cover. 


9-7 TESTING FOR GOODNESS OF FIT 


Goodness of Fit 
Test Statistic 


The hypothesis-testing procedures that we have discussed in previous sections are designed 
for problems in which the population or probability distribution is known and the hypotheses 
involve the parameters of the distribution. Another kind of hypothesis is often encountered: 
We do not know the underlying distribution of the population, and we wish to test the hypoth- 
esis that a particular distribution will be satisfactory as a population model. For example, we 
might wish to test the hypothesis that the population is normal. 

We have previously discussed a very useful graphical technique for this problem called 
probability plotting and illustrated how it was applied in the case of a normal distribution. In 
this section, we describe a formal goodness-of-fit test procedure based on the chi-square 
distribution. 

The test procedure requires a random sample of size n from the population whose proba- 
bility distribution is unknown. These n observations are arranged in a frequency histogram, 
having k bins or class intervals. Let O; be the observed frequency in the ith class interval. From 
the hypothesized probability distribution, we compute the expected frequency in the ith class 
interval, denoted E;. The test statistic is 


k : : 
= — (9-47) 


It can be shown that, if the population follows the hypothesized distribution, X} has, approx- 
imately, a chi-square distribution with k — p — 1 degrees of freedom, where p represents the 
number of parameters of the hypothesized distribution estimated by sample statistics. This 
approximation improves as n increases. We should reject the null hypothesis that the popula- 
tion is the hypothesized distribution if the test statistic is too large. Therefore, the P-value 
would be the probability under the chi-square distribution with k — p — 1 degrees of freedom 
above the computed value of the test statistic yj or P = P(x7_ p-1> x6). For a fixed-level test, 
we would reject the hypothesis that the distribution of the population is the hypothesized dis- 
tribution if the calculated value of the test statistic xj > foe 

One point to be noted in the application of this test procedure concerns the magnitude of 
the expected frequencies. If these expected frequencies are too small, the test statistic X§ will 
not reflect the departure of observed from expected, but only the small magnitude of the 
expected frequencies. There is no general agreement regarding the minimum value of expected 
frequencies, but values of 3, 4, and 5 are widely used as minimal. Some writers suggest that an 
expected frequency could be as small as 1 or 2, so long as most of them exceed 5. Should an 
expected frequency be too small, it can be combined with the expected frequency in an adjacent 
class interval. The corresponding observed frequencies would then also be combined, and k 
would be reduced by 1. Class intervals are not required to be of equal width. 

We now give two examples of the test procedure. 


EXAMPLE 9-12 Printed Circuit Board Defects 
Poisson Distribution 

The number of defects in printed circuit boards is hypothe- 
sized to follow a Poisson distribution. A random sample of 
n = 60 printed boards has been collected, and the following 
number of defects observed. 


Number of Observed 
Defects Frequency 
0 32 
1 15 
D} 9 
3 4 


The mean of the assumed Poisson distribution in this ex- 
ample is unknown and must be estimated from the sample 
data. The estimate of the mean number of defects per board is 
the sample average, that is, (32-0 + 15-1 + 9-2 + 4-3)/60 = 
0.75. From the Poisson distribution with parameter 0.75, we 
may compute p;, the theoretical, hypothesized probability as- 
sociated with the ith class interval. Since each class interval 
corresponds to a particular number of defects, we may find the 
p; as follows: 


e %79(0.75)° 


pi = P(X =0)=—— = 0472 
e °-75(0,75)! 

pr = P(X = 1) = = 0.354 
e 79(0.75)? 

ps = P(X = 2) = = 0.133 


pa = P(X = 3) = 1 — (p1 + pp + ps) = 0.041 


The expected frequencies are computed by multiplying the 
sample size n = 60 times the probabilities p, That is, E; = 
np;. The expected frequencies follow: 


Number of Expected 
Defects Probability Frequency 

0 0.472 28.32 

1 0.354 21.24 

oD} 0.133 7.98 

3 (or more) 0.041 2.46 


EXAMPLE 9-13 Power Supply Distribution 
Continuous Distribution 

A manufacturing engineer is testing a power supply used in a 
notebook computer and, using a = 0.05, wishes to determine 
whether output voltage is adequately described by a normal 
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Since the expected frequency in the last cell is less than 3, we 
combine the last two cells: 


Number of Observed Expected 
Defects Frequency Frequency 

0 32 28.32 

1 15 21.24 

2 (or more) 13 10.44 


The chi-square test statistic in Equation 9-47 will have k — 
p-—1=3-1-1=1 degree of freedom, because the 
mean of the Poisson distribution was estimated from the data. 

The seven-step hypothesis-testing procedure may now be 
applied, using a = 0.05, as follows: 


1. Parameter of interest: The variable of interest is 
the form of the distribution of defects in printed 
circuit boards. 

2. Null hypothesis: Hy: The form of the distribution of 
defects is Poisson. 


3. Alternative hypothesis: H,: The form of the distri- 
bution of defects is not Poisson. 


4. Test statistic: The test statistic is 
k (0; - Ej) 


a 
Xo > E; 


i=1 


5. Reject Hy if: Reject Ho if the P-value is less than 
0.05. 


6. Computations: 


, __ 32 — 28.32) es 21.24) 
mm 28.32 21.24 
(13 — 10.44) 
a 


7. Conclusions: We find from Appendix Table III that 
Xb.10,1 = 2.71 and Xb.05.1 = 3.84. Because xj = 2.94 
lies between these values, we conclude that the P-value 
is between 0.05 and 0.10. Therefore, since the P-value 
exceeds 0.05 we are unable to reject the null hypothesis 
that the distribution of defects in printed circuit boards 
is Poisson. The exact P-value computed from Minitab 
is 0.0864. 


distribution. Sample estimates of the mean and standard devia- 
tion of ¥ = 5.04V and s = 0.08 V are obtained from a random 
sample of n = 100 units. 
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A common practice in constructing the class intervals for 
the frequency distribution used in the chi-square goodness-of- 
fit test is to choose the cell boundaries so that the expected 
frequencies E; = np; are equal for all cells. To use this method, 
we want to choose the cell boundaries do, a), ... , a, for the 
k cells so that all the probabilities 


qj 


are equal. Suppose we decide to use k = 8 cells. For the stan- 
dard normal distribution, the intervals that divide the scale into 
eight equally likely segments are [0, 0.32), [0.32, 0.675), [0.675, 
1.15), [1.15, ©), and their four “mirror image” intervals on the 
other side of zero. For each interval p; = 1/8 = 0.125, so the 
expected cell frequencies are E; = np;= 100(0.125) = 12.5. 
The complete table of observed and expected frequencies is 
as follows: 


Class Observed Expected 
Interval Frequency 0; Frequency £; 

x < 4.948 12 12.5 
4.948 = x < 4.986 14 12.5 
4.986 =x <5.014 12 12.5 
5.014 = x < 5.040 13 12.5 
5.040 = x < 5.066 12 12.5 
5.066 = x < 5.094 11 12.5 
5.094 = x < 5.132 12 12.5 
5.132 =x 14 12.5 
Totals 100 100 


The boundary of the first class interval is x — 1.15s = 4.948. 
The second class interval is [x — 1.15s, x — 0.675s) and so 


EXERCISES FOR SECTION 9-7 


forth. We may apply the seven-step hypothesis-testing proce- 
dure to this problem. 


1. Parameter of Interest: The variable of interest is 
the form of the distribution of power supply voltage. 

2. Null hypothesis: H): The form of the distribution is 
normal. 


3. Alternative hypothesis: H,: The form of the distri- 
bution is nonnormal. 


4. Test statistic: The test statistic is 


5. Reject Hy if: Since two parameters in the normal 
distribution have been estimated, the chi-square sta- 
tistic above will have kK —-p —-1=8—-2-—1=5 
degrees of freedom. We will use a fixed significance 
level test with a = 0.05. Therefore, we will reject 
Hy if X$ > Xb05.5 = 11.07. 

6. Computations: 


8 (0; — Ej? 
2 l L 
, ep ane 
(12-1257 | (14-125) | _ (14 = 12.5)? 
12.5 12.5 mo 12.5 
= 0.64 


7. Conclusions: Since yj = 0.64 < xGos5 = 11.07, 
we are unable to reject Hy, and there is no strong 
evidence to indicate that output voltage is not normally 
distributed. The P-value for the chi-square statistic 
X56 = 0.64 is P = 0.9861. 


9-96. Consider the following frequency table of observa- 
tions on the random variable X. 


Values 0 1 2 S 4 
Observed Frequency 24 30 31 11 4 


(a) Based on these 100 observations, is a Poisson distribution 
with a mean of 1.2 an appropriate model? Perform a 
goodness-of-fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 


9-97. Let X denote the number of flaws observed on a 
large coil of galvanized steel. Seventy-five coils are in- 


spected and the following data were observed for the values 
of X: 


Values 1 2 3 4 5 6 7 8 


Observed 
Frequency 1 11 8 13 11 12 10 9 


(a) Does the assumption of the Poisson distribution seem 
appropriate as a probability model for these data? Use 
a = 0.01. 

(b) Calculate the P-value for this test. 


9-98. The number of calls arriving at a switchboard 
from noon to 1:00 pM. during the business days Monday 
through Friday is monitored for six weeks (1.e., 30 days). Let X 
be defined as the number of calls during that one-hour period. 
The relative frequency of calls was recorded and reported as 


Value 5 6 8 9 10 
Relative 

Frequency 0.067 0.067 0.100 0.133 0.200 
Value 11 12 13 14 15 
Relative 

Frequency 0.133 0.133 0.067 0.033 0.067 


(a) Does the assumption of a Poisson distribution seem 
appropriate as a probability model for this data? Use a = 
0.05. 

(b) Calculate the P-value for this test. 

9-99. Consider the following frequency table of observa- 

tions on the random variable X: 


Values 0 1 2 3 4 
Frequency 4 21 10 13 2 


(a) Based on these 50 observations, is a binomial distribution 
with n = 6 and p = 0.25 an appropriate model? Perform 
a goodness-of-fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 

9-100. Define X as the number of underfilled bottles from a 

filling operation in a carton of 24 bottles. Seventy-five cartons 

are inspected and the following observations on X are recorded: 


Values 0 1 2 3 
Frequency 39 23 12 1 


(a) Based on these 75 observations, is a binomial distribution 
an appropriate model? Perform a goodness-of-fit proce- 
dure with a = 0.05. 

(b) Calculate the P-value for this test. 
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9-101. The number of cars passing eastbound through the in- 

tersection of Mill and University Avenues has been tabulated by 

a group of civil engineering students. They have obtained the 

data in the adjacent table: 

(a) Does the assumption of a Poisson distribution seem ap- 
propriate as a probability model for this process? Use 
a = 0.05. 

(b) Calculate the P-value for this test. 


Vehicles Vehicles 
per Observed per Observed 
Minute Frequency Minute Frequency 
40 14 as} 102 
41 24 54 96 
42 57 55 90 
43 111 56 81 
44 194 ST 73 
45 256 58 64 
46 296 59 61 
47 378 60 59 
48 250 61 50 
49 185 62 42 
50 171 63 29) 
51 150 64 18 
52 110 65 15 


9-102. Reconsider Exercise 6-71. The data were the number 
of earthquakes per year of magnitude 7.0 and greater since 
1900. 

(a) Use computer software to summarize these data into a fre- 
quency distribution. Test the hypothesis that the number of 
earthquakes of magnitude 7.0 or greater each year follows 
a Poisson distribution at a = 0.05. 

(b) Calculate the P-value for the test. 


Many times, the n elements of a sample from a population may be classified according to two 
different criteria. It is then of interest to know whether the two methods of classification are 
statistically independent; for example, we may consider the population of graduating 
engineers, and we may wish to determine whether starting salary is independent of academic 
disciplines. Assume that the first method of classification has r levels and that the second 
method has c levels. We will let O;; be the observed frequency for level i of the first classifica- 
tion method and level j on the second classification method. The data would, in general, appear 
as shown in Table 9-2. Such a table is usually called an r X c contingency table. 
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Table 9-2. Anr X c Contingency Table 


Columns 
1 2 c 
1 Orn On O,. 


Rows 


We are interested in testing the hypothesis that the row-and-column methods of classifi- 
cation are independent. If we reject this hypothesis, we conclude there is some interaction be- 
tween the two criteria of classification. The exact test procedures are difficult to obtain, but an 
approximate test statistic is valid for large n. Let p,; be the probability that a randomly selected 
element falls in the ijth cell, given that the two classifications are independent. Then p;; = u,v;, 
where u; is the probability that a randomly selected element falls in row class i and v; is the 
probability that a randomly selected element falls in column class j. Now, assuming inde- 
pendence, the estimators of u; and v; are 


nels 
i= > 0% 
j=l 
~ 1 
i= — DO% (9-48) 


= 1 fe F 
Ej = nu; j = n 0; Oj (9-49) 
j=l i=l 
Then, for large n, the statistic 
re (Oy — Ey 
= De ee (9-50) 
i=l j=l ij 


has an approximate chi-square distribution with (r — 1)(c — 1) degrees of freedom if the null 
hypothesis is true. We should reject the null hypothesis if the value of the test statistic x is too 
large. The P-value would be calculated as the probability beyond x on the Xr IXc— 1) distri- 
bution, or P = P(XG- (c-1) > Xd). For a fixed-level test, we would reject the hypothesis of in- 


dependence if the observed value of the test statistic yj exceeded Xeee-ie-1y 


EXAMPLE 9-14 Health Insurance Plan Preference 

A company has to choose among three health insurance plans. The opinions of a random sample of 500 employees are shown 
Management wishes to know whether the preference for plans in Table 9-3. 

is independent of job classification and wants to use a = 0.05. 


Table 9-3 Observed Data for Example 9-14 
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Table 9-4 Expected Frequencies for Example 9-14 


Health Insurance Plan Health Insurance Plan 
Job Classification 1 2 3 Totals Job Classification 1 2 3 Totals 
Salaried workers 160 140 40 340 Salaried workers 136 136 68 340 
Hourly workers 40 60 60 160 Hourly workers 64 64 32 160 
Totals 200 200 100 500 200 200 100 500 


To find the expected frequencies, we must first compute 
ij= (340/500) = 0.68, % = (160/500) = 0.32, }, = 
(200/500) = 0.40, % = (200/500) = 0.40, and >; = 
(100/500) = 0.20. The expected frequencies may now be 
computed from Equation 9-49. For example, the expected 
number of salaried workers favoring health insurance plan 1 is 


Ey, = niy>, = 500(0.68)(0.40) = 136 


The expected frequencies are shown in Table 9-4. 
The seven-step hypothesis-testing procedure may now be 


applied to this problem. 
1. Parameter of Interest: The variable of interest is 
employee preference among health insurance plans. 
2. Null hypothesis: Ho: Preference is independent of 
salaried versus hourly job classification. 
3. Alternative hypothesis: H,: Preference is not inde- 
pendent of salaried versus hourly job classification. 
4. Test statistic: The test statistic is 
ro¢ (0; E;)° 
ij i 
Xo > > E 
i=17=1 ij 
5. Reject Hjif: We will use a fixed-significance level test 


with a = 0.05. Therefore, since r = 2 and c = 3, the 


degrees of freedom for chi-square are (r — 1)(c — 1) 
= (1)(2) = 2, and we would reject Hp if x5 > Xd052 
= 5.99. 


Computations: 


J 
(160 — 136)? (140 — 136)? (40 — 68) 


136 136 68 
(40 - 64 (60-64)? (60 — 32) 


64 64 32 


= 49.63 


Conclusions: Since x§ = 49.63 > xé05.2 = 5.99, 
we reject the hypothesis of independence and con- 
clude that the preference for health insurance plans 
is not independent of job classification. The P-value 
for xj = 49.63 is P = 1.671 X 107". (This value 
was computed from computer software.) Further 
analysis would be necessary to explore the nature of 
the association between these factors. It might be 
helpful to examine the table of observed minus ex- 
pected frequencies. 


Using the two-way contingency table to test independence between two variables of 
classification in a sample from a single population of interest is only one application of con- 
tingency table methods. Another common situation occurs when there are r populations of 
interest and each population is divided into the same c categories. A sample is then taken from 
the ith population, and the counts are entered in the appropriate columns of the ith row. In this 
situation we want to investigate whether or not the proportions in the c categories are the same 
for all populations. The null hypothesis in this problem states that the populations are homo- 
geneous with respect to the categories. For example, when there are only two categories, such 
as success and failure, defective and nondefective, and so on, the test for homogeneity is really 
a test of the equality of r binomial parameters. Calculation of expected frequencies, determi- 
nation of degrees of freedom, and computation of the chi-square statistic for the test for ho- 
mogeneity are identical to the test for independence. 
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9-103. A company operates four machines in three shifts 
each day. From production records, the following data on the 
number of breakdowns are collected: 


Machines 
Shift A B Cc D 
1 41 20 12 16 
2 31 11 9 14 
3 15 17 16 10 


Test the hypothesis (using a = 0.05) that breakdowns are 
independent of the shift. Find the P-value for this test. 
9-104. Patients in a hospital are classified as surgical or 
medical. A record is kept of the number of times patients re- 
quire nursing service during the night and whether or not these 
patients are on Medicare. The data are presented here: 


Patient Category 

Medicare Surgical Medical 
Yes 46 52 
No 36 43 


Test the hypothesis (using a = 0.01) that calls by surgical- 
medical patients are independent of whether the patients are 
receiving Medicare. Find the P-value for this test. 

9-105. Grades in a statistics course and an operations 
research course taken simultaneously were as follows for a 
group of students. 


Operation Research Grade 


Statistics Grade A B C Other 
A 25 6 17 13 
B 17 16 15 6 
C 18 4 18 10 
Other 10 8 11 20 


Are the grades in statistics and operations research related? 
Use a = 0.01 in reaching your conclusion. What is the 
P-value for this test? 

9-106. An experiment with artillery shells yields the fol- 
lowing data on the characteristics of lateral deflections and 
ranges. Would you conclude that deflection and range are in- 
dependent? Use a = 0.05. What is the P-value for this test? 


Lateral Deflection 
Range (yards) Left Normal Right 
0-1,999 6 14 8 
2,000-5,999 9 11 4 
6,000-11,999 8 7 6 
9-107. A study is being made of the failures of an elec- 


tronic component. There are four types of failures possible and 
two mounting positions for the device. The following data 
have been taken: 


Failure Type 
Mounting Position A B C D 
1 22) 46 18 9 
2 4 17 6 12 


Would you conclude that the type of failure is independent of the 
mounting position? Use a = 0.01. Find the P-value for this test. 
9-108. Arandom sample of students is asked their opinions on 
a proposed core curriculum change. The results are as follows. 


Opinion 
Class Favoring Opposing 
Freshman 120 80 
Sophomore 70 130 
Junior 60 70 
Senior 40 60 


Test the hypothesis that opinion on the change is independent of 
class standing. Use a = 0.05. What is the P-value for this test? 


9-109. An article in the British Medical Journal 
(“Comparison of Treatment of Renal Calculi by Operative Sur- 
gery, Percutaneous Nephrolithotomy, and Extracorporeal Shock 
Wave Lithotripsy” (1986, Vol. 292, pp. 879-882)] found that 
percutaneous nephrolithotomy (PN) had a success rate in 
removing kidney stones of 289 out of 350 (83%) patients. 
However, when the stone diameter was considered, the results 
looked different. For stones of <2 cm, 87% (234/270) of cases 
were successful. For stones of =2 cm, a success rate of 69% 
(55/80) was observed for PN. 

(a) Are the successes and size of stones independent? Use 

a = 0.05. 
(b) Find the P-value for this test. 
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9-9 NONPARAMETRIC PROCEDURES 


Most of the hypothesis-testing and confidence interval procedures discussed previously are 
based on the assumption that we are working with random samples from normal populations. 
Traditionally, we have called these procedures parametric methods because they are based on 
a particular parametric family of distributions—in this case, the normal. Alternately, sometimes 
we say that these procedures are not distribution-free because they depend on the assumption 
of normality. Fortunately, most of these procedures are relatively insensitive to moderate de- 
partures from normality. In general, the t- and F-tests and the t-confidence intervals will have ac- 
tual levels of significance or confidence levels that differ from the nominal or advertised levels 
chosen by the experimenter, although the difference between the actual and advertised levels is 
usually fairly small when the underlying population is not too different from the normal. 

In this section we describe procedures called nonparametric and distribution-free meth- 
ods, and we usually make no assumptions about the distribution of the underlying population 
other than that it is continuous. These procedures have actual level of significance « or confi- 
dence level 10011 — a) % for many different types of distributions. These procedures have 
some appeal. One of their advantages is that the data need not be quantitative but can be cate- 
gorical (such as yes or no, defective or nondefective) or rank data. Another advantage is that 
nonparametric procedures are usually very quick and easy to perform. 

The procedures described in this chapter are alternatives to the parametric ¢- and F-procedures 
described earlier. Consequently, it is important to compare the performance of both parametric and 
nonparametric methods under the assumptions of both normal and nonnormal populations. In 
general, nonparametric procedures do not utilize all the information provided by the sample. 
As a result, a nonparametric procedure will be less efficient than the corresponding paramet- 
ric procedure when the underlying population is normal. This loss of efficiency is reflected by 
a requirement of a larger sample size for the nonparametric procedure than would be required 
by the parametric procedure in order to achieve the same power. On the other hand, this loss of 
efficiency is usually not large, and often the difference in sample size is very small. When the 
underlying distributions are not close to normal, nonparametric methods may have much to of- 
fer. They often provide improvement over the normal-theory parametric methods. Generally, 
if both parametric and nonparametric methods are applicable to a particular problem, we 
should use the more efficient parametric procedure. 

Another approach that can be used is to transform the original data, say, by taking loga- 
rithms, square roots, or a reciprocal, and then analyze the transformed data using a parametric 
technique. A normal probability plot often works well to see if the transformation has been 
successful. When this approach is successful, it is usually preferable to using a nonparametric 
technique. However, sometimes transformations are not satisfactory. That is, no transforma- 
tion makes the sample observations look very close to a sample from a normal distribution. 
One situation where this happens is when the data are in the form of ranks. These situations 
frequently occur in practice. For instance, a panel of judges may be used to evaluate 10 differ- 
ent formulations of a soft-drink beverage for overall quality, with the “best” formulation as- 
signed rank 1, the “next-best” formulation assigned rank 2, and so forth. It is unlikely that rank 
data satisfy the normality assumption. Transformations may not prove satisfactory either. 
Many nonparametric methods involve the analysis of ranks and consequently are directly 
suited to this type of problem. 


9-9.1 The Sign Test 


The sign test is used to test hypotheses about the median {i of a continuous distribution. The 
median of a distribution is a value of the random variable X such that the probability is 0.5 that 
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an observed value of X is less than or equal to the median, and the probability is 0.5 that an ob- 
served value of X is greater than or equal to the median. That is, P(X = ji) = P(X = fi) = 0.5 

Since the normal distribution is symmetric, the mean of a normal distribution equals the 
median. Therefore, the sign test can be used to test hypotheses about the mean of a normal dis- 
tribution. This is the same problem for which we previously used the t-test. We will briefly dis- 
cuss the relative merits of the two procedures in Section 9-9.3 Note that, although the /-test 
was designed for samples from a normal distribution, the sign test is appropriate for samples 
from any continuous distribution. Thus, the sign test is a nonparametric procedure. 

Suppose that the hypotheses are 


Ay: h = Bo 
Ay: < po (9-51) 
The test procedure is easy to describe. Suppose that X,, X5,..., X, is arandom sample from the 


population of interest. Form the differences 
X; — fio, i=1,2,...,n (9-52) 


Now if the null hypothesis Ho: {i = jig is true, any difference X; — jin is equally likely to 
be positive or negative. An appropriate test statistic is the number of these differences that are 
positive, say, R*. Therefore, to test the null hypothesis we are really testing that the number of 
plus signs is a value of a binomial random variable that has the parameter p = 1/2. A P-value 
for the observed number of plus signs r* can be calculated directly from the binomial distri- 
bution. For instance, in testing the hypotheses in Equation 9-51, we will reject H, in favor of 
H, only if the proportion of plus signs is sufficiently less than 1/2 (or equivalently, whenever 
the observed number of plus signs 7* is too small). Thus, if the computed P-value 


1 
p= P(e =" when p = +) 


is less than or equal to some preselected significance level a, we will reject Hy and conclude 
HA, is true. 
To test the other one-sided hypotheses 


A: |b = po 
A: ph > jo (9-53) 


we will reject H, in favor of H, only if the observed number of plus signs, say, r*, is large or, 
equivalently, whenever the observed fraction of plus signs is significantly greater than 1/2. 
Thus, if the computed P-value 


1 
p= P(R* ae when p = 4) 


is less than a, we will reject Hy, and conclude that H, is true. 
The two-sided alternative may also be tested. If the hypotheses are 
Ao: & = Bo 
Hy: & # flo (9-54) 
we should reject Ho: jt = fio if the proportion of plus signs is significantly different from 
(either less than or greater than) 1/2. This is equivalent to the observed number of plus signs 7* 
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being either sufficiently large or sufficiently small. Thus, if r* < n/2, the P-value is 


1 
P= 2P( < r* when p = x) 


and if r* > n/2, the P-value is 
1 
P= 2P(R* = r* when p = x) 


If the P-value is less than some preselected level «, we will reject H, and conclude that H, is true. 


EXAMPLE 9-15 Propellant Shear Strength Sign Test 
Montgomery, Peck, and Vining (2006) report on a study in 
which a rocket motor is formed by binding an igniter propel- 
lant and a sustainer propellant together inside a metal housing. 7 ; 
The shear strength of the bond between the two propellant 2. Null hypothesis: Ho: 4. = 2000 psi 
types is an important characteristic. The results of testing 20 Alternative hypothesis: H,: (§ # 2000 psi 
randomly selected nO are shown in Table 9-5. We would Test statistic: The test statistic is the observed num- 
a test the ee that the median shear strength is ber of plus differences in Table 9-5, or r* = 14. 

Lia ve eae . : ' 5. Reject H, if: We will reject Hy if the P-value corre- 

This problem can be solved using the eight-step hypothesis- ; if . 

5 : sponding to r’ = 14 is less than or equal to 

testing procedure: 


1. Parameter of Interest: The parameter of interest is 
the median of the distribution of propellant shear 
strength. 


> 


a = 0.05. 
Table 9-5 Propellant Shear Strength Data 
Observation Shear Strength Differences 
i X; x; — 2000 Sign 
1 2158.70 +158.70 a 
2 1678.15 —321.85 - 
3 2316.00 +316.00 + 
4 2061.30 +61.30 + 
5 2207.50 +207.50 ap 
6 1708.30 —291.70 = 
i 1784.70 —215.30 = 
8 2575.10 +575.10 + 
9 2357.90 +357.90 ap 
10 2256.70 +256.70 + 
11 2165.20 +165.20 ate 
12 2399.55 +399.55 + 
13} 1779.80 —220.20 = 
14 2336.75 +336.75 + 
15 1765.30 —234.70 = 
16 2053.50 +53.50 + 
17 2414.40 +414.40 a 
18 2200.50 +200.50 te 
19 2654.20 +654.20 ap 
20 1753.70 — 246.30 = 
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6. Computations: Since r* = 14 is greater than 7. Conclusions: Since P = 0.1153 is not less than a = 0.05, 


n/2 = 20/2 = 10, we calculate the P-value from we cannot reject the null hypothesis that the median shear 
strength is 2000 psi. Another way to say this is that the ob- 
P=2pP (e > 14 when p = +) served number of plus signs r* = 14 was not large or small 
2 enough to indicate that median shear strength is different 
20/79 from 2000 psi at the a = 0.05 level of significance. 
=2> ( Jo sos * 
r=14\\T 
= 0.1153 


Normal 
Approximation 
for Sign Test 
Statistic 


It is also possible to construct a table of critical values for the sign test. This table is shown as 
Appendix Table VIII. The use of this table for the two-sided alternative hypothesis in Equation 
9-54 is simple. As before, let R* denote the number of the differences (X; — fig) that are positive 
and let R~ denote the number of these differences that are negative. Let R = min 
(R*, R’). Appendix Table VIII presents critical values r* for the sign test that ensure that P (type 
I error) = P (reject Hp when H) is true) = a for a = 0.01, a = 0.05 and a = 0.10. If the 
observed value of the test statistic r < r*, the null hypothesis Ho: {1 = [io should be rejected. 

To illustrate how this table is used, refer to the data in Table 9-5 that were used in Example 
9-15. Now r* = 14 andr” = 6; therefore, r = min (14, 6) = 6. From Appendix Table VIII 
with n = 20 and a = 0.05, we find that ré 9; = 5. Since r = 6 is not less than or equal to the 
critical value r%9; = 5, we cannot reject the null hypothesis that the median shear strength is 
2000 psi. 

We can also use Appendix Table VIII for the sign test when a one-sided alternative 
hypothesis is appropriate. If the alternative is H): J > flo, reject Hp: P = Po ifr = ré; 
if the alternative is H): [i > fig, reject Ho: fi = fio if r* < r*. The level of significance of a 
one-sided test is one-half the value for a two-sided test. Appendix Table VIII shows the one- 
sided significance levels in the column headings immediately below the two-sided levels. 

Finally, note that when a test statistic has a discrete distribution such as R does in the sign 
test, it may be impossible to choose a critical value r* that has a level of significance exactly 
equal to a. The approach used in Appendix Table VII is to choose r* to yield an a that is as 
close to the advertised significance level a as possible. 


Ties in the Sign Test 

Since the underlying population is assumed to be continuous, there is a zero probability that 
we will find a “tie” —that is, a value of X; exactly equal to ji9. However, this may sometimes 
happen in practice because of the way the data are collected. When ties occur, they should be 
set aside and the sign test applied to the remaining data. 


The Normal Approximation 

When p = 0.5, the binomial distribution is well approximated by a normal distribution when 
n is at least 10. Thus, since the mean of the binomial is np and the variance is np(1 — p), the 
distribution of R* is approximately normal with mean 0.5n and variance 0.25n whenever n is 
moderately large. Therefore, in these cases the null hypothesis Ho: ) = [to can be tested using 
the statistic 


(9-55) 
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A P-value approach could be used for decision making. The fixed significance level approach 
could also be used. 

The two-sided alternative would be rejected if the observed value of the test statistic 
| Zo| > Zay2, and the critical regions of the one-sided alternative would be chosen to reflect the 
sense of the alternative. (If the alternative is H): > io, reject Ho if z) > z,, for example.) 


Type II Error for the Sign Test 

The sign test will control the probability of type I error at an advertised level a for testing the 
null hypothesis Ho: # = p for any continuous distribution. As with any hypothesis-testing 
procedure, it is important to investigate the probability of a type IJ error, 8. The test should be 
able to effectively detect departures from the null hypothesis, and a good measure of this 
effectiveness is the value of 8 for departures that are important. A small value of B implies an 
effective test procedure. 

In determining 8, it is important to realize not only that a particular value of [1, say, fig + A, 
must be used but also that the form of the underlying distribution will affect the calculations. To 
illustrate, suppose that the underlying distribution is normal with o = 1 and we are testing the 
hypothesis Ho: f) = 2 versus H,: > 2. (Since J = p in the normal distribution, this is equiv- 
alent to testing that the mean equals 2.) Suppose that it is important to detect a departure from 
jt = 2 to p = 3. The situation is illustrated graphically in Fig. 9-15(a). When the alternative 
hypothesis is true (H: (1 = 3), the probability that the random variable X is less than or equal to the 
value 2 is 


P(X <2) = P(Z < -1) = ®(-1) = 0.1587 


Suppose we have taken a random sample of size 12. At the « = 0.05 level, Appendix Table VIII 
indicates that we would reject Hp: Wp = 2ifr” Sr§o; = 2. Therefore, B is the probability that 
we do not reject Hy: ) = 2 when in fact 1 = 3, or 


2 
B=1-)> (1?) o.rssryo.ssis)® = 0.2944 


x=0\% 


If the distribution of X had been exponential rather than normal, the situation would be as 
shown in Fig. 9-15(b), and the probability that the random variable X is less than or equal 
to the value x = 2 when pt = 3 (note that when the median of an exponential distribution 
is 3, the mean is 4.33) is 


e433" dx = 0.3699 


2 
1 
rx s2)= | 
) 4 


In this case, 


2 
B=1-> (7)(0.3699) (0.6301) = 0.8794 


x=0 \*X 


Thus, B for the sign test depends not only on the alternative value of jt but also on the area 
to the right of the value specified in the null hypothesis under the population probability 
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o=1 o=1 
0.1587 

-l1 0 1 2 3 4 5 x -1 0 1 2 3 4 5 6 x 

Under Hp: w= 2 Under H, : =3 

(a) 
0.3699 
p=2 p=2.89 x 2 = 4.33 x 
Under Hp: p=2 Under H, : 1 =3 


(b) 


Figure 9-15 Calculation of B for the sign test. (a) Normal distributions. (b) Exponential 
distributions. 


distribution. This area is highly dependent on the shape of that particular probability 
distribution. In this example, 8 is large so the ability of the test to detect this departure from 
the null hypothesis with the current sample size is poor. 


The Wilcoxon Signed-Rank Test 


The sign makes use only of the plus and minus signs of the differences between the observa- 
tions and the median jp (or the plus and minus signs of the differences between the observa- 
tions in the paired case). It does not take into account the size or magnitude of these differ- 
ences. Frank Wilcoxon devised a test procedure that uses both direction (sign) and magnitude. 
This procedure, now called the Wilcoxon signed-rank test, is discussed and illustrated in this 
section. 

The Wilcoxon signed-rank test applies to the case of symmetric continuous distribu- 
tions. Under these assumptions, the mean equals the median, and we can use this procedure to 
test the null hypothesis pp = [Up. 


The Test Procedure 

We are interested in testing Hp: w = Wo against the usual alternatives. Assume that X,, 
X,...,X,, is arandom sample from a continuous and symmetric distribution with mean (and 
median) w. Compute the differences X; — wo, i = 1, 2,...,. Rank the absolute differences 
|X; — pol, = 1,2,..., in ascending order, and then give the ranks the signs of their 
corresponding differences. Let W* be the sum of the positive ranks and W” be the absolute 
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value of the sum of the negative ranks, and let W = min(W~, W). Appendix Table IX contains 
critical values of W, say, w%. If the alternative hypothesis is H;: w # wo, then if the ob- 
served value of the statistic w = w%, the null hypothesis Hp: w = jo is rejected. Appendix 
Table IX provides significance levels of a = 0.10, a = 0.05, a = 0.02, a = 0.01 for the 
two-sided test. 

For one-sided tests, if the alternative is H): ~ > po, reject Hp: w = poifw SS w%; andif 
the alternative is H): w < po, reject Hy: ~ = wo if w~ < w*. The significance levels for one- 
sided tests provided in Appendix Table IX are a = 0.05, 0.025, 0.01, and 0.005. 


EXAMPLE 9-16 Propellant Shear Strength Wilcoxon Signed-Rank Test 


We will illustrate the Wilcoxon signed-rank test by applying it 


to the propellant shear strength data from Table 9-5. Assume = aca rat 
that the underlying distribution is a continuous symmetric dis- 20 — 246.30 —10 
tribution. The seven-step procedure is applied as follows: 10 +256.70 aril 
1. Parameter of Interest: The parameter of interest is ee = 

the mean (or median) of the distribution of propel- 3 +316.00 +13 

lant shear strength. 2 —321.85 —14 

Null hypothesis: Ho: 1. = 2000 psi 14 +336.75 +15 
Alternative hypothesis: H,: 1 # 2000 psi 9 +357.90 +16 

Test statistic: The test statistic is 12 $399.55 als) 

at ee 17 +414.40 +18 

ee dee 8 +575.10 +19 

5. Reject Hy if: We will reject Hy if w = wis = 52 19 +654.20 +20 


from Appendix Table IX. 


6. Computations: The signed ranks from Table 9-5 
are shown in the following display: 


The sum of the positive ranks is w* = (1 +2+34+4 
+54+6+4+ 114+ 134+ 154+ 16+ 174+ 18+ 19 4 


Difference 20) = 150, and the sum of the absolute values of the neg- 
Observation x; — 2000 Signed Rank ative ranks is w =(7+8+4+9+4+10+ 12+ 14) = 
16 453.50 aLj| 60. Therefore, 
hi baa — w = min(150, 60) = 60 
1 +158.70 ara) 
11 +165.20 +4 : : ; 
7. Conclusions: Since w = 60 is not less than or equal 
le aoe aS to the critical value woo; = 52, we cannot reject the 
5 +207.50 +6 null hypothesis that the mean (or median, since the 
7 —215.30 =) population is assumed to be symmetric) shear 
13 —220.20 -8 strength is 2000 psi. 
continued 


Ties in the Wilcoxon Signed-Rank Test 

Because the underlying population is continuous, ties are theoretically impossible, although 
they will sometimes occur in practice. If several observations have the same absolute magni- 
tude, they are assigned the average of the ranks that they would receive if they differed slightly 
from one another. 
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Normal 
Approximation 
for Wilcoxon 
Signed-Rank 
Statistic 


Large Sample Approximation 
If the sample size is moderately large, say, n > 20, it can be shown that W* (or W_) has 
approximately a normal distribution with mean 


n(n + 1) 
eat 
ar 
and variance 
n(n + 1)(2n + 1) 
o4 = 


24 


Therefore, a test of Hp: 4 = by can be based on the statistic: 


Wt — n(n + 1)/4 
Vn(n + 1)(2n + 1)/24 


Ze (9-56) 


An appropriate critical region for either the two-sided or one-sided alternative hypotheses can 
be chosen from a table of the standard normal distribution. 


9-9.3 Comparison to the t-Test 


If the underlying population is normal, either the sign test or the t-test could be used to test 
a hypothesis about the population median. The f-test is known to have the smallest value of 
8 possible among all tests that have significance level a for the one-sided alternative and for 
tests with symmetric critical regions for the two-sided alternative, so it is superior to the 
sign test in the normal distribution case. When the population distribution is symmetric and 
nonnormal (but with finite mean), the t-test will have a smaller B (or a higher power) than 
the sign test, unless the distribution has very heavy tails compared with the normal. Thus, 
the sign test is usually considered a test procedure for the median rather than as a serious 
competitor for the t-test. The Wilcoxon signed-rank test is preferable to the sign test and 
compares well with the t-test for symmetric distributions. It can be useful in situations 
where a transformation on the observations does not produce a distribution that is reason- 
ably close to the normal. 


EXERCISES FOR SECTION 9-9 


9-110. Ten samples were taken from a plating bath used in (a) Do the sample data indicate that this statement is correct? 
an electronics manufacturing process, and the bath pH was de- Use the sign test with a = 0.05 to investigate this hypoth- 
termined. The sample pH values are 7.91, 7.85, 6.82, 8.01, esis. Find the P-value for this test. 

7.46, 6.95, 7.05, 7.35, 7.25, and 7.42. Manufacturing engi- (b) Use the normal approximation for the sign test to test 
neering believes that pH has a median value of 7.0. Ho: j& = 7.0 versus H,: f. # 7.0. What is the P-value for 


this test? 


9-111. The titanium content in an aircraft-grade alloy is an 
important determinant of strength. A sample of 20 test 
coupons reveals the following titanium content (in percent): 


8.32, 8.05, 8.93, 8.65, 8.25, 8.46, 8.52, 8.35, 8.36, 8.41, 8.42, 
8.30, 8.71, 8.75, 8.60, 8.83, 8.50, 8.38, 8.29, 8.46 


The median titanium content should be 8.5%. 

(a) Use the sign test with a = 0.05 to investigate this hypoth- 
esis. Find the P-value for this test. 

(b) Use the normal approximation for the sign test to test 
Ho: j = 8.5 versus A: i # 8.5, with a = 0.05. What is 
the P-value for this test? 

9-112. The impurity level (in ppm) is routinely measured in 

an intermediate chemical product. The following data were 

observed in a recent test: 


2.4, 2.5, 1.7, 1.6, 1.9, 2.6, 1.3, 1.9, 2.0, 2.5, 2.6, 2.3, 2.0, 1.8, 
1.3, 1.7, 2.0, 1.9, 2.3, 1.9, 2.4, 1.6 


Can you claim that the median impurity level is less than 

2.5 ppm? 

(a) State and test the appropriate hypothesis using the sign 
test with a = 0.05. What is the P-value for this test? 

(b) Use the normal approximation for the sign test to test 
A: fi = 2.5 versus Hy: jf < 2.5. What is the P-value for 
this test? 

9-113. Consider the margarine fat content data in Exercise 

8-36. Use the sign test to test Hp: = 17.0 versus 

Ay: & # 17.0 with a = 0.05. 

(a) Find the P-value for the test statistic and use this quantity 
to make your decision. 

(b) Use the normal approximation to test the same hypothesis 
that you formulated in part (a). What is the P-value for 
this test? 


9-114. Consider the compressive strength data in 

Exercise 8-37. 

(a) Use the sign test to investigate the claim that the median 
strength is at least 2250 psi. Use a = 0.05. 

(b) Use the normal approximation to test the same hypothesis 
that you formulated in part (a). What is the P-value for 
this test? 

9-115. The diameter of a ball bearing was measured by an 

inspector using a new type of caliper. The results were as fol- 

lows (in mm): 0.265, 0.263, 0.266, 0.267, 0.267, 0.265, 0.267, 

0.267, 0.265, 0.268, 0.268, and 0.263. 

(a) Use the Wilcoxon signed-rank test to evaluate the claim 
that the mean ball diameter is 0.265 mm. Use a = 0.05. 

(b) Use the normal approximation for the test. With a = 0.05, 
what conclusions can you draw? 

9-116. A new type of tip can be used in a Rockwell hard- 

ness tester. Eight coupons from test ingots of a nickel-based 

alloy are selected, and each coupon is tested using the new tip. 
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The Rockwell C-scale hardness readings are 63, 65, 58, 60, 
55, 57, 53, and 59. Do the results support the claim that the 
mean hardness exceeds 60 at a 0.05 level? 


9-117. A primer paint can be used on aluminum panels. The 
drying time of the primer is an important consideration in the 
manufacturing process. Twenty panels are selected and the dry- 
ing times are as follows: 1.6, 1.3, 1.5, 1.6, 1.7, 1.9, 1.8, 1.6, 1.4, 
1.8, 1.9, 1.8, 1.7, 1.5, 1.6, 1.4, 1.3, 1.6, 1.5, and 1.8. Is there ev- 
idence that the mean drying time of the primer exceeds 1.5 hr? 


Supplemental Exercises 


9-118. Consider the computer output below. 


One-Sample Z: 
Test of mu = 26 vs > 26 
The assumed standard deviation = 1.5 


Variable N StDev 
x ? 2.032 


SE Mean Z P 
0.401 ? ? 


Mean 
26.541 


(a) Fill in the missing information. 

(b) Is this a one-sided or a two-sided test? 
(c) What are your conclusions if a = 0.05? 
(d) Find a 95% two-sided CI on the mean. 


9-119. Consider the computer output below. 


One-Sample T: 


Test of mu = 100 vs not = 100 


Variable N Mean StDev SEMean 95%CI T P 
xX 16 98.33 4.61 ? (?, ?) ary 


(a) How many degrees of freedom are there on the f-statistic? 

(b) Fill in the missing information. You may use bounds on 
the P-value. 

(c) What are your conclusions if a = 0.05? 

(d) What are your conclusions if the hypothesis is Hp: w = 
100 versus Ho: . > 100? 


9-120. Consider the computer output below. 


One-Sample T: 


Test of mu = 85 vs < 85 


Variable N 
x 25 


StDev 
84.331 ? 


SE Mean T P 
0.631 ? ? 


Mean 


(a) How many degrees of freedom are there on the t-statistic? 

(b) Fill in the missing information. You may use bounds on 
the P-value. 

(c) What are your conclusions if a = 0.05? 

(d) Find a 95% upper-confidence bound on the mean. 

(e) What are your conclusions if the hypothesis is Ho: w = 
100 versus Ho: p. > 100? 


9-121. An article in Transfusion Science (“Early Total 
White Blood Cell Recovery Is a Predictor of Low Number of 
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Apheresis and Good CD34+ Cell Yield” (Vol. 23, 2000, 
pp. 91-100)] studied the white blood cell recovery of patients 
with haematological malignancies after a new chemotherapy 
treatment. Data (in days) on white blood cell recovery (WBC) 
for 19 patients consistent with summary data reported in the 
paper follow: 18, 16, 13, 16, 15, 12, 9, 14, 12, 8, 16, 12, 10, 8, 
14, 9, 5, 18, and 12. 

(a) Is there sufficient evidence to support a claim that the 

mean WBC recovery exceeds 12 days? 
(b) Find a 95% two-sided CI on the mean WBC recovery. 


9-122. An article in Fire Technology [“An Experimental 
Examination of Dead Air Space for Smoke Alarms” (Vol. 45, 
2009, pp. 97-115)] studied the performance of smoke detec- 
tors installed not less than 100 mm from any adjoining wall if 
mounted on a flat ceiling, and not closer than 100 mm and not 
farther than 300 mm from the adjoining ceiling surface if 
mounted on walls. The purpose of this rule is to avoid installa- 
tion of smoke alarms in the “dead air space,’ where it is 
assumed to be difficult for smoke to reach. A number of inter- 
esting experiments were described in the paper. Results on the 
time to signal (in seconds) for one such experiment with pine 
stick fuel in an open bedroom using photoelectric smoke 
alarms are as follows: 220, 225, 297, 315, 282, and 313. 
(a) Is there sufficient evidence to support a claim that the 
mean time to signal is less than 300 seconds? 
(b) Is there practical concern about the assumption of a nor- 
mal distribution as a model for the time-to-signal data? 
(c) Find a 95% two-sided CI on the mean time to signal. 


9-123. Suppose we wish to test the hypothesis Hp: w = 85 
versus the alternative H,: p > 85 where o = 16. Suppose that 
the true mean is 1 = 86 and that in the practical context of the 
problem this is not a departure from jy = 85 that has practical 
significance. 

(a) For a test with a = 0.01, compute B for the sample sizes 
n = 25, 100, 400, and 2500 assuming that pp = 86. 

(b) Suppose the sample average is x = 86. Find the P-value 
for the test statistic for the different sample sizes speci- 
fied in part (a). Would the data be statistically significant 
ata = 0.01? 

(c) Comment on the use of a large sample size in this problem. 

9-124. A manufacturer of semiconductor devices takes a 

random sample of size n of chips and tests them, classifying 

each chip as defective or nondefective. Let X; = 0 if the chip 
is nondefective and X; = 1 if the chip is defective. The sample 
fraction defective is 


n 


Xt X, t+ X, 
p= 


n 


What are the sampling distribution, the sample mean, and 
sample variance estimates of j when 

(a) The sample size is n = 50? 

(b) The sample size is n = 80? 

(c) The sample size is n = 100? 
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(d) Compare your answers to parts (a)—(c) and comment on 
the effect of sample size on the variance of the sampling 
distribution. 


9-125. Consider the situation of Exercise 9-124. After col- 

lecting a sample, we are interested in testing Hp: p = 0.10 versus 

H,: p # 0.10 with a = 0.05. For each of the following situa- 

tions, compute the p-value for this test: 

(a) n = 50, p = 0.095 

(b) n = 100, p = 0.095 

(c) n = 500, p = 0.095 

(d) n = 1000, p = 0.095 

(e) Comment on the effect of sample size on the observed 
P-value of the test. 


9-126. An inspector of flow metering devices used to admin- 
ister fluid intravenously will perform a hypothesis test to 
determine whether the mean flow rate is different from the flow 
rate setting of 200 milliliters per hour. Based on prior 
information, the standard deviation of the flow rate is assumed 
to be known and equal to 12 milliliters per hour. For each of the 
following sample sizes, and a fixed a = 0.05, find the probabil- 
ity of a type II error if the true mean is 205 milliliters per hour. 
(a) n= 20 

(b) n = 50 

(c) n= 100 

(d) Does the probability of a type II error increase or decrease 

as the sample size increases? Explain your answer. 


9-127. Suppose that in Exercise 9-126, the experimenter 

had believed that o = 14. For each of the following sample 

sizes, and a fixed a = 0.05, find the probability of a type II 

error if the true mean is 205 milliliters per hour. 

(a) n= 20 

(b) n = 50 

(c) n= 100 

(d) Comparing your answers to those in Exercise 9-126, does 
the probability of a type II error increase or decrease with 
the increase in standard deviation? Explain your answer. 


9-128. The marketers of shampoo products know that cus- 

tomers like their product to have a lot of foam. A manufacturer 

of shampoo claims that the foam height of his product exceeds 

200 millimeters. It is known from prior experience that the 

standard deviation of foam height is 8 millimeters. For each of 

the following sample sizes, and a fixed a = 0.05, find the 

power of the test if the true mean is 204 millimeters. 

(a) n= 20 

(b) n = 50 

(c) n= 100 

(d) Does the power of the test increase or decrease as the sam- 
ple size increases? Explain your answer. 


9-129. Suppose we are testing Ho: p = 0.5 versus Ho: p # 0.5. 

Suppose that p is the true value of the population proportion. 

(a) Using a = 0.05, find the power of the test for n = 100, 
150, and 300 assuming that p = 0.6. Comment on the 
effect of sample size on the power of the test. 


(b) Using a = 0.01, find the power of the test for n = 100, 
150, and 300 assuming that p = 0.6. Compare your an- 
swers to those from part (a) and comment on the effect of 
a on the power of the test for different sample sizes. 

(c) Using a = 0.05, find the power of the test form = 100, as- 
suming p = 0.08. Compare your answer to part (a) and 
comment on the effect of the true value of p on the power 
of the test for the same sample size and a level. 

(d) Using a = 0.01, what sample size is required if p = 0.6 
and we want B = 0.05? What sample is required if p = 0.8 
and we want B = 0.05? Compare the two sample sizes and 
comment on the effect of the true value of p on sample 
size required when B is held approximately constant. 


9-130. The cooling system in a nuclear submarine consists 

of an assembly of welded pipes through which a coolant is cir- 

culated. Specifications require that weld strength must meet or 
exceed 150 psi. 

(a) Suppose that the design engineers decide to test the 
hypothesis Hp: » = 150 versus H,: w > 150. Explain 
why this choice of alternative hypothesis is better than 
Ay: pw < 150. 

(b) A random sample of 20 welds results in x = 153.7 psi and 
s = 11.3 psi. What conclusions can you draw about the 
hypothesis in part (a)? State any necessary assumptions 
about the underlying distribution of the data. 


9-131. The mean pull-off force of an adhesive used in man- 
ufacturing a connector for an automotive engine application 
should be at least 75 pounds. This adhesive will be used unless 
there is strong evidence that the pull-off force does not meet 
this requirement. A test of an appropriate hypothesis is to be 
conducted with sample size n = 10 and a = 0.05. Assume that 
the pull-off force is normally distributed, and o is not known. 
(a) If the true standard deviation is o = 1, what is the risk that 
the adhesive will be judged acceptable when the true mean 
pull-off force is only 73 pounds? Only 72 pounds? 
(b) What sample size is required to give a 90% chance of 
detecting that the true mean is only 72 pounds when o = 1? 
(c) Rework parts (a) and (b) assuming that 0 = 2. How much 
impact does increasing the value of o have on the answers 
you obtain? 


9-132. A manufacturer of precision measuring instruments 

claims that the standard deviation in the use of the instruments 

is at most 0.00002 millimeter. An analyst, who is unaware of 
the claim, uses the instrument eight times and obtains a sam- 
ple standard deviation of 0.00001 millimeter. 

(a) Confirm using a test procedure and an a level of 0.01 that 
there is insufficient evidence to support the claim that the 
standard deviation of the instruments is at most 0.00002. 
State any necessary assumptions about the underlying 
distribution of the data. 

(b) Explain why the sample standard deviation, s = 0.00001, 
is less than 0.00002, yet the statistical test procedure 
results do not support the claim. 
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9-133. <A biotechnology company produces a therapeutic 
drug whose concentration has a standard deviation of 4 grams 
per liter. A new method of producing this drug has been pro- 
posed, although some additional cost is involved. Management 
will authorize a change in production technique only if the 
standard deviation of the concentration in the new process is 
less than 4 grams per liter. The researchers chose n = 10 and 
obtained the following data in grams per liter. Perform the nec- 
essary analysis to determine whether a change in production 
technique should be implemented. 


16.628 16.630 
16.622 16.631 
16.627 16.624 
16.623 16.622 
16.618 16.626 


9-134. Consider the 40 observations collected on the num- 
ber of nonconforming coil springs in production batches of 
size 50 given in Exercise 6-93. 

(a) Based on the description of the random variable and these 
40 observations, is a binomial distribution an appropriate 
model? Perform a goodness of fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 


9-135. Consider the 20 observations collected on the num- 
ber of errors in a string of 1000 bits of a communication chan- 
nel given in Exercise 6-94. 

(a) Based on the description of the random variable and these 
20 observations, is a binomial distribution an appropriate 
model? Perform a goodness of fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 


9-136. Consider the spot weld shear strength data in Exercise 
6-31. Does the normal distribution seem to be a reasonable 
model for these data? Perform an appropriate goodness of fit 
test to answer this question. 


9-137. Consider the water quality data in Exercise 6-32. 

(a) Do these data support the claim that mean concentration 
of suspended solids does not exceed 50 parts per million? 
Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) Does the normal distribution seem to be a reasonable 
model for these data? Perform an appropriate goodness of 
fit test to answer this question. 


9-138. Consider the golf ball overall distance data in 

Exercise 6-33. 

(a) Do these data support the claim that the mean overall dis- 
tance for this brand of ball does not exceed 270 yards? 
Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) Do these data appear to be well modeled by a normal dis- 
tribution? Use a formal goodness of fit test in answering 
this question. 
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9-139. Consider the baseball coefficient of restitution data 

in Exercise 8-92. If the mean coefficient of restitution exceeds 

0.635, the population of balls from which the sample has been 

taken will be too “lively” and considered unacceptable for play. 

(a) Formulate an appropriate hypothesis testing procedure to 
answer this question. 

(b) Test these hypotheses and draw conclusions, using 
a = 0.01. 

(c) Find the P-value for this test. 

(d) In Exercise 8-92(b), you found a 99% confidence interval 
on the mean coefficient of restitution. Does this interval, 
or a one-sided CI, provide additional useful information to 
the decision maker? Explain why or why not. 

9-140. Consider the dissolved oxygen data in Exercise 

8-94. Water quality engineers are interested in knowing 

whether these data support a claim that mean dissolved oxy- 

gen concentration is 2.5 milligrams per liter. 

(a) Formulate an appropriate hypothesis testing procedure to 
investigate this claim. 

(b) Test these hypotheses and draw conclusions, using a = 0.05. 

(c) Find the P-value for this test. 

(d) In Exercise 8-94(b) you found a 95% CI on the mean dis- 
solved oxygen concentration. Does this interval provide 
useful additional information beyond that of the hypothe- 
sis testing results? Explain your answer. 

9-141. An article in Food Testing and Analysis (“Improving 

Reproducibility of Refractometry Measurements of Fruit 

Juices” (1999, Vol. 4, No. 4, pp. 13-17)] measured the sugar 

concentration (Brix) in clear apple juice. All readings were 

taken at 20°C: 


11.48 11.45 11.48 11.47 11.48 
11.50 11.42 11.49 11.45 11.44 
11.45 11.47 11.46 11.47 11.43 
11.50 11.49 11.45 11.46 11.47 


(a) Test the hypothesis Hp: wp = 11.5 versus Hy: p # 11.5 
using a = 0.05. Find the P-value. 

(b) Compute the power of the test if the true mean is 11.4. 

(c) What sample size would be required to detect a true mean 
sugar concentration of 11.45 if we wanted the power of the 
test to be at least 0.9? 

(d) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
sugar concentration. 

(e) Is there evidence to support the assumption that the sugar 
concentration is normally distributed? 


9-142. Consider the computer output below 


Test and Cl for One Proportion 


Test of p = 0.25 vs p < 0.25 


xX N Bound Z-Value P-Value 
53 225 0.282088 ? ? 


Sample p 
0.235556 


Using the normal approximation. 

(a) Fill in the missing information. 

(b) What are your conclusions if a = 0.05? 

(c) The normal approximation to the binomial was used here. 
Was that appropriate? 

(d) Find a 95% upper-confidence bound on the true proportion. 

(e) What are the P-value and your conclusions if the alterna- 
tive hypothesis is Hy: p # 0.25? 

9-143. An article in Food Chemistry [“A Study of Factors 

Affecting Extraction of Peanut (Arachis Hypgaea L.) Solids 

with Water’ (1991, Vol. 42, No. 2, pp. 153-165)] found the 

percent protein extracted from peanut milk as follows: 


78.3 771 713 845 87.8 75.7 64.8 72.5 
78.2 91.2 862 80.9 82.1 89.3 894 81.6 


(a) Can you support a claim that mean percent protein ex- 
tracted exceeds 80 percent? Use a = 0.05. 

(b) Is there evidence that percent protein extracted is nor- 
mally distributed? 

(c) What is the P-value of the test statistic computed in part (a)? 

9-144. An article in Biological Trace Element Research 

(“Interaction of Dietary Calcium, Manganese, and Manganese 

Source (Mn Oxide or Mn Methionine Complex) or Chick 

Performance and Manganese Utilization” (1991, Vol. 29, 

No. 3, pp. 217—228)] showed the following results of tissue 

assay for liver manganese (ppm) in chicks fed high-Ca diets. 


6.02 6.08 7.11 3:73 3:32: 7.10 
5.29 5.84 6.03 5.99 4.53 6.81 


(a) Test the hypothesis Hy: 0? = 0.6 versus H,: 07 # 0.6 
using a = 0.01. 

(b) What is the P-value for this test? 

(c) Discuss how part (a) could be answered by constructing a 
99% two-sided confidence interval for o. 

9-145. An article in Experimental Brain Research 

(“Synapses in the Granule Cell Layer of the Rat Dentate 

Gyrus: Serial-Sectionin Study” (1996, Vol. 112, No. 2, pp. 

237-243)] showed the ratio between the numbers of sym- 

metrical and total synapses on somata and azon initial 

segments of reconstructed granule cells in the dentate gyrus 

of a 12-week-old rat: 


0.65 0.90 0.78 0.94 0.40 0.94 
0.91 0.86 0.53 0.84 0.42 0.50 
0.50 0.68 1.00 0.57 1.00 1.00 


0.84 0.9 0.91 0.92 0.96 
0.96 0.56 0.67 0.96 0.52 
0.89 0.60 0.54 


(a) Use the data to test Ho: o? = 0.02 versus H,: 0? # 0.02 
using a = 0.05. 
(b) Find the P-value for the test. 


9-146. An article in the Journal of Electronic Material 
(“Progress in CdZnTe Substrate Producibility and Critical 
Drive of IRFPA Yield Originating with CdZnTe Substrates” 
(1998, Vol. 27, No. 6, pp. 564—572)] improved the quality of 
CdZnTe substrates used to produce the HgCdTe infrared focal 
plane arrays (IRFPAs), also defined as sensor chip assemblies 
(SCAs). The cut-on wavelength (jm) on 11 wafers was mea- 
sured and is shown below: 


6.06 6.16 6.57 6.67 6.98 6.17 6.17 6.93 6.73 6.87 6.76 


(a) Is there evidence that the mean of cut-on wave length is 
not 6.50 wm? 

(b) What is the P-value for this test? 

(c) What sample size would be required to detect a true mean 
cut-on wavelength of 6.25 ym with probability 95%? 

(d) What is the type II error probability if the true mean 
cut-on wavelength is 6.95 wm? 
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9-147. Consider the fatty acid measurements for the diet 

margarine described in Exercise 8-36. 

(a) For the sample size n = 6, using a two-sided alternative 
hypothesis and a = 0.01, test Ho: o? = 1.0. 

(b) Suppose that instead of n = 6, the sample size was n = 51. 
Repeat the analysis performed in part (a) using n = 51. 

(c) Compare your answers and comment on how sample size 
affects your conclusions drawn in parts (a) and (b). 


9-148. Consider the television picture tube brightness ex- 

periment described in Exercise 8-35. 

(a) For the sample size n = 10, do the data support the 
claim that the standard deviation of current is less than 
20 microamps? 

(b) Suppose that instead of n = 10, the sample size was 51. 
Repeat the analysis performed in part (a) using n = 51. 

(c) Compare your answers and comment on how sample size 
affects your conclusions drawn in parts (a) and (b). 


MIND-EXPANDING EXERCISES 


9-149. Suppose that we wish to test Hp: ~ = po versus 
H,: w # (Wo, where the population is normal with known 
o. Let 0 < € < q, and define the critical region so that 
we will reject Hp if z) > z, or if 77 < —Zy_¢, where Zy is 
the value of the usual test statistic for these hypotheses. 
(a) Show that the probability of type I error for this test 
is a. 
(b) Suppose that the true mean is wp; = po + 5. Derive 
an expression for 8 for the above test. 


9-150. Derive an expression for B for the test on the 
variance of a normal distribution. Assume that the two- 
sided alternative is specified. 

9-151. When X,, X5,..., X, are independent Poisson 
random variables, each with parameter X, and n is large, 
the sample mean X has an approximate normal distribu- 
tion with mean X and variance A/n. Therefore, 


i 


VA/n 


has approximately a standard normal distribution. Thus 
we can test Hy: \ = No by replacing A in Z by Ny. When X; 
are Poisson variables, this test is preferable to the large- 
sample test of Section 9-2.3, which would use S/Vn in 
the denominator, because it is designed just for the 


Poisson distribution. Suppose that the number of open cir- 
cuits on a semiconductor wafer has a Poisson distribution. 
Test data for 500 wafers indicate a total of 1038 opens. 
Using a = 0.05, does this suggest that the mean number 
of open circuits per wafer exceeds 2.0? 


9-152. When X,, X),..., X, is arandom sample from 
a normal distribution and n is large, the sample stan- 
dard deviation has approximately a normal distribu- 
tion with mean o and variance o”/(2n). Therefore, a 
large-sample test for Hy: © = op can be based on the 
statistic 


S— 09 


V 04/(2n) 


(a) Use this result to test Hy: o = 10 versus H,: o < 10 
for the golf ball overall distance data in Exercise 
6-33. 

(b) Find an approximately unbiased estimator of the 95 
percentile 6 = « + 1.6450. From the fact that X 
and S are independent random variables, find the 
standard error of the estimator of 8. How would you 
estimate the standard error? 

(c) Consider the golf ball overall distance data in 
Exercise 6-33. We wish to investigate a claim that 
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MIND-EXPANDING EXERCISES 


the 95 percentile of overall distance does not exceed 
285 yards. Construct a test statistic that can be used 
for testing the appropriate hypotheses. Apply this 
procedure to the data from Exercise 6-33. What are 


your conclusions? 
9-153. Let X,, X,.. 


., X, be a sample from an exponen- 


tial distribution with parameter \. It can be shown that 


IMPORTANT TERMS AND CONCEPTS 


2d > ,X; has a chi-square distribution with 2n degrees 
of freedom. Use this fact to devise a test statistic and 
critical region for Hj: \ = Xo versus the three usual 


alternatives. 


aand B 

Connection between 
hypothesis tests and 
confidence intervals 

Critical region for a test 
statistic 

Goodness of fit test 

Homogeneity test 

Hypothesis test 

Inference 

Independence test 


Nonparametric or 
distribution free 
methods 

Normal approximation 
to nonparametric 
tests 

Null distribution 

Null hypothesis 

One- and two-sided 
alternative hypotheses 


Operating characteristic 
(OC) curves 

Power of a test 

P-value 

Ranks 

Reference distribution 
for a test statistic 

Sample size determina- 
tion for hypothesis 
tests 


Significance level of 
a test 

Sign test 

Statistical hypotheses 

Statistical versus practi- 
cal significance 

Test statistic 

Type I and type II errors 

Wilcoxon signed-rank 
test 
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Statistical Inference for Two Samples 


The safety of drinking water is a serious public health issue. An article appeared in the Arizona 
Republic on May 27, 2001, that reported on arsenic contamination in the water sampled from 
10 communities in the metropolitan Phoenix area and 10 communities from rural Arizona. 
The data showed dramatic differences in the arsenic concentration, ranging from 3 parts 
per billion (ppb) to 48 ppb. There are some important questions suggested by this article. Is 
there a real difference between the arsenic concentrations in the Phoenix area and in the rural 
communities in Arizona? How large is this difference? Is it large enough to require action 
on the part of the public health service and other state agencies to correct the problem? Are the 
levels of reported arsenic concentration large enough to constitute a public health risk? 

Some of these questions can be answered by statistical methods. If we think of the met- 
ropolitan Phoenix communities as one population and the rural Arizona communities as a 
second population, we could determine whether there is a statistically significant differ- 
ence in the mean arsenic concentration between the two populations by testing the hypoth- 
esis that the two means, say, (4; and ps, are different. This is a relatively simple extension 
to two samples of the one-sample hypothesis testing procedures of Chapter 9. We could 
also use a confidence interval to estimate the difference in the two means, say, , — p>. 

The arsenic concentration problem is very typical of many problems in engineering 
and science that involve statistics. Some of the questions can be answered by the applica- 
tion of appropriate statistical tools, while other questions require using engineering or 
scientific knowledge and expertise to answer satisfactorily. 


CHAPTER OUTLINE 

10-1 INFERENCE ON THE DIFFERENCE 10-1.2 Type II Error and Choice of 
IN MEANS OF TWO NORMAL Sample Size 
DISTRIBUTIONS, VARIANCES 10-1.3 Confidence Interval on the 
KNOWN 


Difference in Means, 
10-1.1 Hypothesis Tests on the Variances Known 
Difference in Means, 
Variances Known 
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10-2. INFERENCE ON THE DIFFERENCE 10-5.1 F Distribution 
IN MEANS OF TWO NORMAL 10-5.2 Hypothesis Tests on the Ratio 
DISTRIBUTIONS, VARIANCES : 
of Two Variances 
ee 10-5.3 Type ILE d Choice of 
10-2.1 Hypothesis Tests on the _ esc je a [ oe 
Difference in Means, Variances P 
Unknown 10-5.4 Confidence Interval on the 
10-2.2 Type II Error and Choice of Iatibion Iwo Vanances 
Sample Size 10-6 INFERENCE ON TWO 
10-2.3 Confidence Interval on the POPULATION PROPORTIONS 
Difference in Means, Variances 10-6.1 Large-Sample Tests on the 
Unknown Difference in Population 
10-3 A NONPARAMETRIC TEST ON Proportions 
THE DIFFERENCE IN TWO MEANS 10-6.2 Type II Error and Choice of 
10-3.1 Description of the Wilcoxon Sample Size 
Rank-Sum Test 10-6.3 Confidence Interval on the 
10-3.2 Large-Sample Approximation Difference in Population 
10-3.3 Comparison to the t-Test Proportions 
10-4 PAIRED t-TEST 10-7 SUMMARY TABLE AND ROADMAP 
10-5 INFERENCE ON THE VARIANCES FOE DEER ENCE PR eDURES 
OF TWO NORMAL DISTRIBUTIONS EOE TOS a MEL 
LEARNING OBJECTIVES 
After careful study of this chapter you should be able to do the following: 
1. Structure comparative experiments involving two samples as hypothesis tests 
2. Test hypotheses and construct confidence intervals on the difference in means of two normal 
distributions 
3. Test hypotheses and construct confidence intervals on the ratio of the variances or standard 
deviations of two normal distributions 
4. Test hypotheses and construct confidence intervals on the difference in two population proportions 
5. Use the P-value approach for making decisions in hypotheses tests 
6. Compute power, type II error probability, and make sample size decisions for two-sample tests on 


means, variances, and proportions 


. Explain and use the relationship between confidence intervals and hypothesis tests 


10-1 INFERENCE ON THE DIFFERENCE IN MEANS OF TWO 
NORMAL DISTRIBUTIONS, VARIANCES KNOWN 


The previous two chapters presented hypothesis tests and confidence intervals for a single 
population parameter (the mean yp, the variance o7, or a proportion p). This chapter extends 
those results to the case of two independent populations. 


The general situation is shown in Fig. 10-1. Population 1 has mean 2; and variance or, 


while population 2 has mean 1» and variance ws. Inferences will be based on two random 
samples of sizes n, and ny, respectively. That is, X;,, X12, p, Xin, is a random sample of n, 
observations from population 1, and X,, X39, p, Xz, is a random sample of n, observa- 
tions from population 2. Most of the practical applications of the procedures in this chapter 
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Figure 10-1 Two 


independent popula- 
tions. 
Assumptions 
for 
Two-Sample 
Inference 


Population 1 Population 2 
2 2 
Py oy 
ic ie 
Sample 1: Sample 2: 
X11 XQ Lng Xa11 X291+++1 Xony 


arise in the context of simple comparative experiments in which the objective is to study 
the difference in the parameters of the two populations. 

Engineers and scientists are often interested in comparing two different conditions to de- 
termine whether either condition produces a significant effect on the response that is observed. 
These conditions are sometimes called treatments. Example 10-1 illustrates such an experi- 
ment; the two different treatments are two paint formulations, and the response is the dry- 
ing time. The purpose of the study is to determine whether the new formulation results in a 
significant effect—reducing drying time. In this situation, the product developer (the experi- 
menter) randomly assigned 10 test specimens to one formulation and 10 test specimens to the 
other formulation. Then the paints were applied to the test specimens in random order until all 
20 specimens were painted. This is an example of a completely randomized experiment. 

When statistical significance is observed in a randomized experiment, the experimenter can 
be confident in the conclusion that it was the difference in treatments that resulted in the differ- 
ence in response. That is, we can be confident that a cause-and-effect relationship has been found. 

Sometimes the objects to be used in the comparison are not assigned at random to the 
treatments. For example, the September 1992 issue of Circulation (a medical journal 
published by the American Heart Association) reports a study linking high iron levels in the 
body with increased risk of heart attack. The study, done in Finland, tracked 1931 men for five 
years and showed a statistically significant effect of increasing iron levels on the incidence of 
heart attacks. In this study, the comparison was not performed by randomly selecting a sample 
of men and then assigning some to a “low iron level” treatment and the others to a “high iron 
level” treatment. The researchers just tracked the subjects over time. Recall from Chapter | 
that this type of study is called an observational study. 

It is difficult to identify causality in observational studies, because the observed statisti- 
cally significant difference in response between the two groups may be due to some other 
underlying factor (or group of factors) that was not equalized by randomization and not due to 
the treatments. For example, the difference in heart attack risk could be attributable to the dif- 
ference in iron levels, or to other underlying factors that form a reasonable explanation for the 
observed results—such as cholesterol levels or hypertension. 

In this section we consider statistical inferences on the difference in means p, — py. of 
two normal distributions, where the variances o7 and 03 are known. The assumptions for this 
section are summarized as follows. 


1. Xj), Xj),---, Xjn,i8 a random sample from population 1. 

2. Xi, X29,..., Xon, is arandom sample from population 2. 

3. The two populations represented by X, and X, are independent. 
4. Both populations are normal. 
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A logical point estimator of 1, — | is the difference in sample means X, — X>. Based 
on the properties of expected values, 


E(X, — Xz) = E(X) — E(X2) = pi — be 


and the variance of X; — X3 is 


oT 3 
V(X, — X2) = V(X1) + VX2) = mt Mm 


Based on the assumptions and the preceding results, we may state the following. 


The quantity 
X= X= (bi — ba) 
Z= = 5 (10-1) 
i, © 
m Ny 


has a N(0, 1) distribution. 


This result will be used to form tests of hypotheses and confidence intervals on pw, — po. 


Essentially, we may think of 4; — pf. as a parameter 0, and its estimator is © = X, — X, 
with variance or = 03/n, + o5/n. If 89 is the null hypothesis value specified for 0, the test 


statistic will be (O — 89)/o@ . Notice how similar this is to the test statistic for a single mean 
used in Equation 9-8 of Chapter 9. 


10-1.1 Hypothesis Tests on the Difference in Means, Variances Known 


We now consider hypothesis testing on the difference in the means wp, — [2 of two normal 
populations. Suppose that we are interested in testing that the difference in means ww, — pW 
is equal to a specified value Ap. Thus, the null hypothesis will be stated as Hp: wy — bo = 
Ao. Obviously, in many cases, we will specify Aj = 0 so that we are testing the equality of 
two means (i.e., Hp: &, = M2). The appropriate test statistic would be found by replacing 
kt; — 2 in Equation 10-1 by Ao, and this test statistic would have a standard normal distribu- 
tion under H. That is, the standard normal distribution is the reference distribution for the 
test statistic. Suppose that the alternative hypothesis is H,: w,; — 2 ~ Ap. Now, a sample 
value of x, — X, that is considerably different from Ap is evidence that H, is true. Because 
Zo has the N(O, 1) distribution when H) is true, we would calculate the P-value as the sum of 
the probabilities beyond the test statistic value z) and —2Zg in the standard normal distribu- 
tion. That is, P = 2[1 — (| zo|)]. This is exactly what we did in the one-sample z-test of 
Section 4-4.1. If we wanted to perform a fixed-significance-level test, we would take —Z,/ 
and Z,,/2 as the boundaries of the critical region just as we did in the single-sample z-test. This 
would give a test with level of significance a. P-values or critical regions for the one-sided 
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alternatives would be determined similarly. Formally, we summarize these results in the fol- 


lowing display. 


Tests on the 
ii dacacias Null hypothesis: Ho: 1; — 2 = Ao 
Variance, rr 
est statistic: Zo = = (10-2) 
Known 05 
Ny 
Rejection Criterion For 
Alternative Hypotheses P-Value for Fixed-Level Tests 
Ay: by — Bo # Ag Probability above |zp| and 29 > Ze/2 OF Z < Za 
probability below —|z)|, 
P = 2[1 — ®(d))] 
Hy: wy — B2 > Ao Probability above Zo, in > Ex 
fe les) 
Hy: py — po < Ag Probability below Zp, iy SE, 
P= DZ) 


EXAMPLE 10-1 Paint Drying Time 
A product developer is interested in reducing the drying time 
of a primer paint. Two formulations of the paint are tested; for- 
mulation 1 is the standard chemistry, and formulation 2 has a 
new drying ingredient that should reduce the drying time. 
From experience, it is known that the standard deviation of 
drying time is 8 minutes, and this inherent variability should 
be unaffected by the addition of the new ingredient. Ten spec- 
imens are painted with formulation 1, and another 10 speci- 
mens are painted with formulation 2; the 20 specimens are 
painted in random order. The two sample average drying times 
are x; = 121 minutes and x, = 112 minutes, respectively. 
What conclusions can the product developer draw about the 
effectiveness of the new ingredient, using a = 0.05? 

We apply the seven-step procedure to this problem as 
follows: 


1. Parameter of interest: The quantity of interest is the dif- 
ference in mean drying times, 4; — [2, and Ay = 0. 
Non hypothesis: Hp: ww; — 2 = 0, or Ao: wy = p. 


Alternative hypothesis: H,: 1, > .. We want to reject 
H) if the new ingredient reduces mean drying time. 


4. Test statistic: The test statistic is 


_ x, 7X2 — 0 
Zo = 2 7 
OT | 92 
Sek cll ee 


ny nN 


where oj} = 03 = (8)? = 64 and n, =n, = 10. 
5. Reject Hy if: Reject Ho: w, = pw. if the P-value is less 
than 0.05. 


6. Computations: Since x, = 121 minutes and x, = 112 
minutes, the test statistic is 


121 — 112 252 
0 aa eye 
@ 
10 10 


7. Conclusion: Since z) = 2.52, the P-value is P = 
1 — B(2.52) = 0.0059, so we reject Hy at the a = 0.05 level 


Practical Interpretation: We conclude that adding the 
new ingredient to the paint significantly reduces the drying 
time. This is a strong conclusion. 


When the population variances are unknown, the sample variances sj and s3 can be substituted 
into the test statistic Equation 10-2 to produce a large-sample test for the difference in means. 
This procedure will also work well when the populations are not necessarily normally distrib- 
uted. However, both n, and n, should exceed 40 for this large-sample test to be valid. 
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10-1.2 Type II Error and Choice of Sample Size 


Use of Operating Characteristic Curves 

The operating characteristic curves (OC) in Appendix Charts VIla, VIIb, VIIc, and Vd may 
be used to evaluate the type II error probability for the hypotheses in the display (10-2). These 
curves are also useful in determining sample size. Curves are provided for a = 0.05 anda = 
0.01. For the two-sided alternative hypothesis, the abscissa scale of the operating characteris- 
tic curve in charts VIIa and VIIb is d, where 


I — By — Aol |A — Aol 
Vo? + 6 Vo? + 6 


d 


(10-3) 


and one must choose equal sample sizes, say, n = n,; = n3. The one-sided alternative hypothe- 
ses require the use of Charts VIIc and VIlId. For the one-sided alternatives H,: w, — 2 > Ay or 
Hy: by — by < Ao, the abscissa scale is also given by 


Ii — 2 — Aol |A — Ao| 
Vo? + 0 Vo? + of 


d 


It is not unusual to encounter problems where the costs of collecting data differ substantially 
between the two populations, or where one population variance is much greater than the other. 
In those cases, we often use unequal sample sizes. If n; # np, the operating characteristic curves 
may be entered with an equivalent value of n computed from 


oi + 03 


25 10-4 
n= Fn, + om, > 


If n, # ny, and their values are fixed in advance, Equation 10-4 is used directly to calculate n, 
and the operating characteristic curves are entered with a specified d to obtain B. If we are 
given d and it is necessary to determine 7, and n, to obtain a specified B, say, B*, we guess at 
trial values of n, and n5, calculate n in Equation 10-4, and enter the curves with the specified 
value of d to find B. If 8 = 8%, the trial values of n, and ny are satisfactory. If B # B*, 
adjustments to n, and n, are made and the process is repeated. 


EXAMPLE 10-2 Paint Drying Time, Sample Size from OC Curves 


Consider the paint drying time experiment from Example 10-1. = [Pi — Bo] _ 10 = 0.88 

If the true difference in mean drying times is as much as 10 min- V a? + 03 V/32 + 82 : 

utes, find the sample sizes required to detect this difference 

with probability at least 0.90. and since the detection probability or power of the test must be 
The appropriate value of the abscissa parameter is (since at least 0.9, with a = 0.05, we find from Appendix Chart VIIc 

Ay = 0, and A = 10) thatn =n, =n) = 11. 


Sample Size Formulas 

It is also possible to obtain formulas for calculating the sample sizes directly. Suppose 
that the null hypothesis Hp: w, — pb. = Ag is false and that the true difference in means is 
kL, — bo = A, where A > Aj. One may find formulas for the sample size required to obtain 
a specific value of the type II error probability B for a given difference in means A and level 
of significance a. 
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For example, we first write the expression for the B-error for the two-sided alternative, 


which is 


B= Za/2 ~ 


a A— Ao 

3 Zo/2 ~ 5} 2 
07 Oj 07 
Ny ny Ny 


The derivation for sample size closely follows the single-sample case in Section 9-2.2. 


Sample Size for 
a Two-Sided 
Test on the 
Difference in 
Means with 

ny = Ny, 
Variances 
Known 


1 — Bis 


For the two-sided alternative hypothesis with significance level a, the sample size 
nN, = Ny = n required to detect a true difference in means of A with power at least 


(Zayo + 2p)*(o7 + 03) 


(A = Ay? a 


This approximation is valid when ®(—z,/. — (A — Ao) Vn/Vo? + 3) is small compared to B. 


Sample Size for 
a One-Sided 
Test on the 
Difference in 
Means with 

ny = Ny, 
Variances 
Known 


at least 1 — B is 


For a one-sided alternative hypothesis with significance level a, the sample size 
n,; = nN) = n required to detect a true difference in means of A(#Ao) with power 


(Z. + 2g)°(of + 05) 


(A- AF ae 


where A is the true difference in means of interest. Then by following a procedure similar 
to that used to obtain Equation 9-17, the expression for B can be obtained for the case 


where 1 = ny = No. 


EXAMPLE 10-3 Paint Drying Time Sample Size 
To illustrate the use of these sample size equations, consider 
the situation described in Example 10-1, and suppose that if 
the true difference in drying times is as much as 10 minutes, 
we want to detect this with probability at least 0.90. Under the 
null hypothesis, Aj = 0. We have a one-sided alternative hy- 
pothesis with A = 10, a = 0.05 (so zy = 295 = 1.645), and 
since the power is 0.9, B = 0.10 (so zg = 219 = 1.28). 
Therefore, we may find the required sample size from 
Equation 10-6 as follows: 


(Z_ + zg)°(ot + 03) 
(A= Ao)? 


(1.645 + 1.28)°[(8)? + (8)7] 
(10 — 0)? 


=11 


This is exactly the same as the result obtained from using the 
OC curves. 


10-1.3 Confidence Interval on the Difference in Means, 


Variances Known 


The 100(1 — «)% confidence interval on the difference in two means 4, — ~ when the vari- 
ances are known can be found directly from results given previously in this section. Recall 
that X,), Xjo,--., Xj,,i8 a random sample of n, observations from the first population and X>,, 
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Confidence 
Interval on the 
Difference 

in Means, 
Variances 
Known 


Xy9,.-+, Xn, is arandom sample of, observations from the second population. The difference 
in sample means X, — X> is a point estimator of ~, — pi, and 


X — X) — (p41 — p2) 


L= 


oT , 9 
= + a 
ny Ng 


has a standard normal distribution if the two populations are normal or is approximately stan- 
dard normal if the conditions of the central limit theorem apply, respectively. This implies that 
P(-Zq/2 =Z= Za/2) =1- Q, Or 


P| —2a/2 = 7 . = Za? La 
Oo} 02 
ny ny 


This can be rearranged as 


ee o] 0 = 2 oj 
PX — Xo — Zaynl gs + py = Bi Be SX — X02 t+ Zayanl ay t ny 


Therefore, the 100(1 — «)% confidence interval for j1; — 12 is defined as follows. 


If x; and x, are the means of independent random samples of sizes n, and n, from 
two independent normal populations with known variances oj and 0%, respectively, 
a 1001 — a)% confidence interval for jx, — p is 


oh oT | 9 2 ee oT | 9 
ty %2 = evi a gy, ME (Me At 2 cop * , UO) 


where Z,/2 is the upper a /2 percentage point of the standard normal distribution. 


The confidence level 1 — a is exact when the populations are normal. For nonnormal popu- 
lations, the confidence level is approximately valid for large sample sizes. 


EXAMPLE 10-4 Aluminum Tensile Strength 


Tensile strength tests were performed on two different grades i oo 60 

of aluminum spars used in manufacturing the wing of a com- *1~ ¥2 ~ Za/2\I ny 1 Sp — Pe 

mercial transport aircraft. From past experience with the spar 5 5 
manufacturing process and the testing procedure, the standard a [Oy 05 
deviations of tensile strengths are assumed to be known. The oN ny Me 
data obtained are as follows: n,; = 10, x, = 87.6, o, = 1, ay? (sy? 

Ny = 12, X) = 74.5, and op = 1.5. If w; and po denote the 87.6 — 74.5 — 1.645 10 | DD = py — py 


true mean tensile strengths for the two grades of spars, we may 


2 
find a 90% confidence interval on the difference in mean (1?) 


(1.5) 
= 87.6 — 74.5 + 1645/75 + 4 5— 


strength 1, — p» as follows: 0 12 
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Therefore, the 90% confidence interval on the difference in 
mean tensile strength (in kilograms per square millimeter) 
is 


12.22 S wp, — py = 13.98 (in kilograms per square 
millimeter) 


Choice of Sample Size 


359 


Practical Interpretation: Notice that the confidence inter- 
val does not include zero, implying that the mean strength of 
aluminum grade 1 (w,) exceeds the mean strength of 
aluminum grade 2 (12). In fact, we can state that we are 90% 
confident that the mean tensile strength of aluminum grade 1 
exceeds that of aluminum grade 2 by between 12.22 and 13.98 
kilograms per square millimeter. 


If the standard deviations o, and o, are known (at least approximately) and the two sample 
sizes n, and n, are equal (n; = n, =n, say), we can determine the sample size required so that 
the error in estimating 4, — 42 by x; — x2 will be less than E at 100(1 — a)% confidence. The 
required sample size from each population is 


Sample Size for a 
Confidence Interval 
on the Difference in 

Means, Variances 


Za/2 \2 
n=(=4) (oi + 03) 


(10-8) 


Known 


Remember to round up if 7 is not an integer. This will ensure that the level of confidence does 


not drop below 100(1 — a)%. 


One-Sided Confidence Bounds 


One-sided confidence bounds on 1, — 2 may also be obtained. A 100(1 — a)% upper- 


confidence bound on ju, — po is 


One-Sided 
Upper z of 03 
Confidence eS Po = XS Xo oe Ze ny aie a (10-9) 
Bound 
and a 100(1 — a)% lower-confidence bound is 
One-Sided 
Lower _ = o7 03 
Confidence M1 — %2~ Za\ln, + ny = py — M2 (10-10) 
Bound 


EXERCISES FOR SECTION 10-1 


10-1. Consider the hypothesis test Ho: w; = w2 against 

A: py # po with known variances o, = 10 and o, = S. 

Suppose that sample sizes nj = 10 and nz = 15 and that 

x; = 4.7 and x, = 7.8. Use a = 0.05. 

(a) Test the hypothesis and find the P-value. 

(b) Explain how the test could be conducted with a confi- 
dence interval. 

(c) What is the power of the test in part (a) for a true differ- 
ence in means of 3? 

(d) Assuming equal sample sizes, what sample size should be 
used to obtain B = 0.05 if the true difference in means is 3? 
Assume that a = 0.05. 


10-2. Consider the hypothesis test Ho: wy = w2 against 

A: 4) < 2 with known variances a; = 10 and oy = S. 

Suppose that sample sizes nj = 10 and nz = 15 and that 

X, = 14.2 and xX, = 19.7. Use a = 0.05. 

(a) Test the hypothesis and find the P-value. 

(b) Explain how the test could be conducted with a confi- 
dence interval. 

(c) What is the power of the test in part (a) if 4; is 4 units less 
than pro? 

(d) Assuming equal sample sizes, what sample size should be 
used to obtain B = 0.05 if pw, is 4 units less than py? 
Assume that a = 0.05. 
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10-3. Consider the hypothesis test Ho: w,; = pw. against 
Hy: > ~2 with known variances a, = 10 and oy = S. 
Suppose that sample sizes nj = 10 and nz = 15 and that 
X, = 24.5 and x, = 21.3. Usea = 0.01. 

(a) Test the hypothesis and find the P-value. 

(b) Explain how the test could be conducted with a confi- 
dence interval. 

(c) What is the power of the test in part (a) if w, is 2 units 
greater than jz? 

(d) Assuming equal sample sizes, what sample size should be 
used to obtain B = 0.05 if w, is 2 units greater than py? 
Assume that a = 0.05. 

10-4. Two machines are used for filling plastic bottles with 

a net volume of 16.0 ounces. The fill volume can be assumed 

normal, with standard deviation o, = 0.020 and o, = 0.025 

ounces. A member of the quality engineering staff suspects 

that both machines fill to the same mean net volume, whether 
or not this volume is 16.0 ounces. A random sample of 10 bot- 
tles is taken from the output of each machine. 


Machine 1 Machine 2 
16.03 16.01 16.02 16.03 
16.04 15.96 15.97 16.04 
16.05 15.98 15.96 16.02 
16.05 16.02 16.01 16.01 
16.02 15.99 15.99 16.00 


(a) Do you think the engineer is correct? Use a = 0.05. What 
is the P-value for this test? 

(b) Calculate a 95% confidence interval on the difference in 
means. Provide a practical interpretation of this interval. 

(c) What is the power of the test in part (a) for a true differ- 
ence in means of 0.04? 

(d) Assuming equal sample sizes, what sample size should be 
used to assure that 8B = 0.05 if the true difference in 
means is 0.04? Assume that a = 0.05. 


10-5. Two types of plastic are suitable for use by an elec- 

tronics component manufacturer. The breaking strength of this 

plastic is important. It is known that o, = o, = 1.0 psi. From 

a random sample of size n, = 10 and n, = 12, we obtain 

X, = 162.5 and x, = 155.0. The company will not adopt plas- 

tic 1 unless its mean breaking strength exceeds that of plastic 

2 by at least 10 psi. 

(a) Based on the sample information, should it use plastic 1? 
Use a = 0.05 in reaching a decision. Find the P-value. 

(b) Calculate a 95% confidence interval on the difference in 
means. Suppose that the true difference in means is really 
12 psi. 

(c) Find the power of the test assuming that a = 0.05. 

(d) If it is really important to detect a difference of 12 psi, are 
the sample sizes employed in part (a) adequate, in your 
opinion? 


10-6. The burning rates of two different solid-fuel propel- 
lants used in aircrew escape systems are being studied. It is 
known that both propellants have approximately the same 
standard deviation of burning rate; that is 0; = 0, = 3 
centimeters per second. Two random samples of n, = 20 and 
Ny = 20 specimens are tested; the sample mean burning rates 
are x, = 18 centimeters per second and x, = 24 centimeters 
per second. 
(a) Test the hypothesis that both propellants have the same 
mean burning rate. Use a = 0.05. What is the P-value? 
(b) Construct a 95% confidence interval on the difference in 
means }t; — >. What is the practical meaning of this 
interval? 


(c) What is the B-error of the test in part (a) if the true differ- 
ence in mean burning rate is 2.5 centimeters per second? 
(d) Assuming equal sample sizes, what sample size is needed to 
obtain power of 0.9 at a true difference in means of 14 cm/s? 


10-7. Two different formulations of an oxygenated motor 
fuel are being tested to study their road octane numbers. The 
variance of road octane number for formulation 1 is 07 = 

1.5, and for formulation 2 it is of = 1.2. Two random sam- 

ples of size n, = 15 and n, = 20 are tested, and the mean road 

octane numbers observed are x, = 89.6 and x, = 92.5. 

Assume normality. 

(a) If formulation 2 produces a higher road octane number 
than formulation 1, the manufacturer would like to detect 
it. Formulate and test an appropriate hypothesis, using 
a = 0.05. What is the P-value? 

(b) Explain how the question in part (a) could be answered 
with a 95% confidence interval on the difference in mean 
road octane number. 

(c) What sample size would be required in each population 
if we wanted to be 95% confident that the error in esti- 
mating the difference in mean road octane number is less 
than 1? 


10-8. A polymer is manufactured in a batch chemical 
process. Viscosity measurements are normally made on each 
batch, and long experience with the process has indicated that 
the variability in the process is fairly stable with o = 20. 
Fifteen batch viscosity measurements are given as follows: 


724, 718, 776, 760, 745, 759, 795, 756, 742, 740, 761, 749, 
739, 747, 742 


A process change is made which involves switching the type 
of catalyst used in the process. Following the process change, 
eight batch viscosity measurements are taken: 


735, 775, 729, 755, 783, 760, 738, 780 


Assume that process variability is unaffected by the catalyst 

change. If the difference in mean batch viscosity is 10 or less, 

the manufacturer would like to detect it with a high probability. 

(a) Formulate and test an appropriate hypothesis using a = 
0.10. What are your conclusions? Find the P-value. 
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(b) Find a 90% confidence interval on the difference in mean 
batch viscosity resulting from the process change. 

(c) Compare the results of parts (a) and (b) and discuss your 
findings. 

10-9. The concentration of active ingredient in a liquid 

laundry detergent is thought to be affected by the type of 

catalyst used in the process. The standard deviation of active 

concentration is known to be 3 grams per liter, regardless of 

the catalyst type. Ten observations on concentration are taken 

with each catalyst, and the data follow: 


Catalyst 1: 57.9, 66.2, 65.4, 65.4, 65.2, 62.6, 67.6, 63.7, 
67.2, 71.0 

Catalyst 2: 66.4, 71.7, 70.3, 69.3, 64.8, 69.6, 68.6, 69.4, 65.3, 
68.8 


(a) 


(b) 


(c) 


(d) 


Find a 95% confidence interval on the difference in mean 
active concentrations for the two catalysts. Find the 
P-value. 

Is there any evidence to indicate that the mean active con- 
centrations depend on the choice of catalyst? Base your 
answer on the results of part (a). 

Suppose that the true mean difference in active concentra- 
tion is 5 grams per liter. What is the power of the test to 
detect this difference if « = 0.05? 

If this difference of 5 grams per liter is really important, 
do you consider the sample sizes used by the experimenter 
to be adequate? Does the assumption of normality seem 
reasonable for both samples? 


10-2. INFERENCE ON THE DIFFERENCE IN MEANS OF TWO 
NORMAL DISTRIBUTIONS, VARIANCES UNKNOWN 


We now extend the results of the previous section to the difference in means of the two dis- 
tributions in Fig. 10-1 when the variances of both distributions 07 and 03 are unknown. If the 
sample sizes n, and n, exceed 40, the normal distribution procedures in Section 10-1 could be 
used. However, when small samples are taken, we will assume that the populations are normally 
distributed and base our hypotheses tests and confidence intervals on the ¢ distribution. This 
nicely parallels the case of inference on the mean of a single sample with unknown variance. 


10-2.1 Hypotheses Tests on the Difference in Means, Variances Unknown 


We now consider tests of hypotheses on the difference in means jw, — 2 of two normal 
distributions where the variances oj and 0% are unknown. A t-statistic will be used to test these 
hypotheses. As noted above and in Section 9-3, the normality assumption is required to 
develop the test procedure, but moderate departures from normality do not adversely affect 
the procedure. Two different situations must be treated. In the first case, we assume that the 
variances of the two normal distributions are unknown but equal; that is, of = 0% = o”. In the 
second, we assume that of and o3 are unknown and not necessarily equal. 


Case 1: 0? = 07 =o? 


Suppose we have two independent normal populations with unknown means j, and 2, and 


unknown but equal variances, oj = 03 = 07. We wish to test 


Ho: by — Bo = Ao 


Hy: py — pe # Ao (10-11) 
Let X11, X12, -.., Xin, be a random sample of n, observations from the first population and 
X31, Xy,-.., X2,, be a random sample of n. observations from the second population. 


Let Xj, X, S7, and S3 be the sample means and sample variances, respectively. Now the ex- 
pected value of the difference in sample means X, — X) is E(X; — Xz) = 1 — p2, 8o 


X, — X> is an unbiased estimator of the difference in means. The variance of X, — X> is 


2 2 
2s i 4 
VR - %)= T+ T= o°( + ) 


ny Ng ny nN 
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Pooled 
Estimator of 
Variance 


It seems reasonable to combine the two sample variances S7 and $3 to form an estimator 
of o”. The pooled estimator of o is defined as follows. 


The pooled estimator of o°, denoted by S?, is defined by 
ny, — 1)Si + (nz — 1)83 
wee )Si + (m2 — 1)S3 (10-12) 
ip ny + ny — 2 


It is easy to see that the pooled estimator 2 can be written as 


n,— 1 ny — | 
1 S2 + 2. 
ny +n — 2 ny +n, — 2 


Sp= S3 = wS7 + (1 — w)S3 


where 0 < w = 1. Thus S Fe is a weighted average of the two sample variances S7 and S3, 
where the weights w and 1 — w depend on the two sample sizes n, and ny. Obviously, ifn, = 
Ny =n, w = 0.5 and Ss, is just the arithmetic average of S? and S3. If n, = 10 and n, = 20 
(say), w = 0.32 and 1 — w = 0.68. The first sample contributes n, — 1 degrees of freedom 
to 5; and the second sample contributes n, — 1 degrees of freedom. Therefore, SF has 
n, + n, — 2 degrees of freedom. 

Now we know that 


has a N(O, 1) distribution. Replacing o by S, gives the following. 


Given the assumptions of this section, the quantity 


pm XLT 2 = (= by) 


(10-13) 


has at distribution with n, + n, — 2 degrees of freedom. 


The use of this information to test the hypotheses in Equation 10-11 is now straightfor- 
ward: Simply replace 4, — [2 by Ag, and the resulting test statistic has a f distribution with 
n, + ny, — 2 degrees of freedom under Hp: 1, — p2 = Ao. Therefore, the reference distribu- 
tion for the test statistic is the ¢ distribution withn, + n, — 2 degrees of freedom. The calcula- 
tion of P-values and the location of the critical region for fixed-significance-level testing for both 
two- and one-sided alternatives parallels those in the one-sample case. Because a pooled esti- 
mate of variance is used, the procedure is often called the pooled f-test. 
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Tests on the 
Difference in 
Means of Two 
Normal 
Distributions, 
Variances 
Unknown and 
Equal* 


Null hypothesis: 


Test statistic: 


Alternative Hypothesis 


_%1 —X— A 


Ao: By — Bo = Ao 


(10-14) 
1 1 
pas + = 
ny Ng 

Rejection Criterion 

P-Value for Fixed-Level Tests 


Ay: by — 2 # Ag 


Ay: by — Wy > Ao 
Ay: by — p< Ao 


Probability above |f,| and 


to > bu/2,n,+n9—2 or 
fo S Son tm—2 
ty > t 


any +n,—2 


probability below —|f9| 
Probability above fy 


Probability below fo to < ~lany+m—2 


EXAMPLE 10-5 Yield from a Catalyst 
Two catalysts are being analyzed to determine how they affect 
the mean yield of a chemical process. Specifically, catalyst 1 is 
currently in use, but catalyst 2 is acceptable. Since catalyst 2 is 
cheaper, it should be adopted, providing it does not change the 
process yield. A test is run in the pilot plant and results in the 
data shown in Table 10-1. Is there any difference between the 
mean yields? Use a = 0.05, and assume equal variances. 

The solution using the seven-step hypothesis-testing pro- 
cedure is as follows: 


1. Parameter of interest: The parameters of interest are 
p.; and jo, the mean process yield using catalysts 1 and 2, 
respectively, and we want to know if w,; — p2 = 0. 


4. Test statistic: The test statistic is 


Reject Hy if: Reject Ho if the P-value is less than 0.05. 


6. Computations: From Table 10-1 we have x, = 92.255, 
sy = 2.39, ny = 8,X_ = 92.733, sy = 2.98, and ny = 8. 
Therefore 


(m = Ast + (mg = 1)s3__ (7)(2.39)? + 7(2.98)° 


2 
7 nytn—2 


2. Null hypothesis: Ho: 1; — po = 0, or Ho: py = pr Cee 
3. Alternative hypothesis: H,: 1, # p> 8, = V 7.30 = 2.70 
Table 10-1 Catalyst Yield Data, Example 10-5 
Observation 
Number Catalyst 1 Catalyst 2 
1 91.50 89.19 
2 94.18 90.95 
3 92.18 90.46 
4 95.39 93.21 
5 91.79 97.19 
6 89.07 97.04 
7 94.72 91.07 
8 89.21 92.75 
X, = 92.255 X_ = 92.733 
5) = 2.39 Sy = 2.98 


*While we have given the development of this procedure for the case where the sample sizes could be different, there 
is an advantage to using equal sample sizes n; = n, = n. When the sample sizes are the same from both populations, 
the f-test is more robust to the assumption of equal variances. 
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and since 0.258 < 0.35 < 0.692, we conclude that lower and 
upper bounds on the P-value are 0.50 < P < 0.80. 
X, — X 92.255 — 92.733 Therefore, since the P-value exceeds a = 0.05, the null 

= = —0.35 hypothesis cannot be rejected. 


lo = 


1 1 
2.70, [7 + 7 2.70 


1 
— + 
1 ng 8 


8 Practical Interpretation: At the 0.05 level of significance, 

we do not have strong evidence to conclude that catalyst 2 re- 

7. Conclusions: Since —2.145, from Appendix Table V we sults in a mean yield that differs from the mean yield when 
find that f 49,14 = 0.258 and t»5 4 = 0.692. Therefore, catalyst 1 is used. 


The Minitab two-sample t-test and confidence interval procedure for Example 10-5 
follows: 


Minitab Computations 


Two-Sample T-Test and CI: Cat 1, Cat 2 


Two-sample T for Cat 1 vs Cat 2 


N Mean StDev SE Mean 
Cat 1 8 92.26 2.39 0.84 
Cat 2 8 92.73 2.99 1.1 


Difference = mu Cat 1 — mu Cat 2 

Estimate for difference: —0.48 

95% CI for difference: (—3.37, 2.42) 

T-Test of difference = 0 (vs not = ): T-Value = —0.35 P-Value = 0.730 DF = 14 
Both use Pooled StDev = 2.70 


Notice that the numerical results are essentially the same as the manual computations in 
Example 10-5. The P-value is reported as P = 0.73. The two-sided CI on 2; — pL is also 
reported. We will give the computing formula for the CI in Section 10-2.3. Figure 10-2 shows 
the normal probability plot of the two samples of yield data and comparative box plots. The 
normal probability plots indicate that there is no problem with the normality assumption. 
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(a) (b) 


Figure 10-2 Normal probability plot and comparative box plot for the catalyst yield data in Example 10-5. 
(a) Normal probability plot, (b) Box plots. 
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Furthermore, both straight lines have similar slopes, providing some verification of the 
assumption of equal variances. The comparative box plots indicate that there is no obvious 
difference in the two catalysts, although catalyst 2 has slightly greater sample variability. 


Case 2: 0) # 03 


In some situations, we cannot reasonably assume that the unknown variances oj and o% are 


equal. There is not an exact t-statistic available for testing Hp: w, — po = 


Ao in this case. 


However, an approximate result can be applied. 


Case 2: Test 
Statistic for the If Ho: 41 — 2 = Ag is true, the statistic 
Difference in 
Means, Variances a 
Unknown and = 2. St (10-15) 
Not Assumed Sue 2 
Equal ms 2 


_ (st/n)° fu 


If v is not an integer, round down to the nearest integer. 


(ay (10-16) 


ny, — | 


n, — 1 


Therefore, if oO; F 03, the hypotheses on differences in the means of two normal distributions are 
tested as in the equal variances case, except that 7 is used as the test statistic and n, + n — 2 is 
replaced by v in determining the degrees of freedom for the test. 


EXAMPLE 10-6 Arsenic in Drinking Water 
Arsenic concentration in public drinking water supplies is a 
potential health risk. An article in the Arizona Republic 
(May 27, 2001) reported drinking water arsenic concentra- 
tions in parts per billion (ppb) for 10 metropolitan Phoenix 
communities and 10 communities in rural Arizona. The data 
follow: 


Rural Arizona 
(x2 = 27.5, 82. = 15.3) 


Metro Phoenix 
(x1 = 12.5, sy; = 7.63) 


Phoenix, 3 Rimrock, 48 
Chandler, 7 Goodyear, 44 

Gilbert, 25 New River, 40 
Glendale, 10 Apache Junction, 38 
Mesa, 15 Buckeye, 33 

Paradise Valley, 6 Nogales, 21 

Peoria, 12 Black Canyon City, 20 
Scottsdale, 25 Sedona, 12 

Tempe, 15 Payson, 1 


Sun City, 7 Casa Grande, 18 


We wish to determine if there is any difference in mean ar- 
senic concentrations between metropolitan Phoenix commu- 
nities and communities in rural Arizona. Figure 10-3 shows a 
normal probability plot for the two samples of arsenic concen- 
tration. The assumption of normality appears quite reasonable, 
but since the slopes of the two straight lines are very different, 
it is unlikely that the population variances are the same. 
Applying the seven-step procedure gives the following: 


1. Parameter of interest: The parameters of interest are the 
mean arsenic concentrations for the two geographic 
regions, say, 4; and po, and we are interested in deter- 
mining whether pw, — p. = 0. 

2. Non hypothesis: Hy: 1; — 2 = 0, or Hy: py = po 

3. Alternative hypothesis: H,: 1, ~ pw. 


4. Test statistic: The test statistic is 
* x, xX —0 
iy = 

7: 
ny! ny 
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Figure 10-3 Normal 5 
probability plot of the 
arsenic concentration 
data from Example 0 10 20 
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10-6. Arsenic concentration in parts per billion 


5. The degrees of freedom on fg are found from Equation 
10-16 as 


(93/ Ny)” 


(sj/m)° 
+ 


iin | ny — 1 


(7.63 (15.3)? 72 
- | 10 10 | 2 7 
[(7.63)°/10]? . [(15.3)7/107? 13.2 = 13 


9 : 9 
Therefore, using a = 0.05 and a fixed-significance-level 
test, we would reject Ho: bw, = py if > to.025,13 = 2-160 
or if t) < —fo925,13 = —2.-160. 


6. Computations: Using the sample data we find 


4 X, — X2 12.5 — 27.5 
ig —(s—a > 5 5 = -2.77 
_ 3 -— _ (15.3) 
ny My 10:10 
7. Conclusions: Because fg = —2.77 < f9.025,13 = —2.160, 


we reject the null hypothesis. 


Practical Interpretation: There is strong evidence to con- 
clude that mean arsenic concentration in the drinking water in 
rural Arizona is different from the mean arsenic concentration 
in metropolitan Phoenix drinking water. Furthermore, the mean 
arsenic concentration is higher in rural Arizona communities. 
The P-value for this test is approximately P = 0.016. 


The Minitab output for this example follows: 


Minitab Computations 


Two-Sample T-Test and CI: PHX, RuralAZ 


Two-sample T for PHX vs RuralAZ 


N Mean 
PHX 10 12.50 
RuralAZ 10 27.5 


Difference = mu PHX — mu RuralAZ 


Estimate for difference: —15.00 


95% CI for difference: (—26.71, —3.29) 
T-Test of difference = 0 (vs not = ): T-Value = —2.77 


StDev SE Mean 
7.63 2.4 
15.3 4.9 


P-Value = 0.016 DF = 13 
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The numerical results from Minitab exactly match the calculations from Example 10-6. Note 
that a two-sided 95% CI on w, — p» is also reported. We will discuss its computation in 
Section 10-2.3; however, note that the interval does not include zero. Indeed, the upper 95% 
of confidence limit is —3.29 ppb, well below zero, and the mean observed difference is 
X; — X) = 12.5 — 27.5 = —15 ppb. 


10-2.2 Type II Error and Choice of Sample Size 


The operating characteristic curves in Appendix Charts VIle, VIIf, VIlg, and VIIh are used to 
evaluate the type II error for the case where oj = 03 = o”. Unfortunately, when a7 ¥ 03, the 
distribution of 79 is unknown if the null hypothesis is false, and no operating characteristic 
curves are available for this case. 

For the two-sided alternative H,: @, — 2 = A # Ag, when of = 03 = o’ andn, =n, = 
n, Charts VIle and VIIf are used with 
_ |A-Adl 


@ 20 


(10-17) 
where A is the true difference in means that is of interest. To use these curves, they must be 
entered with the sample size n* = 2n — 1. For the one-sided alternative hypothesis, we use 
Charts VIIg and VII/ and define d and A as in Equation 10-17. It is noted that the parameter d 
is a function of o, which is unknown. As in the single-sample t-test, we may have to rely on a 
prior estimate of o or use a subjective estimate. Alternatively, we could define the differences 
in the mean that we wish to detect relative to o. 


EXAMPLE 10-7 Yield from Catalyst Sample Size 
Consider the catalyst experiment in Example 10-5. Suppose 
that, if catalyst 2 produces a mean yield that differs from the 
mean yield of catalyst 1 by 4.0%, we would like to reject 
the null hypothesis with probability at least 0.85. What 
sample size is required? 

Using s, = 2.70 as a rough estimate of the common stan- 
dard deviation o, we have d = | A|/2o = |4.0|/[(2)(2.70)] = 


0.74. From Appendix Chart VIle with d = 0.74 and B = 0.15, 
we find n° = 20, approximately. Therefore, since n’ = 2n — 1, 


"+1 2041 
i= = — = 10.5 = 11(say) 


and we would use sample sizes of n, =n. =n = 11. 


Minitab will also perform power and sample size calculations for the two-sample t-test (equal 
variances). The output from Example 10-7 is as follows: 


Minitab Computations 


Power and Sample Size 


2-Sample ¢ Test 


Testing mean 1 = mean 2 (versus not =) 


Calculating power for mean 1 = mean 2 + difference 


Alpha = 0.05 Sigma = 2.7 


Sample Target 
Difference Size Power 
4 10 0.8500 


Actual 
Power 
0.8793 


The results agree fairly closely with the results obtained from the O.C. curve. 
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10-2.3 Confidence Interval on the Difference in Means, 


Variances Unknown 


Case 1: 6} = 063 = 0” 


To develop the confidence interval for the difference in means 4, — 2 when both variances 
are equal, note that the distribution of the statistic 


(10-18) 


is the ¢ distribution with n, + n. — 2 degrees of freedom. Therefore P(—ty/2.n,+n,-2 = T = 
te /2,my-+ny—2) = | — a. Now substituting Equation 10-18 for T and manipulating the quan- 
tities inside the probability statement will lead to the 100(1 — «)% confidence interval on 


My ~ M2. 


Case 1: 
Confidence 
Interval on the 
Difference in 
Means, 
Variances 
Unknowns and 


ence in means jt; — [2 iS 


Equal = = 1 1 

1 — X2 ~ ba/2,my+m-2 5p \! ny ats iy 
< ay x ! u 
= py — Po =X — X2 + boy2,ny+n-2 Sp m + n> 


If X|,X>, st, and s3 are the sample means and variances of two random samples of 
sizes n, and n, respectively, from two independent normal populations with un- 
known but equal variances, then a 100(1 — a) % confidence interval on the differ- 


(10-19) 


P 


where s, = Vi(n, — 1)st + (nm) — 1)s5]/(n, + ny — 2) is the pooled estimate 
of the common population standard deviation, and f,/7,,,+n,—-2 18 the upper a/2 
percentage point of the ¢ distribution with n, + n. — 2 degrees of freedom. 


EXAMPLE 10-8 Cement Hydration 
An article in the journal Hazardous Waste and Hazardous 
Materials (Vol. 6, 1989) reported the results of an analysis of 
the weight of calcium in standard cement and cement doped 
with lead. Reduced levels of calcium would indicate that the 
hydration mechanism in the cement is blocked and would al- 
low water to attack various locations in the cement struc- 
ture. Ten samples of standard cement had an average weight 
percent calcium of x, = 90.0, with a sample standard devia- 
tion of s; = 5.0, while 15 samples of the lead-doped cement 
had an average weight percent calcium of x, = 87.0, with a 
sample standard deviation of s, = 4.0. 

We will assume that weight percent calcium is normally 
distributed and find a 95% confidence interval on the difference 


in means, 4; — 2, for the two types of cement. Furthermore, 
we will assume that both normal populations have the same 
standard deviation. 

The pooled estimate of the common standard deviation 
is found using Equation 10-12 as follows: 


(ny — 1)sj + (m2 — 1)83 
ny + ny — 2 


2 
Pp 


S 


9(5.0)? + 14(4.0) 
~ 10+ 15-2 
= 19.52 
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Therefore, the pooled standard deviation estimate is 
8) = V19.52 = 4.4. The 95% confidence interval is found 
using Equation 10-19: 


X, — X_ — 19,025,23 Sp ™m a m = py — pe 


1 1 
<%,-%+ — + — 
1 ~ 42 6 $0,025,235p\i nT ny 


or upon substituting the sample values and using fy 95.03 = 2.069, 


1 1 
90.0 — 87.0 — 2.069(4.4),|—5 + 7 St ~ He 


ta 
= 90.0 — 87.0 + 2.069(4.4),/75 + 75 


Case 2: 04 # 03 


In many situations it is not reasonable to assume that o7 = 


which reduces to 


Practical Interpretation: Notice that the 95% confidence 
interval includes zero; therefore, at this level of confidence we 
cannot conclude that there is a difference in the means. Put 
another way, there is no evidence that doping the cement with 
lead affected the mean weight percent of calcium; therefore, 
we cannot claim that the presence of lead affects this aspect of 
the hydration mechanism at the 95% level of confidence. 


o3. When this assumption is 


unwarranted, we may still find a 100(1 — «)% confidence interval on w, — 2 using the fact 


that T° = [xX x (hy 


i rere 


2) ]/ V S/n, + S3/np is distributed approximately as ¢ with 


degrees of freedom v given by Equation 10-16. The CI expression follows. 


Case 2: 
Approximate 
Confidence 
Interval on the 


Difference in : 
means ty — po 1S 


Means, 
Variances 5 5 5 
Unknown Are = —F al rat X,-Ht+ ee 10-20 
Say = Uh prem arlene cae = i — a t reer rlnierer E 
oi Avaaed 1 %2— bofawrl yn, > my — Ml Pet 2 * toyz, * n,  ‘ ) 
Equal 


If x}, X5, 5}, and s3 are the means and variances of two random samples of sizes n, and 
Ny, respectively, from two independent normal populations with unknown and unequal 
variances, an approximate 100(1 — a)% confidence interval on the difference in 


where v is given by Equation 10-16 and f,,,, is the upper a/2 percentage point of the 
t distribution with v degrees of freedom. 


EXERCISES FOR SECTION 10-2 


10-10. Consider the computer output below. 
Two-Sample T-Test and CI 


Sample N Mean StDev SE Mean 
1 12 10.94 1.26 0.36 
Z 16 12415 1.99 0.50 
Difference = mu (1) — mu (2) 
Estimate for difference: —1.210 
95% CI for difference: (—2.560, 0.140) 
T-Test of difference = 0 (vs not =) T-Value 


= ? P-Value = ? DF = ? 

Both use Pooled StDev = ? 

(a) Fill in the missing values. Is this a one-sided or a two-sided 
test? Use lower and upper bounds for the P-value. 

(b) What are your conclusions if a = 0.05? What if a = 0.01? 


(c) This test was done assuming that the two population vari- 
ances were equal. Does this seem reasonable? 

(d) Suppose that the hypothesis had been Ho: 1, = [2 versus 
Ho: jb) < po. What would your conclusions be if a = 0.05? 


10-11. 
Two-Sample T-Test and Cl 


Consider the computer output below. 


Sample N Mean StDev SE Mean 
1 15 54.73 2.13 0.255 
Z 20 58.64 5.28 1.2 
Difference = mu (1) — mu (2) 
Estimate for difference: —3.91 
95% upper bound for difference: ? 
T-Test of difference = O(vs <): T-Value = 


—3.00 P-Value = ? DF = ? 
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(a) Fill in the missing values. Is this a one-sided or a two-sided 
test? Use lower and upper bounds for the P-value. 

(b) What are your conclusions if a = 0.05? What if a = 0.01? 

(c) This test was done assuming that the two population vari- 
ances were different. Does this seem reasonable? 

(d) Suppose that the hypotheses had been Hy: w; = p versus 
Ho: 1, # p2. What would your conclusions be if a = 0.05? 


10-12. Consider the hypothesis test Hp: ; = po against 

HA, : 1, # po. Suppose that sample sizes are n, = 15 and 

n, = 15, that ¥, = 4.7 and x, = 7.8, and that st = 4 and 

s5 = 6.25. Assume that 07 = 0% and that the data are drawn 

from normal distributions. Use a = 0.05. 

(a) Test the hypothesis and find the P-value. 

(b) Explain how the test could be conducted with a confi- 
dence interval. 

(c) What is the power of the test in part (a) for a true differ- 
ence in means of 3? 

(d) Assuming equal sample sizes, what sample size should 
be used to obtain B = 0.05 if the true difference in means 
is —2? Assume that a = 0.05. 


10-13. Consider the hypothesis test Hp: wp; = po against 

HA, : py < po. Suppose that sample sizes n, = 15 and ny = 15, 

that x, = 6.2 and x, = 7.8, and that sj = 4 and s3 = 6.25. 

Assume that o7 = 0% and that the data are drawn from normal 

distributions. Use a = 0.05. 

(a) Test the hypothesis and find the P-value. 

(b) Explain how the test could be conducted with a confi- 
dence interval. 

(c) What is the power of the test in part (a) if j.; is 3 units less 
than >? 

(d) Assuming equal sample sizes, what sample size should be 
used to obtain B = 0.05 if p; is 2.5 units less than j1.? 
Assume that a = 0.05. 


10-14. Consider the hypothesis test Ho: w; = po against 

HA, : pp, > po. Suppose that sample sizes n, = 10 and nz = 10, 

that x, = 7.8 and x, =5.6, and that sj =4 and s3 = 9. 

Assume that 07 = o% and that the data are drawn from normal 

distributions. Use a = 0.05. 

(a) Test the hypothesis and find the P-value. 

(b) Explain how the test could be conducted with a confi- 
dence interval. 

(c) What is the power of the test in part (a) if jp, is 3 units 
greater than pty? 

(d) Assuming equal sample sizes, what sample size should be 
used to obtain 8 = 0.05 if w, is 3 units greater than py? 
Assume that a = 0.05. 


10-15. The diameter of steel rods manufactured on two dif- 
ferent extrusion machines is being investigated. Two random 
samples of sizes n, = 15 and ny = 17 are selected, and the 
sample means and sample variances are x, = 8.73, sj = 0.35, 
X> = 8.68, and s3 = 0.40, respectively. Assume that o7 = o% 
and that the data are drawn from a normal distribution. 
(a) Is there evidence to support the claim that the two ma- 
chines produce rods with different mean diameters? 
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Use a = 0.05 in arriving at this conclusion. Find the 
P-value. 

(b) Construct a 95% confidence interval for the difference in 
mean rod diameter. Interpret this interval. 


10-16. An article in Fire Technology investigated two dif- 
ferent foam expanding agents that can be used in the nozzles 
of fire-fighting spray equipment. A random sample of five ob- 
servations with an aqueous film-forming foam (AFFF) had a 
sample mean of 4.7 and a standard deviation of 0.6. A random 
sample of five observations with alcohol-type concentrates 
(ATC) had a sample mean of 6.9 and a standard deviation 0.8. 
(a) Can you draw any conclusions about differences in mean 
foam expansion? Assume that both populations are well 
represented by normal distributions with the same stan- 
dard deviations. 
(b) Find a 95% confidence interval on the difference in mean 
foam expansion of these two agents. 


10-17. Two catalysts may be used in a batch chemical 
process. Twelve batches were prepared using catalyst 1, resulting 
in an average yield of 86 and a sample standard deviation of 3. 
Fifteen batches were prepared using catalyst 2, and they 
resulted in an average yield of 89 with a standard deviation 
of 2. Assume that yield measurements are approximately 
normally distributed with the same standard deviation. 
(a) Is there evidence to support a claim that catalyst 2 pro- 
duces a higher mean yield than catalyst 1? Use a = 0.01. 
(b) Find a 99% confidence interval on the difference in mean 
yields that can be used to test the claim in part (a). 


10-18. The deflection temperature under load for two dif- 
ferent types of plastic pipe is being investigated. Two random 
samples of 15 pipe specimens are tested, and the deflection 
temperatures observed are as follows (in °F): 


Type 1: 206, 188, 205, 187, 194, 193, 207, 185, 189, 213, 
192, 210, 194, 178, 205 


Type 2: 177, 197, 206, 201, 180, 176, 185, 200, 197, 192, 
198, 188, 189, 203, 192 


(a) Construct box plots and normal probability plots for the 
two samples. Do these plots provide support of the assump- 
tions of normality and equal variances? Write a practical 
interpretation for these plots. 

(b) Do the data support the claim that the deflection tempera- 
ture under load for type 1 pipe exceeds that of type 2? In 
reaching your conclusions, use a = 0.05. Calculate a 
P-value. 

(c) If the mean deflection temperature for type 1 pipe exceeds 
that of type 2 by as much as 5°E it is important to detect this 
difference with probability at least 0.90. Is the choice of 
Ny, = Ny = 15 adequate? Use a = 0.05. 


10-19. In semiconductor manufacturing, wet chemical etch- 
ing is often used to remove silicon from the backs of wafers 
prior to metallization. The etch rate is an important characteris- 
tic in this process and known to follow a normal distribution. 
Two different etching solutions have been compared, using two 
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random samples of 10 wafers for each solution. The observed 
etch rates are as follows (in mils per minute): 


Solution 1 Solution 2 
99 10.6 10.2 10.0 
9.4 10.3 10.6 10.2 
9.3 10.0 10.7 10.7 
9.6 10.3 10.4 10.4 

10.2 10.1 10.5 10.3 


(a) Construct normal probability plots for the two samples. 
Do these plots provide support for the assumptions of nor- 
mality and equal variances? Write a practical interpreta- 
tion for these plots. 

(b) Do the data support the claim that the mean etch rate is the 
same for both solutions? In reaching your conclusions, use 
a = 0.05 and assume that both population variances are 
equal. Calculate a P-value. 

(c) Find a 95% confidence interval on the difference in mean 
etch rates. 


10-20. Two suppliers manufacture a plastic gear used in a 
laser printer. The impact strength of these gears measured in 
foot-pounds is an important characteristic. A random sample 
of 10 gears from supplier 1 results in x, = 290 and s, = 12, 
while another random sample of 16 gears from the second 
supplier results in x, = 321 and s, = 22. 
(a) Is there evidence to support the claim that supplier 2 
provides gears with higher mean impact strength? Use 
a = 0.05, and assume that both populations are normally 
distributed but the variances are not equal. What is the 
P-value for this test? 
(b) Do the data support the claim that the mean impact 
strength of gears from supplier 2 is at least 25 foot-pounds 
higher than that of supplier 1? Make the same assump- 
tions as in part (a). 
Construct a confidence interval estimate for the difference 
in mean impact strength, and explain how this interval could 
be used to answer the question posed regarding supplier- 
to-supplier differences. 


10-21. The melting points of two alloys used in formulating 
solder were investigated by melting 21 samples of each mate- 
rial. The sample mean and standard deviation for alloy 1 was 
X, = 420°F and s, = 4°F, while for alloy 2 they were 
X_ = 426°F and 5s, = 3°F. 

(a) Do the sample data support the claim that both alloys have 
the same melting point? Use a = 0.05 and assume that 
both populations are normally distributed and have the 
same standard deviation. Find the P-value for the test. 

(b) Suppose that the true mean difference in melting points 
is 3°F. How large a sample would be required to detect this 
difference using an a = 0.05 level test with probability at 
least 0.9? Use o, = 0, = 4 as an initial estimate of the 
common standard deviation. 


(c 


YS 


10-22. A photoconductor film is manufactured at a nominal 
thickness of 25 mils. The product engineer wishes to increase 
the mean speed of the film, and believes that this can be 
achieved by reducing the thickness of the film to 20 mils. 
Eight samples of each film thickness are manufactured in a pi- 
lot production process, and the film speed (in microjoules per 
square inch) is measured. For the 25-mil film the sample data 
result is x, = 1.15 and s; = 0.11, while for the 20-mil film 
the data yield x, = 1.06 and s = 0.09. Note that an increase 
in film speed would lower the value of the observation in 
microjoules per square inch. 

(a) Do the data support the claim that reducing the film thick- 
ness increases the mean speed of the film? Use a = 0.10 
and assume that the two population variances are equal 
and the underlying population of film speed is normally 
distributed. What is the P-value for this test? 

(b) Find a 95% confidence interval on the difference in the 
two means that can be used to test the claim in part (a). 


10-23. Two companies manufacture a rubber material in- 
tended for use in an automotive application. The part will be 
subjected to abrasive wear in the field application, so we decide 
to compare the material produced by each company in a test. 
Twenty-five samples of material from each company are tested 
in an abrasion test, and the amount of wear after 1000 cycles is 
observed. For company 1, the sample mean and standard de- 
viation of wear are x; = 20 milligrams/1000 cycles and 
s,; = 2 milligrams/1000 cycles, while for company 2 we obtain 
X, = 15 milligrams/1000 cycles and s, = 8 milligrams/1000 
cycles. 

(a) Do the data support the claim that the two companies pro- 
duce material with different mean wear? Use a = 0.05, 
and assume each population is normally distributed but 
that their variances are not equal. What is the P-value for 
this test? 

(b) Do the data support a claim that the material from com- 
pany | has higher mean wear than the material from com- 
pany 2? Use the same assumptions as in part (a). 

(c) Construct confidence intervals that will address the ques- 
tions in parts (a) and (b) above. 


10-24. The thickness of a plastic film (in mils) on a sub- 
strate material is thought to be influenced by the temperature 
at which the coating is applied. A completely randomized 
experiment is carried out. Eleven substrates are coated at 
125°F, resulting in a sample mean coating thickness of 
x, = 103.5 and a sample standard deviation of s; = 10.2. 
Another 13 substrates are coated at 150°F, for which x, = 99.7 
and sy = 20.1 are observed. It was originally suspected that 
raising the process temperature would reduce mean coating 
thickness. 

(a) Do the data support this claim? Use a = 0.01 and assume 
that the two population standard deviations are not equal. 
Calculate an approximate P-value for this test. 

(b) How could you have answered the question posed regard- 
ing the effect of temperature on coating thickness by using 
a confidence interval? Explain your answer. 
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10-25. Anarticle in Electronic Components and Technology 
Conference (2001, Vol. 52, pp. 1167-1171) compared single 
versus dual spindle saw processes for copper metallized 
wafers. A total of 15 devices of each type were measured for the 
width of the backside chipouts, X.ingje = 66.385, Ssingie = 7.895 
and Xyouble = 45.278, Sgouble = 8.612. 

(a) Do the sample data support the claim that both processes 
have the same chip outputs? Use a = 0.05 and assume 
that both populations are normally distributed and have 
the same variance. Find the P-value for the test. 

(b) Construct a 95% two-sided confidence interval on the 
mean difference in spindle saw process. Compare this 
interval to the results in part (a). 

(c) If the B-error of the test when the true difference in chip 
outputs is 15 should not exceed 0.1, what sample sizes 
must be used? Use a = 0.05. 


10-26. An article in JEEE International Symposium on 
Electromagnetic Compatibility (2002, Vol. 2, pp. 667-670) 
quantified the absorption of electromagnetic energy and the 
resulting thermal effect from cellular phones. The experi- 
mental results were obtained from in vivo experiments con- 
ducted on rats. The arterial blood pressure values (mmHg) 
for the control group (8 rats) during the experiment are 
xX, = 90, s; = 5 and for the test group (9 rats) are x. = 115, 
Sy = 10. 

(a) Is there evidence to support the claim that the test group 
has higher mean blood pressure? Use a = 0.05, and as- 
sume that both populations are normally distributed but 
the variances are not equal. What is the P-value for this 
test? 

(b) Calculate a confidence interval to answer the question in 
part (a). 

(c) Do the data support the claim that the mean blood pressure 
from the test group is at least 15 mmHg higher than the 
control group? Make the same assumptions as in part (a). 

(d) Explain how the question in part (c) could be answered 
with a confidence interval. 

10-27. An article in Radio Engineering and Electronic 

Physics (1984, Vol. 29, No. 3, pp. 63-66) investigated the 

behavior of a stochastic generator in the presence of external 

noise. The number of periods was measured in a sample of 100 

trains for each of two different levels of noise voltage, 100 and 

150 mV. For 100 mV, the mean number of periods in a train 

was 7.9 with s = 2.6. For 150 mV, the mean was 6.9 with 

s = 2.4, 

(a) It was originally suspected that raising noise voltage 
would reduce mean number of periods. Do the data sup- 
port this claim? Use « = 0.01 and assume that each pop- 
ulation is normally distributed and the two population 
variances are equal. What is the P-value for this test? 

(b) Calculate a confidence interval to answer the question 
in part (a). 

10-28. An article in Technometrics (1999, Vol. 41, pp. 

202-211) studied the capability of a gauge by measuring 


the weights of two sheets of paper. The data are shown 
below. 


Paper Observations 
1 3.481 3.448 3.485 3.475 3.472 
3.477 3.472 3.464 3.472 3.470 
3.470 3.470 3.477 3.473 3.474 
2 3.258 3.254 3.256 3.249 3.241 
3.254 3.247 3207 3.239 3.250 
3.258 3.239 3.245 3.240 3.254 


(a) Check the assumption that the data from each sheet are 
from normal distributions. 

(b) Test the hypothesis that the mean weight of the two sheets 
are equal against the alternative that they are not (and 
assume equal variances). Use a = 0.05 and assume equal 
variances. Find the P-value. 

(c) Repeat the previous test with a = 0.10. 

(d) Compare your answers for parts (b) and (c) and explain 
why they are the same or different. 

(e) Explain how the questions in parts (b) and (c) could be an- 
swered with confidence intervals. 


10-29. The overall distance traveled by a golf ball is tested 
by hitting the ball with Iron Byron, a mechanical golfer with a 
swing that is said to emulate the legendary champion, Byron 
Nelson. Ten randomly selected balls of two different brands 
are tested and the overall distance measured. The data follow: 


Brand 1: 275, 286, 287, 271, 283, 271, 279, 275, 263, 267 
Brand 2: 258, 244, 260, 265, 273, 281, 271, 270, 263, 268 


(a) Is there evidence that overall distance is approximately 
normally distributed? Is an assumption of equal variances 
justified? 

(b) Test the hypothesis that both brands of ball have equal 
mean overall distance. Use a = 0.05. What is the P-value? 

(c) Construct a 95% two-sided CI on the mean difference in 
overall distance between the two brands of golf balls. 

(d) What is the power of the statistical test in part (b) to detect 
a true difference in mean overall distance of 5 yards? 

(e) What sample size would be required to detect a true dif- 
ference in mean overall distance of 3 yards with power of 
approximately 0.75? 

10-30. The “spring-like effect’ in a golf club could be de- 
termined by measuring the coefficient of restitution (the ratio 
of the outbound velocity to the inbound velocity of a golf ball 
fired at the clubhead). Twelve randomly selected drivers pro- 
duced by two clubmakers are tested and the coefficient of 
restitution measured. The data follow: 


Club 1: 0.8406, 0.8104, 0.8234, 0.8198, 0.8235, 0.8562, 
0.8123, 0.7976, 0.8184, 0.8265, 0.7773, 0.7871 
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Club 2: 0.8305, 0.7905, 0.8352, 0.8380, 0.8145, 0.8465, (c) Construct a 95% two-sided CI on the mean difference in 


0.8244, 0.8014, 0.8309, 0.8405, 0.8256, 0.8476 coefficient of restitution between the two brands of golf 
(a) Is there evidence that coefficient of restitution is approxi- clubs. 
mately normally distributed? Is an assumption of equal (d) What is the power of the statistical test in part (b) to detect 
variances justified? a true difference in mean coefficient of restitution of 0.2? 
(b) Test the hypothesis that both brands of clubs have equal (e) What sample size would be required to detect a true dif- 
mean coefficient of restitution. Use « = 0.05. What is the ference in mean coefficient of restitution of 0.1 with 
P-value of the test? power of approximately 0.8? 


10-3 A NONPARAMETRIC TEST FOR THE DIFFERENCE 
IN TWO MEANS 


Suppose that we have two independent continuous populations X, and X, with means 2, and 
|2, but we are unwilling to assume that they are (approximately) normal. However, we can as- 
sume that the distributions of X, and X, are continuous and have the same shape and spread, 
and differ only (possibly) in their locations. The Wilcoxon rank-sum test can be used to test 
the hypothesis Hp: &, = M. This procedure is sometimes called the Mann-Whitney test, al- 
though the Mann-Whitney test statistic is usually expressed in a different form. 


10-3.1 Description of the Wilcoxon Rank-Sum Test 


Let X1,, Xjo,- - », Xin, and X31, Xo, . . ., Xo, be two independent random samples of sizes n; = 
n, from the continuous populations X, and X, described earlier. We wish to test the hypotheses 


Ao: by = po 
Ay: wy F po 


The test procedure is as follows. Arrange all n, + n, observations in ascending order of 
magnitude and assign ranks to them. If two or more observations are tied (identical), use the 
mean of the ranks that would have been assigned if the observations differed. 

Let W, be the sum of the ranks in the smaller sample (1), and define W, to be the sum of 
the ranks in the other sample. Then, 


ny +nm)(ny +n 4+ 1 
wy itmin tm) on 


Now if the sample means do not differ, we will expect the sum of the ranks to be nearly equal 
for both samples after adjusting for the difference in sample size. Consequently, if the sums of 
the ranks differ greatly, we will conclude that the means are not equal. 

Appendix Table X contains the critical value of the rank sums for a = 0.05 and a = 0.01 
assuming the two-sided alternative above. Refer to Appendix Table X with the appropriate 
sample sizes n, and ny, and the critical value w, can be obtained. The null Ho: w,; = po is 
rejected in favor of H;: w, ~ p if either of the observed values w, or wy is less than or equal 
to the tabulated critical value wy. 

The procedure can also be used for one-sided alternatives. If the alternative is H): w, < 
[Ly, reject Hy if w, = w,; for Hy: wy > po, reject Hy if w, <= w,. For these one-sided tests, the 
tabulated critical values w,, correspond to levels of significance of a = 0.025 and a = 0.005. 
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EXAMPLE 10-9 Axial Stress 

The mean axial stress in tensile members used in an aircraft 
structure is being studied. Two alloys are being investigated. 
Alloy 1 is a traditional material, and alloy 2 is anew aluminum- 
lithium alloy that is much lighter than the standard material. 
Ten specimens of each alloy type are tested, and the axial stress 
is measured. The sample data are assembled in Table 10-2. 
Using a = 0.05, we wish to test the hypothesis that the means 
of the two stress distributions are identical. 


Table 10-2 Axial Stress for Two Aluminum-Lithium Alloys 


Alloy 1 Alloy 2 
3238 psi 3254 psi 3261 psi 3248 psi 
3195 3229 3187 3215 
3246 3225 3209 3226 
3190 3217 3212 3240 
3204 3241 3258 3234 


We will apply the seven-step hypothesis-testing proce- 
dure to this problem: 


1. Parameter of interest: The parameters of interest are the 
means of the two distributions of axial stress. 
Null hypothesis: Hy: 1; = 2 
Alternative hypothesis: 1: 1, ~ 2 


Test statistic: We will use the Wilcoxon rank-sum test 
statistic in Equation 10-21. 


(ny + ny)(ny oe ng + 1) 
W2.— Wi 


2 


5. Reject H, if: Since a = 0.05 and n, = n, = 10, Appendix 
Table X gives the critical value as wo95 = 78. If either w, 
Or W is less than or equal to wo95 = 78, we will reject Hp: 
Bei = Be2- 

6. Computations: The data from Table 10-2 are analyzed in 
ascending order and ranked as follows: 


10-3.2 Large-Sample Approximation 


Alloy Number Axial Stress Rank 
2 3187 psi 1 
1 3190 2 
1 3195 3} 
1 3204 4 
2 3209 5 
2 3212 6 
2 3215 7 
1 3217 8 
1 3225 9 
2 3226 10 
1 3229 11 
2 3234 12 
1 3238 13 
2 3240 14 
1 3241 15 
1 3246 16 
2 3248 17 
1 3254 18 
2 3258 19 
2 3261 20 


The sum of the ranks for alloy 1 is 


w,=24+34+44+84+94 11413415 + 16+ 18 = 99 
and for alloy 2 


(n i ny)(ny a ng - 1) 


W2 = 2 Wi 
10 + 10)(10 + 10 + 1 
zs - 99 = 111 


7. Conclusions: Since neither w, nor w, is less than or equal 
to Woos = 78, we cannot reject the null hypothesis that 
both alloys exhibit the same mean axial stress. 


Practical Interpretation: The data do not demonstrate that 
there is a superior alloy for this particular application. 


When both n, and n, are moderately large, say, greater than 8, the distribution of w, can be 
well approximated by the normal distribution with mean 


and variance 


Bw, = 


n(n, + nz + 1) 
2 


nyn,(n, + ny + 1) 


oo 12 
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Therefore, for n, and n, > 8, we could use 


Normal 
Approximation 
for Wilcoxon 
Rank-Sum 
Test Statistic 


= (10-22) 


as a Statistic, and the appropriate critical region is | Z| > Z/2, Z9 > Za» OF Z9 < —Zas 
depending on whether the test is a two-tailed, upper-tail, or lower-tail test. 


10-3.3 Comparison to the t-Test 


In Chapter 9 we discussed the comparison of the t-test with the Wilcoxon signed-rank test. The 
results for the two-sample problem are similar to the one-sample case. That is, when the nor- 
mality assumption is correct, the Wilcoxon rank-sum test is approximately 95% as efficient as 
the t-test in large samples. On the other hand, regardless of the form of the distributions, the 
Wilcoxon rank-sum test will always be at least 86% as efficient. The efficiency of the 
Wilcoxon test relative to the t-test is usually high if the underlying distribution has heavier tails 
than the normal, because the behavior of the f-test is very dependent on the sample mean, 
which is quite unstable in heavy-tailed distributions. 


EXERCISES FOR SECTION 10-3 


10-31. Anelectrical engineer must design a circuit to deliver 
the maximum amount of current to a display tube to achieve suf- 
ficient image brightness. Within her allowable design constraints, 
she has developed two candidate circuits and tests prototypes of 
each. The resulting data (in microamperes) are as follows: 


Circuit 1: | 251, 255, 258, 257, 250, 251, 254, 250, 248 
Circuit 2: | 250, 253, 249, 256, 259, 252, 260, 251 


(a) Use the Wilcoxon rank-sum test to test Hp: 4) = wz against 
the alternative H: 1; > po. Use a = 0.025. 

(b) Use the normal approximation for the Wilcoxon rank-sum 
test. Assume that a = 0.05. Find the approximate P-value 
for this test statistic. 


10-32. One of the authors travels regularly to Seattle, 
Washington. He uses either Delta or Alaska. Flight delays are 
sometimes unavoidable, but he would be willing to give most 
of his business to the airline with the best on-time arrival 
record. The number of minutes that his flight arrived late for 
the last six trips on each airline follows. Is there evidence that 
either airline has superior on-time arrival performance? Use 
a = 0.01 and the Wilcoxon rank-sum test. 


Delta: |13, 10,1, —4, 0,9 (minutes late) 


Alaska: |15, 8,3, —1, —2, 4 (minutes late) 
10-33. The manufacturer of a hot tub is interested in testing 
two different heating elements for his product. The element that 
produces the maximum heat gain after 15 minutes would be 
preferable. He obtains 10 samples of each heating unit and tests 
each one. The heat gain after 15 minutes (in °F) follows. 


Unit 1: | 25, 27, 29, 31, 30, 26, 24, 32, 33, 38 


Unit 2:1 31, 33, 32, 35, 34, 29, 38, 35, 37, 30 


(a) Is there any reason to suspect that one unit is superior to 
the other? Use a = 0.05 and the Wilcoxon rank-sum test. 


(b) Use the normal approximation for the Wilcoxon rank-sum 
test. Assume that a = 0.05. What is the approximate 
P-value for this test statistic? 


10-34. Consider the chemical etch rate data in Exercise 10-19. 

(a) Use the Wilcoxon rank-sum test to investigate the claim 
that the mean etch rate is the same for both solutions. If 
a = 0.05, what are your conclusions? 

(b) Use the normal approximation for the Wilcoxon rank-sum 
test. Assume that a = 0.05. Find the approximate P-value 
for this test. 


10-35. Consider the pipe deflection data in Exercise 10-18. 

(a) Use the Wilcoxon rank-sum test for the pipe deflection 
temperature experiment. If « = 0.05, what are your con- 
clusions? 

(b) Use the normal approximation for the Wilcoxon rank-sum 
test. Assume that a = 0.05. Find the approximate P-value 
for this test. 


10-36. Consider the distance traveled by a golf ball in 

Exercise 10-29. 

(a) Use the Wilcoxon rank-sum test to investigate if the 
means differ. Use a = 0.05. 

(b) Use the normal approximation for the Wilcoxon rank-sum 
test with a = 0.05. Find the approximate P-value for this test. 
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10-4 PAIRED t-TEST 


Paired t-Test 


A special case of the two-sample f¢-tests of Section 10-2 occurs when the observations 
on the two populations of interest are collected in pairs. Each pair of observations, say 
(Xj;, X2;), is taken under homogeneous conditions, but these conditions may change from one 
pair to another. For example, suppose that we are interested in comparing two different types 
of tips for a hardness-testing machine. This machine presses the tip into a metal specimen with 
a known force. By measuring the depth of the depression caused by the tip, the hardness of the 
specimen can be determined. If several specimens were selected at random, half tested with tip 
1, half tested with tip 2, and the pooled or independent f-test in Section 10-2 was applied, the 
results of the test could be erroneous. The metal specimens could have been cut from bar stock 
that was produced in different heats, or they might not be homogeneous in some other way that 
might affect hardness. Then the observed difference between mean hardness readings for the 
two tip types also includes hardness differences between specimens. 

A more powerful experimental procedure is to collect the data in pairs—that is, to make 
two hardness readings on each specimen, one with each tip. The test procedure would then 
consist of analyzing the differences between hardness readings on each specimen. If there is 
no difference between tips, the mean of the differences should be zero. This test procedure is 
called the paired f-test. 

Let (X11, X51), (X19, X99), --- 5 (X yy, Xo,) be a set of n paired observations where we assume 
that the mean and variance of the population represented by X, are , and of, and the mean 
and variance of the population represented by X, are 1, and o3. Define the differences be- 
tween each pair of observations as D; = X\; — X,;, j = 1,2,...,. The D;’s are assumed to be 
normally distributed with mean 


Lp = E(X, — Xp) = E(X) — E(X)) = wi — be 


and variance oj, so testing hypotheses about the difference between jt, and wp, can be 
accomplished by performing a one-sample f-test on wp. Specifically, testing Hp: wy, — 2 = Ao 
against H,: w, — 2 # Ao is equivalent to testing 

Ho: up = Ao 

Ay: Mp # Ao (10-23) 


The test statistic and decision procedure are given below. 


Null hypothesis: Ho: wp = Ag 
Test statistic: 7, De (10-24) 
est statistic: = —] @ 
ESS Ae 
Rejection Region 
Alternative Hypothesis P-Value for Fixed-Level Tests 
Hy: wp # Ao Probability above |f| and = f0 > foy2n—-1 OF fo < ~bo2,n-1 
probability below —|fo| 
Ay: wp > Ao Probability above fy fy 2 Corjacil 
Ay: wp < Ao Probability below fy iS hapai 
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In Equation 10-24, D is the sample average of the n differences D,, D>, ..., D,, and Sp is the 
sample standard deviation of these differences. 


EXAMPLE 10-10 Shear Strength of Steel Girders 
An article in the Journal of Strain Analysis (Vol. 18, No. 2 1983) 
reports a comparison of several methods for predicting the 
shear strength for steel plate girders. Data for two of these 
methods, the Karlsruhe and Lehigh procedures, when applied 
to nine specific girders, are shown in Table 10-3. We wish to 
determine whether there is any difference (on the average) 
between the two methods. 
The seven-step procedure is applied as follows: 


1. Parameter of Interest: The parameter of interest is the 
difference in mean shear strength between the two 
methods—say, tp = wy — 2 = 0. 

2. Null hypothesis: Ho: wp = 0 


Alternative hypothesis: H,: pp # 0 


4. Test statistic: The test statistic is 


Reject H, if: Reject Hy if the P-value is <0.05. 


Computations: The sample average and standard devia- 
tion of the differences d;are d = 0.2769 and s, = 0.1350, 
and so the test statistic is 


ae d __0,2769 
o sylVn 0.1350/V9 


= 6.15 


Conclusions: Because fo 9995.3 = 5.041 and the value of 
the test statistic fy = 6.15 exceeds this value, the P-value is 
less than 2(0.0005) = 0.001. Therefore, we conclude that 
the strength prediction methods yield different results. 


Practical Interpretation: Specifically, the data indicate 


ee d 
sal Vn 


that the Karlsruhe method produces, on the average, higher 
strength predictions than does the Lehigh method. This is a 
strong conclusion. 


Minitab can perform the paired ¢-test. The Minitab output for Example 10-10 is shown 


below: 


Paired T for Karlsruhe—Lehigh 


N Mean 
Karlsruhe 9 1.34011 
Lehigh 9 1.06322 
Difference 9 0.276889 


StDev SE Mean 
0.14603 0.04868 
0.05041 0.01680 
0.135027 0.045009 


95% CI for mean difference: (0.173098, 0.380680) 
T-Test of mean difference = 0 (vs not = 0): T-Value = 6.15, P-Value = 0.000 


Table 10-3 Strength Predictions for Nine Steel Plate Girders 
(Predicted Load/Observed Load) 


Girder Karlsruhe Method Lehigh Method Difference d; 


S1/1 1.186 
S2/1 1.151 
3/1 1.322 
S4/1 1.339 
5/1 1.200 
S2/1 1.402 
2/2 1.365 
S2/3 1.537 


S2/4 1.559 


1.061 0.125 
0.992 0.159 
1.063 0.259 
1.062 0.277 
1.065 0.135 
1.178 0.224 
1.037 0.328 
1.086 0.451 
1.052 0.507 
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The results essentially agree with the manual calculations. In addition to the hypothesis test re- 
sults. Minitab reports a two-sided CI on the difference in means. This Cl was found by con- 
structing a single-sample CI on tp. We will give the details later. 


Paired Versus Unpaired Comparisons 

In performing a comparative experiment, the investigator can sometimes choose between the 
paired experiment and the two-sample (or unpaired) experiment. If n measurements are to be 
made on each population, the two-sample f-statistic is 


X,-X,-A 
T = 1 . ~ 
S), a a 


which would be compared to f,,,_,, and of course, the paired t-statistic is 


7, D-bs 
° Sp/Vn 


which is compared to ¢,,_;. Notice that since 


_ n —. n (X1; — Xy;) n X,. n X,. _ 7 
J Vj 2j Vj 2j 
D= aK n > n = n > nA X2 


the numerators of both statistics are identical. However, the denominator of the two-sample 
t-test is based on the assumption that X, and X, are independent. In many paired experiments, 
a strong positive correlation p exists between X, and X,. Then it can be shown that 


V(D) = V(X, — Xz — Ao) 
= V(X,) + V(Xy) — 2 cov(X, Xo) 
= 207(1 — p) 


assuming that both populations X, and X, have identical variances o?. Furthermore, S3/n 
estimates the variance of D. Whenever there is positive correlation within the pairs, the de- 
nominator for the paired t-test will be smaller than the denominator of the two-sample t-test. 
This can cause the two-sample f-test to considerably understate the significance of the data if 
it is incorrectly applied to paired samples. 

Although pairing will often lead to a smaller value of the variance of X, — X), it does 
have a disadvantage—namely, the paired t-test leads to a loss of n — 1 degrees of freedom in 
comparison to the two-sample t-test. Generally, we know that increasing the degrees of free- 
dom of a test increases the power against any fixed alternative values of the parameter. 

So how do we decide to conduct the experiment? Should we pair the observations or not? 
Although there is no general answer to this question, we can give some guidelines based on 
the above discussion. 


1. If the experimental units are relatively homogeneous (small o) and the correlation 
within pairs is small, the gain in precision attributable to pairing will be offset by the 
loss of degrees of freedom, so an independent-sample experiment should be used. 
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2. If the experimental units are relatively heterogeneous (large o) and there is large pos- 
itive correlation within pairs, the paired experiment should be used. Typically, this 
case occurs when the experimental units are the same for both treatments; as in 
Example 10-10, the same girders were used to test the two methods. 


Implementing the rules still requires judgment, because o and p are never known precisely. 
Furthermore, if the number of degrees of freedom is large (say, 40 or 50), the loss of n — 1 of 
them for pairing may not be serious. However, if the number of degrees of freedom is small 
(say, 10 or 20), losing half of them is potentially serious if not compensated for by increased 


precision from pairing. 


Confidence Interval for tp 


To construct the confidence interval for wp = by; — p2, note that 


T= D> bp 
Sp/Vn 
follows a ¢ distribution with n — 1 degrees of freedom. Then, since P(—fy/2.,-) = T = 


by /2,n-D) =1 


— a, we can substitute for T in the above expression and perform the necessary 


steps to isolate 4p = ; — Wz between the inequalities. This leads to the following 100(1 — «)% 


confidence interval on py — po. 


Confidence 
Interval for pp 
from Paired 
Samples 


freedom. 


If d and sp are the sample mean and standard deviation of the difference of n random 
pairs of normally distributed measurements, a 100(1 — «)% confidence interval on 
the difference in means ftp = Py — 2 IS 


d — tej.n—1Sp/ Vn = Mp Sao te2.n-1Sp/ Vn 


where f,/7,,—; is the upper «/2% point of the ¢ distribution with n — 1 degrees of 


(10-25) 


This confidence interval is also valid for the case where oj # 03, because sj estimates 
op = V(X, — X,). Also, for large samples (say, n = 30 pairs), the explicit assumption of nor- 
mality is unnecessary because of the central limit theorem. 


EXAMPLE 10-11 Parallel Park Cars 

The journal Human Factors (1962, pp. 375-380) reported a 
study in which n = 14 subjects were asked to parallel park 
two cars having very different wheel bases and turning radii. 
The time in seconds for each subject was recorded and is 
given in Table 10-4. From the column of observed differences 
we calculate d = 1.21 and sp = 12.68. The 90% confidence 
interval for tp = pb; — p2 is found from Equation 10-25 as 
follows: 


d — 1095,138p/Vn = wp d + t95,135p/Vn 
1.21 — 1.771(12.68)/V14 < pp <1.21 + 1.771(12.68)/V14 
4.79 <pp<7.21 


Notice that the confidence interval on pLp includes zero. This im- 
plies that, at the 90% level of confidence, the data do not support 
the claim that the two cars have different mean parking times 1, 
and 1. That is, the value yp = ww, — 2 = 0 is not inconsistent 
with the observed data. 
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Table 10-4 Time in Seconds to Parallel Park Two 


Automobiles 
Automobile Difference 
Subject 1(x,;) 2(xy;) (d)) 
1 37.0 17.8 19.2 
2 25.8 20.2 5.6 
3 16.2 16.8 —0.6 
4 24.2 41.4 —17.2 
5 22.0 21.4 0.6 
6 33.4 38.4 —5.0 
7 23.8 16.8 7.0 
8 58.2 32.2 26.0 
9 33.6 27.8 5.8 
10 24.4 23.2 1.2 
11 23.4 29.6 —6.2 
12 21.2 20.6 0.6 
13 36.2 322) 4.0 
14 29.8 53.8 —24.0 


Nonparametric Approach to Paired Comparisons 

Both the sign test and the Wilcoxon signed-rank test discussed in Section 9-9 can be applied 
to paired observations. In the case of the sign test, the null hypothesis is that the median of the 
differences is equal to zero (that is, Hp: fip = 0). The Wilcoxon signed-rank test is for the null 
hypothesis that the mean of the differences is equal to zero. The procedures are applied to the 
observed differences as described in Sections 9-9.1 and 9-9.2. 
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10-37. Consider the shear strength experiment described in 

Example 10-10. 

(a) Construct a 95% confidence interval on the difference in 
mean shear strength for the two methods. Is the result you 
obtained consistent with the findings in Example 10-10? 
Explain why. 

(b) Do each of the individual shear strengths have to be 
normally distributed for the paired f-test to be appropriate, 
or is it only the difference in shear strengths that must 
be normal? Use a normal probability plot to investigate 
the normality assumption. 


10-38. Consider the parking data in Example 10-11. 

(a) Use the paired t-test to investigate the claim that the two 
types of cars have different levels of difficulty to parallel 
park. Use a = 0.10. 

(b) Compare your results with the confidence interval con- 
structed in Example 10-11 and comment on why they are 
the same or different. 

(c) Investigate the assumption that the differences in parking 
times are normally distributed. 


10-39. The manager of a fleet of automobiles is testing 
two brands of radial tires. He assigns one tire of each brand 


at random to the two rear wheels of eight cars and runs the 
cars until the tires wear out. The data (in kilometers) follow. 
Find a 99% confidence interval on the difference in mean life. 
Which brand would you prefer, based on this calculation? 


Car Brand 1 Brand 2 
1 36,925 34,318 
2 45,300 42,280 
3 36,240 35,500 
4 32,100 31,950 
5 37,210 38,015 
6 48,360 47,800 
i 38,200 37,810 
8 33,500 33,215 


10-40. A computer scientist is investigating the usefulness 
of two different design languages in improving programm- 
ing tasks. Twelve expert programmers, familiar with both 
languages, are asked to code a standard function in both lan- 
guages, and the time (in minutes) is recorded. The data follow: 


Time 
Design Design 
Language Language 
Programmer 1 2 
1 17 18 
2 16 14 
3 21 19 
4 14 11 
5 18 23 
6 24 21 
7 16 10 
8 14 13 
9 21 19 
10 23 24 
11 13 15 
12 18 20 


(a) Is the assumption that the difference in coding time is nor- 
mally distributed reasonable? 

(b) Find a 95% confidence interval on the difference in mean 
coding times. Is there any indication that one design lan- 
guage is preferable? 


10-41. Fifteen adult males between the ages of 35 and 50 
participated in a study to evaluate the effect of diet and exercise 
on blood cholesterol levels. The total cholesterol was meas- 
ured in each subject initially and then three months after 
participating in an aerobic exercise program and switching 
to a low-fat diet. The data are shown in the accompanying 
table 


Blood Cholesterol Level 


Subject Before After 
1 265 229 
2 240 231 
3 258 DOF 
4 295 240 
5 251 238 
6 245 241 
Tf 287 234 
8 314 256 
9 260 247 
10 279 239 
11 283 246 
12 240 218 
13 238 219 
14 225 226 
15 247 253 
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(a) Do the data support the claim that low-fat diet and aerobic 
exercise are of value in producing a mean reduction in 
blood cholesterol levels? Use a = 0.05. Find the P-value. 

(b) Calculate a one-sided confidence limit that can be used 
to answer the question in part (a). 


10-42. An article in the Journal of Aircraft (Vol. 23, 1986, 
pp. 859-864) described a new equivalent plate analysis 
method formulation that is capable of modeling aircraft struc- 
tures such as cranked wing boxes, and that produces results 
similar to the more computationally intensive finite element 
analysis method. Natural vibration frequencies for the cranked 
wing box structure are calculated using both methods, and 
results for the first seven natural frequencies follow: 


Finite Equivalent 
Element Plate, 
Freq. Cycle/s Cycle/s 
1 14.58 14.76 
2 48.52 49.10 
3 97.22 99.99 
4 113.99 117.53 
5 174.73 181.22 
6 212.72 220.14 
7 277.38 294.80 


(a) Do the data suggest that the two methods provide the same 
mean value for natural vibration frequency? Use a = 0.05. 
Find the P-value. 

(b) Find a 95% confidence interval on the mean difference 
between the two methods. 


10-43. Ten individuals have participated in a diet-modification 
program to stimulate weight loss. Their weight both before and 
after participation in the program is shown in the following list. 


Subject Before After 
1 195 187 
2 213 195 
3 247 22) 
4 201 190 
5 187 175 
6 210 197 
7 215 199 
8 246 221 
9 294 278 
10 310 285 


(a) Is there evidence to support the claim that this particular 
diet-modification program is effective in producing a 
mean weight reduction? Use a = 0.05. 

(b) Is there evidence to support the claim that this particular 
diet-modification program will result in a mean weight 
loss of at least 10 pounds? Use a = 0.05. 
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(c) Suppose that, if the diet-modification program results in 
mean weight loss of at least 10 pounds, it is important to 
detect this with probability of at least 0.90. Was the use of 
10 subjects an adequate sample size? If not, how many 
subjects should have been used? 


10-44. Two different analytical tests can be used to deter- 
mine the impurity level in steel alloys. Eight specimens 
are tested using both procedures, and the results are shown in 
the following tabulation. 


Specimen Test 1 Test 2 
1 2 1.4 
2 1.3 1.7 
3 15) LS) 
4 1.4 1.3 
5 if 2.0 
6 1.8 2.1 
7 1.4 1.7 
8 1.3 1.6 


(a) Is there sufficient evidence to conclude that tests differ in 
the mean impurity level, using a = 0.01? 

(b) Is there evidence to support the claim that Test 1 generates a 
mean difference 0.1 units lower than Test 2? Use a = 0.05. 

(c) If the mean from Test 1 is 0.1 less than the mean from 
Test 2, it is important to detect this with probability at least 
0.90. Was the use of eight alloys an adequate sample size? 
If not, how many alloys should have been used? 


10-45. Anarticle in Neurology (1998, Vol. 50, pp. 1246-1252) 
discussed that monozygotic twins share numerous physical, psy- 
chological, and pathological traits. The investigators measured an 
intelligence score of 10 pairs of twins, and the data are as follows: 


Pair Birth Order: 1 Birth Order: 2 


1 6.08 Sb 
2 6.22 5.80 
3 TY 8.42 
4 7.44 6.84 
5) 6.48 6.43 
6 7.99 8.76 
7 6.32 6.32 
8 7.60 7.62 
9 6.03 6.59 
10 7.52 7.67 


(a) Is the assumption that the difference in score is nor- 
mally distributed reasonable? Show results to support 
your answer. 

(b) Find a 95% confidence interval on the difference in mean 
score. Is there any evidence that mean score depends on 
birth order? 

(c) It is important to detect a mean difference in score of one 
point, with a probability of at least 0.90. Was the use of 10 
pairs an adequate sample size? If not, how many pairs 
should have been used? 


10-46. In Biometrics (1990, Vol. 46, pp. 673-87), the au- 


thors analyzed the circumference of five orange trees (labeled 
as A-E) measured on seven occasions (x;). 


Tree xy Xp X3 X4 X5 X6 x7 
A 30 58 87 115 120 142 145 
33 69 111 156 172 ~=203 203 
30.~=— 51 1S 108 115 139 140 
62 112 167 179 209 214 
30 = 49 81 125 142 174 177 


moa 
ww 
N 


(a) Compare the mean increase in circumference in periods 
1 to 2 to the mean increase in periods 2 to 3. The increase 
is the difference in circumference in the two periods. Are 
these means significantly different at a = 0.10? 

(b) Is there evidence that the mean increase in period 1 to 
period 2 is greater than the mean increase in period 6 to 
period 7 at a = 0.05? 

(c) Are the assumptions of the test in part (a) violated because 
the same data (period 2 circumference) is used to calculate 
both mean increases? 

10-47. Use the sign test on the blood cholesterol data in 

Exercise 10-41. Is there evidence that diet and exercise reduce 

the median cholesterol level? 

10-48. Repeat Exercise 10-47 using the Wilcoxon signed- 

rank test. State carefully what hypothesis is being tested and 

how it differs from the one tested in Exercise 10-47. 


10-5 INFERENCE ON THE VARIANCES OF TWO 


NORMAL DISTRIBUTIONS 


We now introduce tests and confidence intervals for the two population variances shown in 
Fig. 10-1. We will assume that both populations are normal. Both the hypothesis-testing and 
confidence interval procedures are relatively sensitive to the normality assumption. 
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10-5.1  F Distribution 


Suppose that two independent normal populations are of interest, where the population means 
and variances, say, 1}, 07, M2, and 03, are unknown. We wish to test hypotheses about the 
equality of the two variances, say, Hj: oj = 03. Assume that two random samples of size n, 
from population 1 and of size n, from population 2 are available, and let Sj and 55 be the sam- 
ple variances. We wish to test the hypotheses 


Ho: o7 = 05 
Aya, #02 (10-26) 


The development of a test procedure for these hypotheses requires a new probability 
distribution, the F distribution. The random variable F is defined to be the ratio of two 
independent chi-square random variables, each divided by its number of degrees of free- 
dom. That is, 


_ Wu 
7 Y/v 


(10-27) 


where W and Y are independent chi-square random variables with u and v degrees of freedom, 
respectively. We now formally state the sampling distribution of F. 


F Distribution 
Let W and Y be independent chi-square random variables with u and v degrees of 
freedom, respectively. Then the ratio 


_ Wi 
Fe= Y/v (10-28) 


has the probability density function 


ut+v\(u\? u/2)-1 
FQ) = : ( i : _ 
r(3)r(3) (s)s+1| 


and is said to follow the F distribution with u degrees of freedom in the numerator 
and v degrees of freedom in the denominator. It is usually abbreviated as F,,,. 


O0<x<0 (10-29) 


The mean and variance of the F distribution are 1 = v/(v — 2) for v > 2, and 


2v7(u + v — 2) 
~ u(y — 27°(v -— 4Y 


v>4 


Two F distributions are shown in Fig. 10-4. The F random variable is nonnegative, and the 
distribution is skewed to the right. The F distribution looks very similar to the chi-square dis- 
tribution; however, the two parameters u and v provide extra flexibility regarding shape. 
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F(x) 


f(x) 


a 


fe) 2 4 6 8 10 x fi-a,u,v fa, u,v x 


Figure 10-4 Probability density functions of | Figure 10-5 Upper and lower percentage 
two F distributions. points of the F distribution. 


The percentage points of the F distribution are given in Table VI of the Appendix. Let 
Fou.y be the percentage point of the F distribution, with numerator degrees of freedom u and 
denominator degrees of freedom v such that the probability that the random variable F 
exceeds this value is 


(oo) 


P(F eae ee > | rooax =o 


Fea 


This is illustrated in Fig. 10-5. For example, if uw = 5 and v = 10, we find from Table V of the 
Appendix that 


P(F > foos5,10) = P(Fs,10 > 3.33) = 0.05 


That is, the upper 5 percentage point of F's 19 iS fo.os,s.10 = 3-33. 
Table V contains only upper-tail percentage points (for selected values of f, ,,, fora = 0.25) 
of the F distribution. The lower-tail percentage points f,_,.,,, can be found as follows. 


(10-30) 


For example, to find the lower-tail percentage point f,95 5 9, note that 


1 1 
fo.95,5,10 = aan ~ 474 


= 0.211 


10-5.2 Hypothesis Tests on the Ratio of Two Variances 


A hypothesis-testing procedure for the equality of two variances is based on the following 
result. 
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Distribution 
of the Ratio Let X,;, X19,..., X;,, be arandom sample from a normal population with mean «., and 
of Sample variance 07, and let X>,, Xy,..., Xo), be a random sample from a second normal pop- 


Variances from 
Two Normal 
Distributions 


inator degrees of freedom. 


ulation with mean wp, and variance o3. Assume that both normal populations are 
independent. Let Sj and S3 be the sample variances. Then the ratio 


has an F distribution with n, — 1 numerator degrees of freedom and n, — 1 denom- 


_ St/ot 
S3/05 


This result is based on the fact that (n,; — 1)S{/o7 is a chi-square random variable with n, — 1 


degrees of freedom, that (n, — 1)S 5 


/o3 is a chi-square random variable with n, — 1 degrees 


of freedom, and that the two normal populations are independent. Clearly under the null 
hypothesis Hy: oj = 03 the ratio Fy = S +/S% has an F,, 1,1 distribution. This is the basis of 


the following test procedure. 


Tests on the 
Ratioof | Null hypothesis: Ho: 07 = 3 
Variances from 52 
Eve Soria Test statistic: Fo = = (10-31) 
Distributions S5 
Alternative Hypotheses Rejection Criterion 

H;: o7 # 05 jo > de Pa nat C8. = HiePag pat 
A: ot > 05 to = Ife ilaoet 
A: ot < 03 fo < fi-o, n.-1n-1 


The critical regions for these fixed-significance-level tests are shown in Figure 10-6. 


f(x) f(x) 


2 
Xn-1 


a/2 
a/2 


x ie) 


2 2 
O Xt_a2,n-1 Xal2,n-1 


(a) 


f(x) 


2 2 
Xn-1 Xn-1 


2 2 
Xo,n-1 x ) Xi -o@n-1 


(b) (c) 


Figure 10-6 The F distribution for the test of Hp: 07 = o% with critical region values for (a) Hj: of # 0%, (b) Hy: 07 > 0%, 


and (c) H;: 07 < 04. 


EXAMPLE 10-12 Semiconductor Etch Variability 

Oxide layers on semiconductor wafers are etched in a mixture 
of gases to achieve the proper thickness. The variability in the 
thickness of these oxide layers is a critical characteristic of the 
wafer, and low variability is desirable for subsequent process- 
ing steps. Two different mixtures of gases are being studied to 
determine whether one is superior in reducing the variability 


of the oxide thickness. Sixteen wafers are etched in each gas. 
The sample standard deviations of oxide thickness are s,; = 
1.96 angstroms and s, = 2.13 angstroms, respectively. Is there 
any evidence to indicate that either gas is preferable? Use a 
fixed-level test with a = 0.05. 
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The seven-step hypothesis-testing procedure may be applied 


to this problem as follows: 


1. 


Parameter of interest: The parameter of interest are the 
variances of oxide thickness o7 and 03. We will assume 
that oxide thickness is a normal random variable for both 
gas mixtures. 


Null hypothesis: Ho: 07 = 03 
Alternative hypothesis: H,: 07 # 03 
Test statistic: The test statistic is given by equation 10-31: 


Reject Hy if : Because n, = n. = 16 and a = 0.05, we 
will reject Ho: of = 03 if fo > fo.o25,15,15 = 2.86 or if 


P-Values for the F-Test 
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fo < fo975,15,.15 = 1/fo.025,15,15 = 1/2.86 = 0.35. Refer to 
Figure 10-6(a). 
Computations: Because sj = (1.96)? = 3.84 and s} = 
(2.13)? = 4.54, the test statistic is 

sj 3.84 


== =—— =0.85 
fo 33 4.54 


Conclusions: Because fo975.15,15 0.35 < 0.85 < 
foors.15.15 = 2.86, we cannot reject the null hypothesis Ho: 
0; = 0% at the 0.05 level of significance. 


Practical Interpretation: There is no strong evidence to 


indicate that either gas results in a smaller variance of oxide 
thickness. 


The P-value approach can also be used with F-tests. To show how to do this, consider the 
upper-tailed one-tailed test. The P-value is the area (probability) under the F distribution with 
n, — 1 and ny, — | degrees of freedom that lies beyond the computed value of the test statistic 
Jo. Appendix A Table IV can be used to obtain upper and lower bounds on the P-value. For 
example, consider an F-test with 9 numerator and 14 denominator degrees of freedom for 
which fo = 3.05. From Appendix A Table IV we find that foo5.9,144 = 2.65 and fo259.14 = 3.21, 
so because f, = 3.05 lies between these two values, the P-value is between 0.05 and 0.025; that 
is, 0.025 < P < 0.05. The P-value for a lower-tailed test would be found similarly, although 
since Appendix A Table IV contains only upper-tail points of the F distribution, equation 10-30 
would have to be used to find the necessary lower-tail points. For a two-tailed test, the bounds 
obtained from a one-tail test would be doubled to obtain the P-value. 


Finding the 


To illustrate calculating bounds on the P-value for a two-tailed F-test, reconsider 


P-Value for Example 10-12. The computed value of the test statistic in this example is fo = 0.85. This 
Example 10-12 value falls in the lower tail of the F 15,15 distribution. The lower-tail point that has 0.25 proba- 
bility to the left of it is fo75 1515 = 1/fo.25,15,15 = 1/1.43 = 0.70 and since 0.70 < 0.85, the prob- 

ability that lies to the left of 0.85 exceeds 0.25. Therefore, we would conclude that the P-value 

for fy = 0.85 is greater than 2(0.25) = 0.5, so there is insufficient evidence to reject the null 

hypothesis. This is consistent with the original conclusions from Example 10-12. The actual 

P-value is 0.7570. This value was obtained from a calculator from which we found that 

P(F 15,15 S 0.85) = 0.3785 and 2(0.3785) = 0.7570. Minitab can also be used to calculate the 


required probabilities. 


Minitab will perform the F-test on the equality of two variances of independent normal 
distributions. The Minitab output is shown below. 


Test for Equal Variances 


95% Bonferroni confidence intervals for standard deviations 


Sample N Lower 
1 16 1.38928 
2 16 1.51061 


F-Test (Normal Distribution) 
Test statistic = 0.85, P-value 


StDev Upper 
1.95959 3.24891 
2.13073 3.53265 
= 0.750 
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Minitab also gives confidence intervals on the individual variances. These are the confidence 
intervals originally given in Equation 8-19, except that a Bonferroni “adjustment” has been 
applied to make the confidence level for both intervals simultaneously at least equal to 95%. 
This consists of using a/2 = 0.05/2 = 0.025 to construct the individual intervals. That is, each 
individual confidence interval is a97.5% CI. In Section 10-5.4, we will show how to construct 
a Cl on the ratio of the two variances. 


10-5.3 Type II Error and Choice of Sample Size 


Appendix Charts VIlo, VIIp, VIIg, and VIIr provide operating characteristic curves for the 
F-test given in Section 10-5.1 for a = 0.05 and a = 0.01, assuming that n, = n, = n. Charts 
VIlo and VIIp are used with the two-sided alternate hypothesis. They plot B against the 
abscissa parameter 

Oo} 


A= (10-32) 


oy) 
for various n; = n, = n. Charts VIIq and VIIr are used for the one-sided alternative hypotheses. 


EXAMPLE 10-13 Semiconductor Etch Variability Sample Size 

For the semiconductor wafer oxide etching problem in = = 

Example 10-12, suppose that one gas resulted in a standard 02 

deviation of oxide thickness that is half the standard deviation By referring to Appendix Chart VIlo with n, =n. = n = 20 
of oxide thickness of the other gas. If we wish to detect such a and \ = 2, we find that 8 ~ 0.20. Therefore, if 8 = 0.20, the 


oO; 


situation with probability at least 0.80, is the sample size n, = power of the test (which is the probability that the difference in 
ny = 20 adequate? standard deviations will be detected by the test) is 0.80, and we 
Note that if one standard deviation is half the other, conclude that the sample sizes n, = n, = 20 are adequate. 


10-5.4 Confidence Interval on the Ratio of Two Variances 


To find the confidence interval on 07/03, recall that the sampling distribution of 


_ $3/03 
Si/oy 


is an F with n. — 1 and n, — 1 degrees of freedom. Therefore, P( f\—o2,n,-1,)-1 =F = 
fe/2.n,-1,n,-1) = 1 — a. Substitution for F and manipulation of the inequalities will lead to 
the 100(1 — «)% confidence interval for 07/05. 


Confidence 
Interval on If sj and s3 are the sample variances of random samples of sizes n, and n5, respec- 
the Ratio of tively, from two independent normal populations with unknown variances o7 and 03, 
Variances from then a 100(1 — «)% confidence interval on the ratio 07/03 is 
Two Normal 
Distributions st ot si 
Shi Peqiaa = = SS lepaiaai (10-33) 
oy) O27 $2 


where fo/2n,-1,n,-1 8nd fi-a/2,n,-1,n,-1 are the upper and lower a/2 percentage 
points of the F distribution with n. — 1 numerator and n, — 1 denominator degrees 
of freedom, respectively. A confidence interval on the ratio of the standard deviations 
can be obtained by taking square roots in Equation 10-33. 
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EXAMPLE 10-14 Surface Finish for Titanium Alloy 
A company manufactures impellers for use in jet-turbine 
engines. One of the operations involves grinding a particular 
surface finish on a titanium alloy component. Two different 
grinding processes can be used, and both processes can produce 
parts at identical mean surface roughness. The manufacturing 
engineer would like to select the process having the least vari- 
ability in surface roughness. A random sample of n, = 11 parts 
from the first process results in a sample standard deviation 
s, = 5.1 microinches, and a random sample of n, = 16 parts 
from the second process results in a sample standard deviation of 
Sy = 4.7 microinches. We will find a 90% confidence interval on 
the ratio of the two standard deviations, 0/0. 

Assuming that the two processes are independent and 
that surface roughness is normally distributed, we can use 
Equation 10-33 as follows: 


2 2 2 
s oj Ss 
1 ie 

2 2 


2 £0.95.15.10 = = 7 Jo05,15,10 
3 
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(5.1)? of — (5.1) 
503955 5 2.85 
(4.7) 03 (4.7) 


or upon completing the implied calculations and taking square 
roots, 


0.678 < —! < 1.832 
02 


Notice that we have used Equation 10-30 to find fo95.1510 = 
1/fo.os.10.15 = 1/2.54 = 0.39. 


Practical Interpretation: Since this confidence interval in- 
cludes unity, we cannot claim that the standard deviations of 
surface roughness for the two processes are different at the 
90% level of confidence. 


10-49. Foran F distribution, find the following: 

(a) fo25,5,10 (b) fo.10,24,9 

(c) foos.gis (4) fo.75,5,10 

(€) fo.90,24,9 (£) fo.9s,8,15 

10-50. Foran F distribution, find the following: 

(a) fo2s,7,15 (b) fo.10,10,12 

(C) fo.or.20.10 (4) fo7s.715 

(€) foso.1012 () fo.99,20,10 

10-51. Consider the hypothesis test Hy:o7 = 0% against 


H,:0% <0}. Suppose that the sample sizes are n, = 5 and 
n> = 10, and that s} = 23.2 and s5 = 28.8. Use a = 0.05. Test 
the hypothesis and explain how the test could be conducted 
with a confidence interval on 0/0). 


10-52. Consider the hypothesis test Hy:o7 = 0% against 
Hy:0; > 03. Suppose that the sample sizes are n; = 20 and 
ny = 8, and that st = 4.5 and s3 = 2.3. Use a = 0.01. Test the 
hypothesis and explain how the test could be conducted with a 
confidence interval on o,/0>. 


10-53. Consider the hypothesis test Hy:o] = 0% against 

H,:0; # 03. Suppose that the sample sizes are n, = 15 and 

n> = 15, and the sample variances are st = 2.3 and s3 = 1.9. 

Use a = 0.05. 

(a) Test the hypothesis and explain how the test could be con- 
ducted with a confidence interval on 0/0. 

(b) What is the power of the test in part (a) if a1 is twice as 
large as o>? 

(c) Assuming equal sample sizes, what sample size should be 
used to obtain B = 0.05 if the o is half of o,? 


10-54. Two chemical companies can supply a raw material. 
The concentration of a particular element in this material is 


important. The mean concentration for both suppliers is the 
same, but we suspect that the variability in concentration may 
differ between the two companies. The standard deviation of 
concentration in a random sample of n, = 10 batches pro- 
duced by company | is s,; = 4.7 grams per liter, while for com- 
pany 2, a random sample of n, = 16 batches yields s) = 5.8 
grams per liter. Is there sufficient evidence to conclude that the 
two population variances differ? Use a = 0.05. 


10-55. A study was performed to determine whether men 
and women differ in their repeatability in assembling compo- 
nents on printed circuit boards. Random samples of 25 men 
and 21 women were selected, and each subject assembled the 
units. The two sample standard deviations of assembly time 
WETE Sen = 0.98 minutes and Syomen = 1.02 minutes. 

(a) Is there evidence to support the claim that men and women 
differ in repeatability for this assembly task? Use a = 
0.02 and state any necessary assumptions about the under- 
lying distribution of the data. 

(b) Find a 98% confidence interval on the ratio of the two 
variances. Provide an interpretation of the interval. 


10-56. Consider the foam data in Exercise 10-16. Construct 

the following: 

(a) A 90% two-sided confidence interval on 07/03. 

(b) A 95% two-sided confidence interval on o{/03. Comment 
on the comparison of the width of this interval with the 
width of the interval in part (a). 

(c) A 90% lower-confidence bound on o,/0>. 


10-57. Consider the diameter data in Exercise 10-15. Con- 
struct the following: 
(a) A 90% two-sided confidence interval on 0/0. 
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(b) A 95% two-sided confidence interval on 0/07. Comment 
on the comparison of the width of this interval with the 
width of the interval in part (a). 

(c) A 90% lower-confidence bound on o,/0>. 


10-58. Consider the gear impact strength data in Exercise 
10-20. Is there sufficient evidence to conclude that the variance 
of impact strength is different for the two suppliers? Use a = 
0.05. 


10-59. Consider the melting-point data in Exercise 10-21. 
Do the sample data support a claim that both alloys have the 
same variance of melting point? Use a = 0.05 in reaching 
your conclusion. 


10-60. Exercise 10-24 presented measurements of plastic 
coating thickness at two different application temperatures. 
Test Hy: of = 03 against H): of # 03 using a = 0.01. 
10-61. Reconsider the overall distance data for golf balls 
in Exercise 10-29. Is there evidence to support the claim 
that the standard deviation of overall distance is the same 
for both brands of balls (use a = 0.05)? Explain how this 
question can be answered with a 95% confidence interval on 
o|/0>. 
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10-62. Reconsider the coefficient of restitution data in 
Exercise 10-30. Do the data suggest that the standard deviation is 
the same for both brands of drivers (use a = 0.05)? Explain how 
to answer this question with a confidence interval on o;/o>. 


10-63. Consider the weight of paper data from Technometrics 
in Exercise 10-28. Is there evidence that the variance of the 
weight measurement differs between the sheets of paper? Use 
a = 0.05. Explain how this test can be conducted with a con- 
fidence interval. 


10-64. Consider the film speed data in Exercise 10-22. 

(a) Test Hp: 07 = 03 versus H: 07 # 03 using a = 0.02. 

(b) Suppose that one population standard deviation is 50% 
larger than the other. Is the sample size n, = n2 = 8 ade- 
quate to detect this difference with high probability? 
Use a = 0.01 in answering this question. 


10-65. Consider the etch rate data in Exercise 10-19. 

(a) Test the hypothesis Hy: oj = 03 against Hy: oj # 03 
using a = 0.05, and draw conclusions. 

(b) Suppose that if one population variance is twice as large 
as the other, we want to detect this with probability at least 
0.90 (using a = 0.05). Are the sample sizes n, = nz = 10 
adequate? 


10-6 INFERENCE ON TWO POPULATION PROPORTIONS 


We now consider the case where there are two binomial parameters of interest, say, p, and p>, 
and we wish to draw inferences about these proportions. We will present large-sample 
hypothesis testing and confidence interval procedures based on the normal approximation to 


the binomial. 


10-6.1 Large-Sample Tests on the Difference 


in Population Proportions 


Suppose that two independent random samples of sizes n, and n, are taken from two pop- 
ulations, and let X, and X, represent the number of observations that belong to the class of in- 
terest in samples | and 2, respectively. Furthermore, suppose that the normal approximation 
to the binomial is applied to each population, so the estimators of the population proportions 
P, = X,/n, and P, = X>/n have approximate normal distributions. We are interested in 


testing the hypotheses 


The statistic 


Ao: Py = P2 
Ay: py # Po 


Test Statistic 
for the B B 
Ritaeboe = 
Difference of Z= : 2 — (Pi = Pa) (10-34) 
Two Population pil = 71) ie p21 — pr) 
Proportions ny Wb) 
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is distributed approximately as standard normal and is the basis of a test for Ho: py = po. 
Specifically, if the null hypothesis Ho: p; = p> is true, using the fact that p,; = py = p, the 
random variable 


~ Xy +X, 
P= 
ny + ny 
The test statistic for Ho: p; = p2 1s then 
P, aa P, 


Zo 


This leads to the test procedures described below. 


Approximate 
Tests on the Null hypothesis: Ho: P; = Pr 
Difference of p Pp 
es Test statistic: f= — (10-35) 
Proportions ‘ . wai 1 
Pa — B)( — +— 
( ) Wil Wh) 
Rejection Criterion 
Alternative Hypothesis P-Value for Fixed-Level Tests 
Ay: py # Po Probability above |zp| and Zp > %/2 OF % < —Zap2 
probability below —|Zp|. 
P = 2[1 — O(|z9)] 
Ay: p, > po Probability above Zp. iy SS Ke 
P=1— ®(z) 
Ay: py < pr Probability below zp. Zy < 2a 
ee) 


EXAMPLE 10-15 St. John’s Wort 


Extracts of St. John’s Wort are widely used to treat depression. 
An article in the April 18, 2001, issue of the Journal of the 
American Medical Association (“Effectiveness of St. John’s 
Wort on Major Depression: A Randomized Controlled Trial”) 
compared the efficacy of a standard extract of St. John’s Wort 
with a placebo in 200 outpatients diagnosed with major depres- 
sion. Patients were randomly assigned to two groups; one group 
received the St. John’s Wort, and the other received the placebo. 
After eight weeks, 19 of the placebo-treated patients showed 
improvement, whereas 27 of those treated with St. John’s Wort 


improved. Is there any reason to believe that St. John’s Wort is 
effective in treating major depression? Use a = 0.05. 

The seven-step hypothesis testing procedure leads to the 
following results: 


1. Parameter of interest: The parameters of interest are p, 
and p>, the proportion of patients who improve following 
treatment with St. John’s Wort (~,) or the placebo (p>). 


2. Null hypothesis: Hy: p,; = p> 
3. Alternative hypothesis: H,: p, # p> 


10-6.2 Type II Error and Choice of Sample Size 
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Test statistic: The test statistic is 


au -a( +z) 


where p, = 27/100 = 0.27, po = 19/100 = 0.19, n, =n 
100, and 


Zo — 


XTX 19 + 27 =~ 0.23 
Pe m +m 100+100. ~ 


Reject Ho if: Reject Hy: p, = p2 if the P-value is less 
than 0.05. 


6. Computations: The value of the test statistic is 


0.27 — 0.19 


1 1 
0.23(0.77)| T35 + Gg9 


% 1.34 


7. Conclusions: Since z) = 1.34, the P-value is P = 
2[1 — (1.34)] = 0.18, so, we cannot reject the null 
hypothesis. 


Practical Interpretation: There is insufficient evidence to 
support the claim that St. John’s Wort is effective in treating 
major depression. 


The following box shows the Minitab two-sample hypothesis test and CI procedure for 
proportions. Notice that the 95% CI on p,; — p> includes zero. The equation for constructing 
the CI will be given in Section 10-6.3. 


Minitab Computations 
Test and CI for Two Proportions 


Sample 
1 
2 
Estimate for p(1) — p(2): 0.08 


N Sample p 
100 0.270000 
100 0.190000 


95% CI for p(1) — p(2): (—0.0361186, 0.196119) 
Test for p(1) — p(2) = 0 (vs not = 0): Z = 1.35 P-Value = 0.177 


The computation of the B-error for the large-sample test of Ho: py = p> is somewhat more 
involved than in the single-sample case. The problem is that the denominator of the test 
statistic Z) is an estimate of the standard deviation of P, = P, under the assumption that 
P| = Po = p. When Hy: p; = pois false, the standard deviation of P, — Py is 


Pill — pi) , poll — pr) 
Op -P, = J 1 + Ny (10-36) 
Approximate 
Type I Error If the alternative hypothesis is two sided, the B-error is 
for a Two-Sided 
Test on the es 
Difference of hs FS La/2 pq(l/n, oF 1/n) a (Pi ~ P>) 
Two Population OPP, 
Proportions 
i pq(1/n, + 1/n2) — (p1 — po) 
7 0| a/2 a g 2 1 2 (10-37) 
B-P, 


392 


Approximate 
Type I Error 
for a One-Sided 
Test on the 
Difference of 
Two Population 
Proportions 


Approximate 
Sample Size 
for a Two-Sided 
Test on the 
Difference in 
Population 
Proportions 
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where 


NP, + Nzp2 Zs 
q 


_ ny + Ny ane 


and op — 4, is given by Equation 10-36. 


If the alternative hypothesis is Hy: p; > po, 


Vpq(l/n, + 1/n2) — (py —#2) 


OA? 


BS o|* (10-38) 


and if the alternative hypothesis is H,: p; < po, 


i= > ees + I/n) = 


OPP, 
For a specified pair of values p, and p,, we can find the sample sizes n, = n, = n required to 
give the test of size a that has specified type II error B. 


(10-39) 


B = (P1 af 


For the two-sided alternative, the common sample size is 


[ZusoW (Pi + Poi + G2)/2 + zgV Pia + P2942]? 
(Pi = Po) 


(10-40) 


n= 


where g, = 1 — p, and q = 1 — p>. 


For a one-sided alternative, replace Z,/2 in Equation 10-40 by 2,. 


10-6.3 Confidence Interval on the Difference in Population Proportions 


The confidence interval for p; — pz can be found directly, since we know that 


Py _ Py — (Pi — pr) 
= — Pi) , Pall = Po) 


ny Ny 


Z= 


is a standard normal random variable. Thus P(—Z,/. = Z S Z,/2) = 1 — a, so we can substi- 
tute for Z in this last expression and use an approach similar to the one employed previously 
to find an approximate 100(1 — «)% two-sided confidence interval for p; — pp. 
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Approximate 
Confidence 
Interval on the 
Difference in 
Population 


‘ — Pri 
Proportions Pi~ Pris 


If p, and p> are the sample proportions of observations in two independent random 
samples of sizes n, and n, that belong to a class of interest, an approximate two- 
sided 100(1 — a)% confidence interval on the difference in the true proportions 


Po(1 — po) 


a Pi(l — pi) 
P\ — Pr ~ %a/2 ny a 


= Pi — Po =P, — po + as 1 a Ty 


where z,,2 is the upper a/2 percentage point of the standard normal distribution. 


Ny 


P21 — py) 


(10-41) 


EXAMPLE 10-16 Defective Bearings 

Consider the process of manufacturing crankshaft bearings 
described in Example 8-7. Suppose that a modification is 
made in the surface finishing process and that, subsequently, a 
second random sample of 85 bearings is obtained. The number 
of defective bearings in this second sample is 8. Therefore, 
since n, = 85, p, = 0.12, nm) = 85, and p. = 8/85 = 0.09, we 
can obtain an approximate 95% confidence interval on the 
difference in the proportion of defective bearings produced 
under the two processes from Equation 10-41 as follows: 


fae =Pi) . Pal — py) 


ny; Ny 


EXERCISES FOR SECTION 10-6 


or 


‘ieee 1.96 | 2088) _ 0.09(0.91) 
. . . 85 T 85 


0.12(0.88) 0.09(0.91) 


=p) — p2 =0.12 — 0.09 + 1. 


This simplifies to 
—0.06 = p; — pz = 0.12 


Practical Interpretation: This confidence interval includes 
zero, so, based on the sample data, it seems unlikely that the 
changes made in the surface finish process have reduced the pro- 
portion of defective crankshaft bearings being produced. 


10-66. Consider the computer output below. 


Test and Cl for Two Proportions 


Sample X N Sample p 

iL 54 250 0.216000 

2 60 290 0.206897 
Difference = p (1) — p (2) 
Estimate for difference: 0.00910345 
95% CI for difference: (—0.0600031, 
0.0782100) 
Test for difference = 0 (vs not = 0): 


4 = ? P-Value = ? 
(a) Is this a one-sided or a two-sided test? 
(b) Fill in the missing values. 
(c) Can the null hypothesis be rejected? 
(d) Construct an approximate 90% CI for the difference in the 
two proportions. 


10-67. 
Test and CI for Two Proportions 


Consider the computer output below. 


Sample X N Sample p 
1 188 250 0.752000 
2 245 350 0.700000 


Difference = p (1) — p (2) 
Estimate for difference: 0.052 
95% lower bound for difference: ? 
Test for difference = 0 (vs > 0) 
Z = ? P-Value = ? 


(a) Is this a one-sided or a two-sided test? 

(b) Fill in the missing values 

(c) Can the null hypothesis be rejected if « = 0.10? What if 
a = 0.05? 
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10-68. An article in Knee Surgery, Sports Traumatology, 
Arthroscopy (2005, Vol. 13, pp. 273-279), considered arthro- 
scopic meniscal repair with an absorbable screw. Results 
showed that for tears greater than 25 millimeters, 14 of 18 
(78%) repairs were successful while for shorter tears, 22 of 30 
(73%) repairs were successful. 

(a) Is there evidence that the success rate is greater for longer 
tears? Use a = 0.05. What is the P-value? 

(b) Calculate a one-sided 95% confidence bound on the dif- 
ference in proportions that can be used to answer the 
question in part (a). 

10-69. In the 2004 presidential election, exit polls from the 

critical state of Ohio provided the following results: For re- 

spondents with college degrees, 53% voted for Bush and 46% 

voted for Kerry. There were 2020 respondents. 

(a) Is there a significant difference in these proportions? Use 
a = 0.05. What is the P-value? 

(b) Calculate a 95% confidence interval for the difference in 
the two proportions and comment on the use of this inter- 
val to answer the question in part (a). 

10-70. Two different types of injection-molding machines 

are used to form plastic parts. A part is considered defective if 

it has excessive shrinkage or is discolored. Two random sam- 
ples, each of size 300, are selected, and 15 defective parts are 
found in the sample from machine 1 while 8 defective parts 

are found in the sample from machine 2. 

(a) Is it reasonable to conclude that both machines produce 
the same fraction of defective parts, using a = 0.05? Find 
the P-value for this test. 

(b) Construct a 95% confidence interval on the difference in 
the two fractions defective. 

(c) Suppose that p; = 0.05 and p, = 0.01. With the sample 
sizes given here, what is the power of the test for this two- 
sided alternate? 
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(d) Suppose that p,; = 0.05 and pz = 0.01. Determine the 
sample size needed to detect this difference with a proba- 
bility of at least 0.9. 

(e) Suppose that p; = 0.05 and p = 0.02. With the sample 
sizes given here, what is the power of the test for this two- 
sided alternate? 

(f) Suppose that p,; = 0.05 and p, = 0.02. Determine the 
sample size needed to detect this difference with a proba- 
bility of at least 0.9. 

10-71. Two different types of polishing solutions are being 

evaluated for possible use in a tumble-polish operation for 

manufacturing interocular lenses used in the human eye fol- 
lowing cataract surgery. Three hundred lenses were tumble 
polished using the first polishing solution, and of this number 

253 had no polishing-induced defects. Another 300 lenses 

were tumble-polished using the second polishing solution, and 

196 lenses were satisfactory upon completion. 

(a) Is there any reason to believe that the two polishing solutions 
differ? Use a = 0.01. What is the P-value for this test? 

(b) Discuss how this question could be answered with a 
confidence interval on py — p>. 


10-72. A random sample of 500 adult residents of 
Maricopa County found that 385 were in favor of increasing 
the highway speed limit to 75 mph, while another sample of 
400 adult residents of Pima County found that 267 were in 
favor of the increased speed limit. 

(a) Do these data indicate that there is a difference in the sup- 
port for increasing the speed limit between the residents 
of the two counties? Use a = 0.05. What is the P-value 
for this test? 

(b) Construct a 95% confidence interval on the difference in 
the two proportions. Provide a practical interpretation of 
this interval. 


10-7 SUMMARY TABLE AND ROADMAP FOR INFERENCE 


PROCEDURES FOR TWO SAMPLES 


The table in the end papers of the book summarizes all of the two-sample parametric inference 
procedures given in this chapter. The table contains the null hypothesis statements, the test sta- 
tistics, the criteria for rejection of the various alternative hypotheses, and the formulas for con- 
structing the 100(1 — a)% confidence intervals. 

The roadmap to select the appropriate parametric confidence interval formula or hypoth- 
esis test method for one-sample problems was presented in Table 8-1. In Table 10-5, we extend 
the road map to two-sample problems. The primary comments stated previously also apply 
here (except we usually apply conclusions to a function of the parameters from each sample, 


such as the difference in means): 


1. Determine the function of the parameters (and the distribution of the data) that is to 
be bounded by the confidence interval or tested by the hypothesis. 


2. Check if other parameters are known or need to be estimated (and if any assumptions 


are made). 
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Table 10-5 Roadmap to Construct Confidence Intervals and Hypothesis Tests, Two-Sample Case 
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Function of the 
Parameters to be 


Bounded by the 
Confidence Interval Confidence Hypothesis 
or Tested with a Interval Test 
Hypothesis Symbol Other Parameters? Section Section Comments 
Difference in means [ey — bo Standard deviations 10-1.3 10-1.1 
from two normal o, and o» known 
distributions 
Difference in means y—B2 Sample sizes large 10-1.3 10-1.1 Large sample size is 
from two arbitrary enough that o, and o often taken to be n, 
distributions with large are essentially known and n, = 40 
sample sizes 
Difference in means Hi B2 Standard deviations o; 10-2.3 10-2.1 Case 1: 0; = 09 
from two normal and o» are unknown, 
distributions and assumed equal 
Difference in means Bi p2 10-3 The Wilcoxon 
from two symmetric rank-sum test is a 
distributions nonparametric 
procedure 
Difference in means ey B2 Standard deviations 10-2.3 10-2.1 Case 2: 0; # 05 
from two normal o, and o> are 
distributions unknown, and NOT 
assumed equal 
Difference in means Kp =Pi-b2 Standard deviation 10-4 10-4 Paired analysis 
from two normal of differences are calculates differences 
distributions in a paired unknown and uses a one-sample 
analysis method for inference 
on the mean difference 
Ratio of variances of two 07/03 Means , and p2 10-5.4 10-5.2 
normal distributions unknown and 
estimated 
Difference in two Pi~P2 None 10-6.3 10-6.1 Normal approximation 
population proportions to the binomial 
distribution used 
for the tests and 
confidence intervals 
Supplemental Exercises (a) Fillin the missing values. You may use bounds for the P-value. 
(b) Is this a two-sided test or a one-sided test? 
10-73. Consider the computer output below. (c) What are your conclusions if « = 0.05? What if « = 0.10? 
Two-Sample T-Test and Cl . 
10-74. Consider the computer output below. 
Sample N Mean StDev SE Mean 
1 20 11.87 2.23 ? Two-Sample T-Test CI 
2 20 12.73 3.19 0.71 Sample N Mean StDev SE Mean 
Difference = mu — mu (2) 1 16 22.45 2.98 0.75 
Estimate for difference: —0.860 2 25 24.61 5.36 1.1 
95% CI for difference: (?, ?) Difference = mu (1) — mu (2) 
T-Test of difference = O(vs not =) Estimate for difference: —2.16 
T-Value = T-Test of difference (vs <): 


? P-Value = ? DF = ? 
od 


Both use Pooled StDev = 


T-Value = 


—1.65 P-Value = ? DF = ? 
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(a) Is this a one-sided or a two-sided test? 

(b) Fill in the missing values. You may use bounds for the 
P-value. 

(c) What are your conclusions if a = 0.05? What if a = 0.10? 

(d) Find a 95% upper-confidence bound on the difference in 
the two means. 


10-75. An article in the Journal of Materials Engineering 
(1989, Vol. 11, No. 4, pp. 275-282) reported the results of an 
experiment to determine failure mechanisms for plasma- 
sprayed thermal barrier coatings. The failure stress for one 
particular coating (NiCrAIZr) under two different test condi- 
tions is as follows: 


Failure stress (X 10° Pa) after nine 1-hour cycles: 19.8, 
18.5, 17.6, 16.7, 16.7, 14.8, 15.4, 14.1, 13.6 


Failure stress (X 10° Pa) after six 1-hour cycles: 14.9, 
12.7, 11.9, 11.4, 10.1, 7.9 


(a) What assumptions are needed to construct confidence in- 
tervals for the difference in mean failure stress under the 
two different test conditions? Use normal probability plots 
of the data to check these assumptions. 

(b) Find a 99% confidence interval on the difference in mean 
failure stress under the two different test conditions. 

(c) Using the confidence interval constructed in part (b), does 
the evidence support the claim that the first test conditions 
yield higher results, on the average, than the second? 
Explain your answer. 

(d) Construct a 95% confidence interval on the ratio of the 
variances, 07/05, of failure stress under the two different 
test conditions. 

(e) Use your answer in part (b) to determine whether there is 
a significant difference in variances of the two different 
test conditions. Explain your answer. 

10-76. A procurement specialist has purchased 25 resistors 

from vendor 1 and 35 resistors from vendor 2. Each resistor’s 

resistance is measured with the following results: 


Vendor 1 
96.8 100.0 100.3 98.5 98.3 98.2 
99.6 99.4 99.9 101.1 103.7 97.7 
99.7 101.1 97.7 98.6 101.9 101.0 
99.4 99.8 99.1 99.6 101.2 98.2 
98.6 

Vendor 2 
106.8 106.8 104.7 104.7 108.0 102.2 
103.2 103.7 106.8 105.1 104.0 106.2 
102.6 100.3 104.0 107.0 104.3 105.8 
104.0 106.3 102.2 102.8 104.2 103.4 
104.6 103.5 106.3 109.2 107.2 105.4 
106.4 106.8 104.1 107.1 107.7 


(a) What distributional assumption is needed to test the claim 
that the variance of resistance of product from vendor 1 is 
not significantly different from the variance of resistance 
of product from vendor 2? Perform a graphical procedure 
to check this assumption. 

(b) Perform an appropriate statistical hypothesis-testing pro- 
cedure to determine whether the procurement specialist 
can claim that the variance of resistance of product from 
vendor 1 is significantly different from the variance of 
resistance of product from vendor 2. 


10-77. A liquid dietary product implies in its advertising 
that use of the product for one month results in an average 
weight loss of at least 3 pounds. Eight subjects use the product 
for one month, and the resulting weight loss data are reported 
below. Use hypothesis-testing procedures to answer the fol- 
lowing questions. 


Initial Final 
Subject Weight (Ib) Weight (Ib) 
1 165 161 
2 201 195 
3 195 192 
4 198 193 
5 155 150 
6 143 141 
7 150 146 
8 187 183 


(a) Do the data support the claim of the producer of the dietary 
product with the probability of a type I error set to 0.05? 

(b) Do the data support the claim of the producer of the dietary 
product with the probability of a type I error set to 0.01? 

(c) In an effort to improve sales, the producer is considering 
changing its claim from “at least 3 pounds” to “at least 5 
pounds.” Repeat parts (a) and (b) to test this new claim. 


10-78. The breaking strength of yarn supplied by two man- 

ufacturers is being investigated. We know from experience 

with the manufacturers’ processes that 0, = 5 psi and 0, = 

4 psi. A random sample of 20 test specimens from each manu- 

facturer results in x; = 88 psi and x, = 91 psi, respectively. 

(a) Using a 90% confidence interval on the difference in 
mean breaking strength, comment on whether or not there 
is evidence to support the claim that manufacturer 2 
produces yarn with higher mean breaking strength. 

(b) Using a 98% confidence interval on the difference in 
mean breaking strength, comment on whether or not there 
is evidence to support the claim that manufacturer 2 
produces yarn with higher mean breaking strength. 

(c) Comment on why the results from parts (a) and (b) are dif- 
ferent or the same. Which would you choose to make your 
decision and why? 
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10-79. The Salk polio vaccine experiment in 1954 focused 
on the effectiveness of the vaccine in combating paralytic 
polio. Because it was felt that without a control group of 
children there would be no sound basis for evaluating the 
efficacy of the Salk vaccine, the vaccine was administered to 
one group, and a placebo (visually identical to the vaccine 
but known to have no effect) was administered to a second 
group. For ethical reasons, and because it was suspected that 
knowledge of vaccine administration would affect subse- 
quent diagnoses, the experiment was conducted in a double- 
blind fashion. That is, neither the subjects nor the administrators 
knew who received the vaccine and who received the placebo. 
The actual data for this experiment are as follows: 


Placebo group: n = 201,299: 110 cases of polio observed 
Vaccine group: n = 200,745: 33 cases of polio observed 


(a) Use a hypothesis-testing procedure to determine if the 
proportion of children in the two groups who contracted 
paralytic polio is statistically different. Use a probability 
of a type I error equal to 0.05. 

(b) Repeat part (a) using a probability of a type I error equal 
to 0.01. 

(c) Compare your conclusions from parts (a) and (b) and 
explain why they are the same or different. 

10-80. Consider Supplemental Exercise 10-78. Suppose that 

prior to collecting the data, you decide that you want the error in 

estimating J4, — by by x; — X, to be less than 1.5 psi. Specify 
the sample size for the following percentage confidence: 

(a) 90% 

(b) 98% 

(c) Comment on the effect of increasing the percentage confi- 
dence on the sample size needed. 

(d) Repeat parts (a)-(c) with an error of less than 0.75 psi 
instead of 1.5 psi. 

(e) Comment on the effect of decreasing the error on the 
sample size needed. 


10-81. A random sample of 1500 residential telephones in 
Phoenix in 1990 found that 387 of the numbers were unlisted. 
A random sample in the same year of 1200 telephones in 
Scottsdale found that 310 were unlisted. 

(a) Find a 95% confidence interval on the difference in the 

two proportions and use this confidence interval to deter- 

mine if there is a statistically significant difference in 
proportions of unlisted numbers between the two cities. 

Find a 90% confidence interval on the difference in the 

two proportions and use this confidence interval to deter- 

mine if there is a statistically significant difference in pro- 
portions of unlisted numbers between the two cities. 

(c) Suppose that all the numbers in the problem description 
were doubled. That is, 774 residents out of 3000 sampled 
in Phoenix and 620 residents out of 2400 in Scottsdale had 
unlisted phone numbers. Repeat parts (a) and (b) and com- 
ment on the effect of increasing the sample size without 
changing the proportions on your results. 


(b 


YS 


10-82. Ina random sample of 200 Phoenix residents who 

drive a domestic car, 165 reported wearing their seat belt regu- 

larly, while another sample of 250 Phoenix residents who drive 

a foreign car revealed 198 who regularly wore their seat belt. 

(a) Perform a hypothesis-testing procedure to determine if 
there is a statistically significant difference in seat belt us- 
age between domestic and foreign car drivers. Set your 
probability of a type I error to 0.05. 

(b) Perform a hypothesis-testing procedure to determine if 
there is a statistically significant difference in seat belt 
usage between domestic and foreign car drivers. Set your 
probability of a type I error to 0.1. 

(c) Compare your answers for parts (a) and (b) and explain 
why they are the same or different. 

(d) Suppose that all the numbers in the problem description 
were doubled. That is, in a random sample of 400 Phoenix 
residents who drive a domestic car, 330 reported wearing 
their seat belt regularly, while another sample of 500 
Phoenix residents who drive a foreign car revealed 396 
who regularly wore their seat belt. Repeat parts (a) and (b) 
and comment on the effect of increasing the sample size 
without changing the proportions on your results. 


10-83. Consider the previous exercise, which summarized 

data collected from drivers about their seat belt usage. 

(a) Do you think there is a reason not to believe these data? 
Explain your answer. 

(b) Is it reasonable to use the hypothesis-testing results from 
the previous problem to draw an inference about the dif- 
ference in proportion of seat belt usage 

(i) of the spouses of these drivers of domestic and foreign 
cars? Explain your answer. 
(ii) of the children of these drivers of domestic and foreign 
cars? Explain your answer. 
(iii) of all drivers of domestic and foreign cars? Explain 
your answer. 
(iv) of all drivers of domestic and foreign trucks? Explain 
your answer. 


10-84. A manufacturer of a new pain relief tablet would 
like to demonstrate that its product works twice as fast as the 
competitor’s product. Specifically, the manufacturer would 
like to test 


Ay: py = 2p9 
Ay: wy > 2p2 


where 2, is the mean absorption time of the competitive prod- 
uct and pt, is the mean absorption time of the new product. 
Assuming that the variances oj and o3 are known, develop a 
procedure for testing this hypothesis. 


10-85. Two machines are used to fill plastic bottles with 
dishwashing detergent. The standard deviations of fill volume 
are known to be o, = 0.10 fluid ounces and oy = 0.15 fluid 
ounces for the two machines, respectively. Two random samples 
of n, = 12 bottles from machine | and n, = 10 bottles from 
machine 2 are selected, and the sample mean fill volumes are 
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X, = 30.87 fluid ounces and x, = 30.68 fluid ounces. Assume 

normality. 

(a) Construct a 90% two-sided confidence interval on the 
mean difference in fill volume. Interpret this interval. 

(b) Construct a 95% two-sided confidence interval on the mean 
difference in fill volume. Compare and comment on the 
width of this interval to the width of the interval in part (a). 

(c) Construct a 95% upper-confidence interval on the mean 
difference in fill volume. Interpret this interval. 

(d) Test the hypothesis that both machines fill to the same 
mean volume. Use a = 0.05. What is the P-value? 

(e) If the B-error of the test when the true difference in fill 
volume is 0.2 fluid ounces should not exceed 0.1, what 
sample sizes must be used? Use a = 0.05. 

10-86. Suppose that we are testing Hp: w, = pb versus 

H,: w, # pW», and we plan to use equal sample sizes from the 

two populations. Both populations are assumed to be normal 

with unknown but equal variances. If we use a = 0.05 and if 
the true mean 1, = f. + o, what sample size must be used for 

the power of this test to be at least 0.90? 

10-87. Consider the situation described in Exercise 10-71. 

(a) Redefine the parameters of interest to be the proportion 
of lenses that are unsatisfactory following tumble polishing 
with polishing fluids 1 or 2. Test the hypothesis that the two 
polishing solutions give different results using a = 0.01. 

(b) Compare your answer in part (a) with that for Exercise 
10-71. Explain why they are the same or different. 

(c) We wish to use a = 0.01. Suppose that if p; = 0.9 and 
Pz = 0.6, we wish to detect this with a high probability, say, 
at least 0.9. What sample sizes are required to meet this 
objective? 

10-88. Consider the fire-fighting foam expanding agents 

investigated in Exercise 10-16, in which five observations of 

each agent were recorded. Suppose that, if agent 1 produces a 

mean expansion that differs from the mean expansion of agent 

1 by 1.5, we would like to reject the null hypothesis with prob- 

ability at least 0.95. 

(a) What sample size is required? 

(b) Do you think that the original sample size in Exercise 
10-16 was appropriate to detect this difference? Explain 
your answer. 

10-89. A fuel-economy study was conducted for two 

German automobiles, Mercedes and Volkswagen. One vehicle 

of each brand was selected, and the mileage performance was 

observed for 10 tanks of fuel in each car. The data are as fol- 
lows (in miles per gallon): 


Mercedes Volkswagen 
24.7 24.9 41.7 42.8 
24.8 24.6 42.3 42.4 
24.9 23.9 41.6 39.9 
24.7 24.9 39.5 40.8 
24.5 24.8 41.9 29.6 
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(a) Construct a normal probability plot of each of the data 
sets. Based on these plots, is it reasonable to assume that 
they are each drawn from a normal population? 

(b) Suppose that it was determined that the lowest observa- 
tion of the Mercedes data was erroneously recorded and 
should be 24.6. Furthermore, the lowest observation of the 
Volkswagen data was also mistaken and should be 39.6. 
Again construct normal probability plots of each of the 
data sets with the corrected values. Based on these new 
plots, is it reasonable to assume that they are each drawn 
from a normal population? 

(c) Compare your answers from parts (a) and (b) and com- 
ment on the effect of these mistaken observations on the 
normality assumption. 

(d) Using the corrected data from part (b) and a 95% confi- 
dence interval, is there evidence to support the claim that 
the variability in mileage performance is greater for a 
Volkswagen than for a Mercedes? 

(e) Rework part (d) of this problem using an appropriate 
hypothesis-testing procedure. Did you get the same an- 
swer as you did originally? Why? 

10-90. Anexperiment was conducted to compare the filling 

capability of packaging equipment at two different wineries. 

Ten bottles of pinot noir from Ridgecrest Vineyards were ran- 

domly selected and measured, along with 10 bottles of pinot 

noir from Valley View Vineyards. The data are as follows (fill 
volume is in milliliters): 


Ridgecrest Valley View 
1S Wil I2 753 756 754 757 756 
753 753 753 754 755 756 756 755 
We Tayi 755 756 


(a) What assumptions are necessary to perform a hypothesis- 
testing procedure for equality of means of these data? 
Check these assumptions. 

(b) Perform the appropriate hypothesis-testing procedure to 
determine whether the data support the claim that both 
wineries will fill bottles to the same mean volume. 

(c) Suppose that the true difference in mean fill volume is as 
much as 2 fluid ounces; did the sample sizes of 10 from 
each vineyard provide good detection capability when 
a = 0.05? Explain your answer. 

10-91. A Rockwell hardness-testing machine presses a tip 

into a test coupon and uses the depth of the resulting depres- 

sion to indicate hardness. Two different tips are being com- 
pared to determine whether they provide the same Rockwell 

C-scale hardness readings. Nine coupons are tested, with both 

tips being tested on each coupon. The data are shown in the 

accompanying table. 

(a) State any assumptions necessary to test the claim that both 
tips produce the same Rockwell C-scale hardness readings. 
Check those assumptions for which you have the information. 


10-7 SUMMARY TABLE AND ROADMAPS FOR INFERENCE PROCEDURES FOR TWO SAMPLES 


Coupon Tipl Tip2 Coupon Tip! Tip2 
1 47 46 6 41 41 
2 42 40 7 45 46 
3 43 45 8 45 46 
4 40 41 9 49 48 
3) 42 43 


(b) Apply an appropriate statistical method to determine if the 
data support the claim that the difference in Rockwell 
C-scale hardness readings of the two tips is significantly 
different from zero. 

(c) Suppose that if the two tips differ in mean hardness read- 
ings by as much as 1.0, we want the power of the test to be 
at least 0.9. For an a = 0.01, how many coupons should 
have been used in the test? 


10-92. Two different gauges can be used to measure the depth 
of bath material in a Hall cell used in smelting aluminum. Each 
gauge is used once in 15 cells by the same operator. 


Cell Gaugel Gauge2 Cell Gaugel Gauge2 

1 46 in. 47 in. 9 52 51 
2 50 53 10 47 45 
3 47 45 11 49 51 
4 53 50 12 45 45 
5 49 51 13 47 49 
6 48 48 14 46 43 
7 53 54 15 50 51 
8 56 53 
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(a) State any assumptions necessary to test the claim that both 
gauges produce the same mean bath depth readings. Check 
those assumptions for which you have the information. 

(b) Apply an appropriate statistical procedure to determine if 
the data support the claim that the two gauges produce dif- 
ferent mean bath depth readings. 

(c) Suppose that if the two gauges differ in mean bath depth 
readings by as much as 1.65 inch, we want the power of 
the test to be at least 0.8. For a = 0.01, how many cells 
should have been used? 


10-93. An article in the Journal of the Environmental 
Engineering Division [“Distribution of Toxic Substances in 
Rivers” (1982, Vol. 108, pp. 639-—649)] investigated the concen- 
tration of several hydrophobic organic substances in the Wolf 
River in Tennessee. Measurements on hexachlorobenzene (HCB) 
in nanograms per liter were taken at different depths downstream 
of an abandoned dump site. Data for two depths follow: 


Surface: 3.74, 4.61, 4.00, 4.67, 4.87, 5.12, 4.52, 5.29, 5.74, 5.48 
Bottom: 5.44, 6.88, 5.37, 5.44, 5.03, 6.48, 3.89, 5.85, 6.85, 7.16 


(a) What assumptions are required to test the claim that mean 
HCB concentration is the same at both depths? Check 
those assumptions for which you have the information. 

(b) Apply an appropriate procedure to determine if the data 
support the claim in part a. 

(c) Suppose that the true difference in mean concentrations 
is 2.0 nanograms per liter. For « = 0.05, what is the power 
of a statistical test for Hp: ; = pb. versus Hy: wy #7 py? 

(d) What sample size would be required to detect a difference 
of 1.0 nanograms per liter at a = 0.05 if the power must 
be at least 0.9? 


MIND-EXPANDING EXERCISES 


10-94. Three different pesticides can be used to control 
infestation of grapes. It is suspected that pesticide 3 is 
more effective than the other two. In a particular vineyard, 
three different plantings of pinot noir grapes are selected 
for study. The following results on yield are obtained: 


(Bushels/ eee of 
Pesticide Plant) S; Plants) 
1 4.6 0.7 100 
2) ey 0.6 120 
3 6.1 0.8 130 


If p; is the true mean yield after treatment with the ith 
pesticide, we are interested in the quantity 


w= = (Wi + Wo) — Bs 


N[e 


which measures the difference in mean yields be- 

tween pesticides 1 and 2 and pesticide 3. If the sam- 

ple sizes n; are large, the estimator (say, f.) obtained 

by replacing each individual 1; by X; is approximately 

normal. 

(a) Find an approximate 100(1 — a)% large-sample 
confidence interval for jw. 
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MIND-EXPANDING EXERCISES 


(b) Do these data support the claim that pesticide 3 is 
more effective than the other two? Use a = 0.05 in 
determining your answer. 


10-95. Suppose that we wish to test Ho: 4; = po Versus 
Hy: w, % po, where of and o3 are known. The total sample 
size N is to be determined, and the allocation of observa- 
tions to the two populations such that n, + n, = Nis to be 
made on the basis of cost. If the cost of sampling for pop- 
ulations 1 and 2 are C, and C,, respectively, find the mini- 
mum cost sample sizes that provide a specified variance 
for the difference in sample means. 


10-96. Suppose that we wish to test the hypothesis Ho: 
Mt; = py versus Hj: w; ~ py, where both variances oj; and 
o3 are known. A total of n; + nm) = N observations can be 
taken. How should these observations be allocated to the 
two populations to maximize the probability that Hy will 
be rejected if H, is true and p, — pp = AA0? 

10-97. Suppose that we wish to test Hp: w = [yo Versus 
H: & # [o, where the population is normal with known 
o. Let 0 < € <a, and define the critical region so that we 
will reject Hy if z) > z, or if 7% < —Zy—¢, Where Zp is the 
value of the usual test statistic for these hypotheses. 

(a) Show that the probability of type I error for this test 

is a. 


(b) Suppose that the true mean is p; = Wo + A. Derive 
an expression for B for the above test. 


10-98. Construct a data set for which the paired ¢-test 
Statistic is very large, indicating that when this analysis 
is used the two population means are different, but fy for 
the two-sample ¢-test is very small so that the incorrect 
analysis would indicate that there is no significant dif- 
ference between the means. 


10-99. In some situations involving proportions, we 

are interested in the ratio 8 = p,/p, rather than the differ- 

ence p; — p>. Let 6 = p,/ >. We can show that In(6) has 

an approximate normal distribution with the mean (n/@) 

and variance [(n, — x1)/(mx,) + (my — x2)/(nox»)]!/?. 

(a) Use the information above to derive a large-sample 
confidence interval for In 0. 

(b) Show how to find a large-sample CI for 0. 

(c) Use the data from the St. John’s Wort study in 
Example 10-15, and find a 95% CI on 0 = p,/po. 
Provide a practical interpretation for this CI. 

10-100. Derive an expression for 8 for the test of the 

equality of the variances of two normal distributions. 

Assume that the two-sided alternative is specified. 
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The space shuttle Challenger accident in January 1986 was the result of the failure of 
O-rings used to seal field joints in the solid rocket motor due to the extremely low ambi- 
ent temperatures at the time of launch. Prior to the launch there were data on the occur- 
rence of O-ring failure and the corresponding temperature on 24 prior launches or static 
firings of the motor. In this chapter we will see how to build a statistical model relating the 
probability of O-ring failure to temperature. This model provides a measure of the risk as- 
sociated with launching the shuttle at the low temperature occurring when Challenger 
was launched. 
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LEARNING OBJECTIVES 


After careful study of this chapter, you should be able to do the following: 

1. Use simple linear regression for building empirical models to engineering and scientific data 

2. Understand how the method of least squares is used to estimate the parameters in a linear 
regression model 


3. Analyze residuals to determine if the regression model is an adequate fit to the data or to see if 
any underlying assumptions are violated 


4. Test statistical hypotheses and construct confidence intervals on regression model parameters 


5. Use the regression model to make a prediction of a future observation and construct an 
appropriate prediction interval on the future observation 


6. Apply the correlation model 


7. Use simple transformations to achieve a linear regression model 


11-1 EMPIRICAL MODELS 


Many problems in engineering and the sciences involve a study or analysis of the relationship 
between two or more variables. For example, the pressure of a gas in a container is related to 
the temperature, the velocity of water in an open channel is related to the width of the chan- 
nel, and the displacement of a particle at a certain time is related to its velocity. In this last ex- 
ample, if we let dy be the displacement of the particle from the origin at time ¢ = 0 and v be the 
velocity, then the displacement at time ¢ is d, = dy + vt. This is an example of a deterministic 
linear relationship, because (apart from measurement errors) the model predicts displacement 
perfectly. 

However, there are many situations where the relationship between variables is not deter- 
ministic. For example, the electrical energy consumption of a house (y) is related to the size 
of the house (x, in square feet), but it is unlikely to be a deterministic relationship. Similarly, 
the fuel usage of an automobile (y) is related to the vehicle weight x, but the relationship is not 
a deterministic one. In both of these examples the value of the response of interest y (energy 
consumption, fuel usage) cannot be predicted perfectly from knowledge of the corresponding x. 
It is possible for different automobiles to have different fuel usage even if they weigh the same, 
and it is possible for different houses to use different amounts of electricity even if they are the 
same Size. 

The collection of statistical tools that are used to model and explore relationships be- 
tween variables that are related in a nondeterministic manner is called regression analysis. 
Because problems of this type occur so frequently in many branches of engineering and sci- 
ence, regression analysis is one of the most widely used statistical tools. In this chapter we 
present the situation where there is only one independent or predictor variable x and the re- 
lationship with the response y is assumed to be linear. While this seems to be a simple sce- 
nario, there are many practical problems that fall into this framework. 

For example, in a chemical process, suppose that the yield of the product is related to 
the process-operating temperature. Regression analysis can be used to build a model to 
predict yield at a given temperature level. This model can also be used for process opti- 
mization, such as finding the level of temperature that maximizes yield, or for process 
control purposes. 
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As an illustration, consider the data in Table 11-1. In this table y is the purity of oxygen 
produced in a chemical distillation process, and x is the percentage of hydrocarbons that are 
present in the main condenser of the distillation unit. Figure 11-1 presents a scatter diagram 
of the data in Table 11-1. This is just a graph on which each (x;, y,) pair is represented as a point 
plotted in a two-dimensional coordinate system. This scatter diagram was produced by 
Minitab, and we selected an option that shows dot diagrams of the x and y variables along the 
top and right margins of the graph, respectively, making it easy to see the distributions of the 
individual variables (box plots or histograms could also be selected). Inspection of this scatter 
diagram indicates that, although no simple curve will pass exactly through all the points, there 
is a strong indication that the points lie scattered randomly around a straight line. Therefore, it 
is probably reasonable to assume that the mean of the random variable Y is related to x by the 
following straight-line relationship: 


E(Y|x) = py|x = Bo + Bix 


where the slope and intercept of the line are called regression coefficients. While the mean of 
Y is a linear function of x, the actual observed value y does not fall exactly on a straight line. 
The appropriate way to generalize this to a probabilistic linear model is to assume that the 
expected value of Y is a linear function of x, but that for a fixed value of x the actual value of Y 
is determined by the mean value function (the linear model) plus a random error term, say, 


Y=£6o+ Bux te (11-1) 
Table 11-1 Oxygen and Hydrocarbon Levels 
Observation Hydrocarbon Level Purity 
Number x(%) y(%) 
1 0.99 90.01 
2 1.02 89.05 
3 1.15 91.43 
4 1.29 93.74 
5 1.46 96.73 : t . : 
6 1.36 94.45 ini e e e e e e e e e e e 
7 0.87 87.59 el; 
8 1.23 91.77 98 
9 1.55 99.42 e ° 
10 1.40 93.65 ae “i 
iil 1.19 93.54 SS oa s abe 3 ae 
12 1.15 92.52 £ “ ° . 
13 0.98 9056 © . - 
14 1.01 89.54 90 e, ; . meewe 
15 1.11 89.85 : ° 
16 1.20 90.39 a ar . 
17 1.26 CB) 5) 86 
18 1.32 93.41 0.85 0.95 1.05 1.15 125° 1,85 1.45 1.55 
19 1.43 94.98 Hydrocarbon level (x) 
20 0.95 87.33 Figure 11-1 Scatter diagram of oxygen purity versus hydrocarbon 


level from Table 11-1. 
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where € is the random error term. We will call this model the simple linear regression model, 
because it has only one independent variable or regressor. Sometimes a model like this will 
arise from a theoretical relationship. At other times, we will have no theoretical knowledge of 
the relationship between x and y, and the choice of the model is based on inspection of a scatter 
diagram, such as we did with the oxygen purity data. We then think of the regression model as 
an empirical model. 

To gain more insight into this model, suppose that we can fix the value of x and observe 
the value of the random variable Y. Now if x is fixed, the random component € on the right- 
hand side of the model in Equation 11-1 determines the properties of Y. Suppose that the mean 
and variance of € are 0 and o”, respectively. Then, 


E(Y|x) = E(Bo + Bix + €) = Bo + Bix + Ele) = Bo + Bix 


Notice that this is the same relationship that we initially wrote down empirically from inspection 
of the scatter diagram in Fig. 11-1. The variance of Y given x is 


V(Y|x) = V(Bo + Bux + €) = V(Bo + Bix) + Vie) =O + 0 =e? 


Thus, the true regression model wy|, = Bo + Bx is a line of mean values; that is, the height 
of the regression line at any value of x is just the expected value of Y for that x. The slope, B,, 
can be interpreted as the change in the mean of Y for a unit change in x. Furthermore, the 
variability of Y at a particular value of x is determined by the error variance o*. This implies 
that there is a distribution of Y-values at each x and that the variance of this distribution is the 
same at each x. 

For example, suppose that the true regression model relating oxygen purity to hydrocarbon 
level is wy), = 75 + 15x, and suppose that the variance is o* = 2. Figure 11-2 illustrates this 
situation. Notice that we have used a normal distribution to describe the random variation 
in €. Since Y is the sum of a constant By + B,x (the mean) and a normally distributed 
random variable, Y is a normally distributed random variable. The variance o” determines 
the variability in the observations Y on oxygen purity. Thus, when o” is small, the observed 
values of Y will fall close to the line, and when o” is large, the observed values of Y may 
deviate considerably from the line. Because o” is constant, the variability in Y at any value 
of x is the same. 

The regression model describes the relationship between oxygen purity Y and hydrocar- 
bon level x. Thus, for any value of hydrocarbon level, oxygen purity has a normal distribution 


y 
(Oxygen 
purity) 
Bo + By (1.25) True regression line 
Hy|x =Bot Bix 
=75+15x 
Bo + By (1.00) 
<= 1,00 x=1.25 x (Hydrocarbon level) 


Figure 11-2 The distribution of Y for a given value of x for the 
oxygen purity—hydrocarbon data. 
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with mean 75 + 15x and variance 2. For example, if x = 1.25, Y has mean value py), = 75 + 
15(1.25) = 93.75 and variance 2. 

In most real-world problems, the values of the intercept and slope (Bp, 8,) and the error 
variance o” will not be known, and they must be estimated from sample data. Then this fitted 
regression equation or model is typically used in prediction of future observations of Y, or for 
estimating the mean response at a particular level of x. To illustrate, a chemical engineer might 
be interested in estimating the mean purity of oxygen produced when the hydrocarbon level is 
x = 1.25%. This chapter discusses such procedures and applications for the simple linear 
regression model. Chapter 12 will discuss multiple linear regression models that involve more 
than one regressor. 


Historical Note 

Sir Francis Galton first used the term regression analysis in a study of the heights of fathers (x) 
and sons (y). Galton fit a least squares line and used it to predict the son’s height from the 
father’s height. He found that if a father’s height was above average, the son’s height would also 
be above average, but not by as much as the father’s height was. A similar effect was observed 
for below average heights. That is, the son’s height “regressed” toward the average. Consequently, 
Galton referred to the least squares line as a regression line. 


Abuses of Regression 

Regression is widely used and frequently misused; several common abuses of regression are 
briefly mentioned here. Care should be taken in selecting variables with which to construct 
regression equations and in determining the form of the model. It is possible to develop sta- 
tistically significant relationships among variables that are completely unrelated in a causal 
sense. For example, we might attempt to relate the shear strength of spot welds with the num- 
ber of empty parking spaces in the visitor parking lot. A straight line may even appear to pro- 
vide a good fit to the data, but the relationship is an unreasonable one on which to rely. You 
can’t increase the weld strength by blocking off parking spaces. A strong observed association 
between variables does not necessarily imply that a causal relationship exists between those 
variables. This type of effect is encountered fairly often in retrospective data analysis, and 
even in observational studies. Designed experiments are the only way to determine cause- 
and-effect relationships. 

Regression relationships are valid only for values of the regressor variable within the 
range of the original data. The linear relationship that we have tentatively assumed may be 
valid over the original range of x, but it may be unlikely to remain so as we extrapolate—that 
is, if we use values of x beyond that range. In other words, as we move beyond the range of 
values of x for which data were collected, we become less certain about the validity of the 
assumed model. Regression models are not necessarily valid for extrapolation purposes. 

Now this does not mean don’ t ever extrapolate. There are many problem situations in 
science and engineering where extrapolation of a regression model is the only way to even 
approach the problem. However, there is a strong warning to be careful. A modest extrapola- 
tion may be perfectly all right in many cases, but a large extrapolation will almost never 
produce acceptable results. 


11-2. SIMPLE LINEAR REGRESSION 


The case of simple linear regression considers a single regressor variable or predictor variable 
x and a dependent or response variable Y. Suppose that the true relationship between Y and x 
is a straight line and that the observation Y at each level of x is a random variable. As noted 
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previously, the expected value of Y for each value of x is 
E(Y|x) = Bo + Bix 


where the intercept By and the slope 8, are unknown regression coefficients. We assume that 
each observation, Y, can be described by the model 


Y=By+Byxte (11-2) 


where € is a random error with mean zero and (unknown) variance o”. The random errors cor- 
responding to different observations are also assumed to be uncorrelated random variables. 

Suppose that we have n pairs of observations (x,, V1), (Xo, V2), ---» (Xn, ¥,). Figure 11-3 
shows a typical scatter plot of observed data and a candidate for the estimated regression 
line. The estimates of By) and B, should result in a line that is (in some sense) a “best fit” to 
the data. The German scientist Karl Gauss (1777-1855) proposed estimating the parameters 
8, and B, in Equation 11-2 to minimize the sum of the squares of the vertical deviations in 
Fig. 11-3. 

We call this criterion for estimating the regression coefficients the method of least 
squares. Using Equation 11-2, we may express the n observations in the sample as 


VW= Pot Bie te, i= 1,2,...,0 (11-3) 


and the sum of the squares of the deviations of the observations from the true regression 
line is 


= 
= 


L= De = 01 - Bo — Bx)? (11-4) 


The least squares estimators of By and 8, say, Bo and Bi. must satisfy 


OL Lf 2 P 

a —_ —2 po _ Xx; = 0 

Bo [80.8 a Bo — By ) 

OL Z x n 

——|, . = =2 i — Byx,;)x; = 0 (11-5) 
3B, [Gods a Bo — Byx)) 


Observed value 
Data (y) 


Estimated 
regression line 


Figure 11-3 Deviations of the data from the 
estimated regression model. 
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Simplifying these two equations yields 


Bo Dx t+ bi Six? = Dd yx; (11-6) 


Equations 11-6 are called the least squares normal equations. The solution to the normal 
equations results in the least squares estimators By and Bj. 


Least Squares 


Estimates The least squares estimates of the intercept and slope in the simple linear regression 
model are 


Bo =y— Bix (11-7) 


6, => - (11-8) 


i=1 


where y = (1/n) >)-; y; and ¥ = (1/n) D3-, x;. 


The fitted or estimated regression line is therefore 
5 = Bo + Bix (11-9) 
Note that each pair of observations satisfies the relationship 
Vi = Bo + Bix; +e, P= 1,2, 00 


where e; = y; — J; is called the residual. The residual describes the error in the fit of the 
model to the ith observation y,;. Later in this chapter we will use the residuals to provide 
information about the adequacy of the fitted model. 

Notationally, it is occasionally convenient to give special symbols to the numerator and 
denominator of Equation 11-8. Given data (x1, v1), (2, V2), +--+» Xns Vn)» let 


_(Q») 


Sex = > @ — 3% = Dx 7 (11-10) 
i=1 i=1 


and 


Sxy = » (i — Vou — x) = Dri rr (11-11) 
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EXAMPLE 11-1 Oxygen Purity 


We will fit a simple linear regression model to the oxygen 
purity data in Table 11-1. The following quantities may be 
computed: 


20 20 
n=20 Six, = 23.92 >) y, = 1,843.21 
i=1 i=1 


¥ = 1.1960 7 = 92.1605 


20 
> y? = 170,044.5321 


i=1 


20 
> x7 = 29.2892 


i=1 


20 
> x) = 2,214.6566 
i=1 


20 2 
> Xi 
= 

x? . 
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Therefore, the least squares estimates of the slope and inter- 
cept are 


a Sxy  10.17744 ee 
1S. 0.68088 ; 


and 
By = y — Bix = 92.1605 — (14.94748)1.196 = 74.2833] 


The fitted simple linear regression model (with the coefficients 
reported to three decimal places) is 


y = 74.283 + 14.947x 


This model is plotted in Fig. 11-4, along with the sample data. 

Practical Interpretation: Using the regression model, we 
would predict oxygen purity of } = 89.23% when the 
hydrocarbon level is x = 1.00%. The purity 89.23% may be 
interpreted as an estimate of the true population mean purity 
when x = 1.00%, or as an estimate of a new observation 
when x = 1.00%. These estimates are, of course, subject to 
error; that is, it is unlikely that a future observation on purity 
would be exactly 89.23% when the hydrocarbon level is 
1.00%. In subsequent sections we will see how to use confi- 
dence intervals and prediction intervals to describe the error 
in estimation from a regression model. 


Computer software programs are widely used in regression modeling. These programs 
typically carry more decimal places in the calculations. Table 11-2 shows a portion of the 
output from Minitab for this problem. The estimates Bo and B, are highlighted. In subse- 
quent sections we will provide explanations for the information provided in this computer 


20 (23.92) 
Sy, = = 29.2892 — 
XX ps 20 
= 0.68088 
and 
20 20 
ewe) 
i= i=l 
Syy = Dri 20 
(23.92)(1,843.21) 
= 2,214.6566 = 10.17744 
20 
output. 
102 
99 
X 
a 96 
2 
= 
= 
& 93 
< 
ro) 
Figure 11-4 Scatter 90 
plot of oxygen 
purity y versus 
hydrocarbon level x 87 5.87 1.07 1.27 


and regression model 
yp = 74.283 + 14.947x. % 


1.47 1.67 


Hydrocarbon level (%) 


Estimator 
of Variance 
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Table 11-2 Minitab Output for the Oxygen Purity Data in Example 11-1 


Regression Analysis 


The regression equation is 


Purity = 74.3 + 14.9 HC Level 


Predictor Coef SE Coef T P 
Constant 74.283 <P 1.593 46.62 0.000 
HC Level 14.947 ~<-B, 1.317 11.35 0.000 
S = 1.087 R-Sq = 87.7% R-Sq (adj) = 87.1% 


Analysis of Variance 


Source DF SS MS F P 
Regression 1 152.13 152.13 128.86 0.000 
Residual Error 18 21.25 <— SS; 1.18 <6? 

Total 19 173.38 


Predicted Values for New Observations 

New Obs Fit SE Fit 95.0% CI 95.0% PI 

1 89.231 0.354 (88.486, 89.975) (86.830, 91.632) 
Values of Predictors for New Observations 


New Obs HC Level 
1 1.00 


Estimating o” 

There is actually another unknown parameter in our regression model, o” (the variance of the 
error term €). The residuals e; = y; — $; are used to obtain an estimate of o*. The sum of 
squares of the residuals, often called the error sum of squares, is 


n 


SSp = 28 = Di - di? (11-12) 


i=1 


We can show that the expected value of the error sum of squares is E(SS;) = (n — 2)o’. 
Therefore an unbiased estimator of o7 is 


(11-13) 


Computing SS, using Equation 11-12 would be fairly tedious. A more convenient computing 
formula can be obtained by substituting }; = By + B,x; into Equation 11-12 and simplifying. 
The resulting computing formula is 


SSz = SSp — BiSry (11-14) 
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where SS; = D/_,(y; — y = Dj, y7 — ny? is the total sum of squares of the response 


variable y. Formulas such as this are presented in Section 11-4. The error sum of squares and 
the estimate of o” for the oxygen purity data, G” = 1.18, are highlighted in the Minitab output 


in Table 11-2. 


EXERCISES FOR SECTION 11-2 


11-1. An article in Concrete Research [Near Surface 

Characteristics of Concrete: Intrinsic Permeability” (Vol. 41, 

1989)] presented data on compressive strength x and intrinsic per- 

meability y of various concrete mixes and cures. Summary quan- 

tities aren = 14, Sy, = 572, Dy? = 23,530, Sx, = 43, Sx? = 

157.42, and > x;y; = 1697.80. Assume that the two variables 

are related according to the simple linear regression model. 

(a) Calculate the least squares estimates of the slope and intercept. 
Estimate o*. Graph the regression line. 

(b) Use the equation of the fitted line to predict what perme- 
ability would be observed when the compressive strength 
isx = 4.3. 

(c) Give a point estimate of the mean permeability when 
compressive strength is x = 3.7. 

(d) Suppose that the observed value of permeability at x = 
3.7 is y = 46.1. Calculate the value of the corresponding 
residual. 


11-2. Regression methods were used to analyze the data 
from a study investigating the relationship between roadway 
surface temperature (x) and pavement deflection (y). Summary 
quantities were n = 20, Sy; = 12.75, Sy7 = 8.86, Sx; = 
1478, Sx? = 143,215.8, and > x,y; = 1083.67. 

(a) Calculate the least squares estimates of the slope and in- 
tercept. Graph the regression line. Estimate 0. 

(b) Use the equation of the fitted line to predict what pave- 
ment deflection would be observed when the surface 
temperature is 85°F. 

(c) What is the mean pavement deflection when the surface 
temperature is 90°F? 

(d) What change in mean pavement deflection would be ex- 
pected for a 1°F change in surface temperature? 


11-3. The following table presents data on the ratings of quar- 
terbacks for the 2008 National Football League season (source: 
The Sports Network). It is suspected that the rating (7) is related 
to the average number of yards gained per pass attempt (x). 

(a) Calculate the least squares estimates of the slope and 
intercept. What is the estimate of 0°? Graph the regres- 
sion model. 

(b) Find an estimate of the mean rating if a quarterback 
averages 7.5 yards per attempt. 

(c) What change in the mean rating is associated with a 
decrease of one yard per attempt? 

(d) To increase the mean rating by 10 points, how much in- 
crease in the average yards per attempt must be generated? 


(e) Given that x = 7.21 yards, find the fitted value of y and the 
corresponding residual. 


Yards per Rating 

Player Team Attempt Points 
Philip Rivers SD 8.39 105.5 
Chad Pennington MIA 7.67 97.4 
Kurt Warner ARI 7.66 96.9 
Drew Brees NO 7.98 96.2 
Peyton Manning IND ee 95 
Aaron Rodgers GB 7.53 93.8 
Matt Schaub HOU 8.01 CPi 
Tony Romo DAL 7.66 91.4 
Jeff Garcia TB Was 90.2 
Matt Cassel NE 7.16 89.4 
Matt Ryan ATL 7.93 87.7 
Shaun Hill SF 7.10 87.5 
Seneca Wallace SEA 6.33 87 
Eli Manning NYG 6.76 86.4 
Donovan McNabb PHI 6.86 86.4 
Jay Cutler DEN 7.35 86 
Trent Edwards BUF fe22) 85.4 
Jake Delhomme CAR 7.94 84.7 
Jason Campbell WAS 6.41 84.3 
David Garrard JAC 6.77 81.7 
Brett Favre NYJ 6.65 81 
Joe Flacco BAL 6.94 80.3 
Kerry Collins TEN 6.45 80.2 
Ben Roethlisberger PIT 7.04 80.1 
Kyle Orton CHI 6.39 79.6 
JaMarcus Russell OAK 6.58 77.1 
Tyler Thigpen KC 6.21 76 
Gus Freotte MIN TAT 73.7 
Dan Orlovsky DET 6.34 72.6 
Marc Bulger STL 6.18 71.4 
Ryan Fitzpatrick CIN SoZ 70 
Derek Anderson CLE 5.71 66.5 


am) 11-4. An article in Zechnometrics by S.C. Narula and J. FE 
9 Wellington [“Prediction, Linear Regression, and a Minimum 


Sum of Relative Errors” (Vol. 19, 1977)] presents data on the 
selling price and annual taxes for 24 houses. The data are 
shown in the following table. 


Taxes Taxes 
Sale (Local, School), Sale (Local, School), 
Price/1000 County)/1000 = Price/1000 County)/1000 


25.9 4.9176 30.0 5.0500 
29.5 5.0208 36.9 8.2464 
27.9 4.5429 41.9 6.6969 
25.9 4.5573 40.5 7.7841 
29.9 5.0597 43.9 9.0384 
29.9 3.8910 37.5 5.9894 
30.9 5.8980 37.9 7.5422 
28.9 5.6039 44.5 8.7951 
35.9 5.8282 38) 6.0831 
315 5.3003 38.9 8.3607 
31.0 6.2712 36.9 8.1400 
30.9 5.9592 45.8 9.1416 


(a) Assuming that a simple linear regression model is 
appropriate, obtain the least squares fit relating selling 
price to taxes paid. What is the estimate of 0”? 

(b) Find the mean selling price given that the taxes paid are 
x = 7.50. 

(c) Calculate the fitted value of y corresponding to x = 
5.8980. Find the corresponding residual. 

(d) Calculate the fitted »; for each value of x; used to fit the 
model. Then construct a graph of 7; versus the correspon- 
ding observed value y; and comment on what this plot 
would look like if the relationship between y and x was a 
deterministic (no random error) straight line. Does the 
plot actually obtained indicate that taxes paid is an 
effective regressor variable in predicting selling price? 

11-5. The number of pounds of steam used per month by a 

chemical plant is thought to be related to the average ambient 

temperature (in° F) for that month. The past year’s usage and 
temperature are shown in the following table: 


Month Temp. Usage/1000 Month Temp. Usage/1000 


Jan. 21 185.79 July 68 621.55 
Feb. 24 214.47 Aug. 74 675.06 
Mar. 32 288.03 Sept. 62 562.03 
Apr. 47 424.84 Oct. 50 452.93 
May 50 454.58 Nov. 41 369.95 
June 59 539.03 Dec. 30 273.98 
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(a) Assuming that a simple linear regression model is appro- 
priate, fit the regression model relating steam usage (y) to 
the average temperature (x). What is the estimate of 07? 
Graph the regression line. 

(b) What is the estimate of expected steam usage when the 
average temperature is 55°F? 

(c) What change in mean steam usage is expected when the 
monthly average temperature changes by 1°F? 

(d) Suppose the monthly average temperature is 47°F. Calculate 
the fitted value of y and the corresponding residual. 


11-6. The following table presents the highway gasoline 
mileage performance and engine displacement for Daimler- 
Chrysler vehicles for model year 2005 (source: U.S. Environ- 
mental Protection Agency). 

(a) Fit a simple linear model relating highway miles per gal- 
lon (y) to engine displacement (x) in cubic inches using 
least squares. 

(b) Find an estimate of the mean highway gasoline mileage 
performance for a car with 150 cubic inches engine 
displacement. 

(c) Obtain the fitted value of y and the corresponding residual 
for a car, the Neon, with an engine displacement of 122 
cubic inches. 


Engine 
Displacement MPG 

Carline (in?) (highway) 
300C/SRT-8 215 30.8 
CARAVAN 2WD 201 32.5 
CROSSFIRE ROADSTER 196 35.4 
DAKOTA PICKUP 2WD 226 28.1 
DAKOTA PICKUP 4WD 226 24.4 
DURANGO 2WD 348 24.1 
GRAND CHEROKEE 2WD 226 28.5 
GRAND CHEROKEE 4WD 348 24.2 
LIBERTY/CHEROKEE 2WD 148 32.8 
LIBERTY/CHEROKEE 4WD 226 28 
NEON/SRT-4/SX 2.0 122 41.3 
PACIFICA 2WD 215 30.0 
PACIFICA AWD 215 28.2 
PT CRUISER 148 34.1 
RAM 1500 PICKUP 2WD 500 18.7 
RAM 1500 PICKUP 4WD 348 20.3 
SEBRING 4-DR 165 35.1 
STRATUS 4-DR 148 37.9 
TOWN & COUNTRY 2WD 148 33.8 
VIPER CONVERTIBLE 500 25.9 
WRANGLER/TJ 4WD 148 26.4 
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11-7. An article in the Yappi Journal (March, 1986) 
presented data on green liquor Na,S concentration (in grams 
per liter) and paper machine production (in tons per day). The 
data (read from a graph) are shown as follows: 


x | 915 | 960 | 990 | 1010 | 1012 | 1030 | 1050 


(a) Fit a simple linear regression model with y = green liquor 
Na,S concentration and x = production. Find an estimate 
of o”. Draw a scatter diagram of the data and the resulting 
least squares fitted model. 

(b) Find the fitted value of y corresponding to x = 910 and 
the associated residual. 

(c) Find the mean green liquor Na,S concentration when the 
production rate is 950 tons per day. 


11-8. An article in the Journal of Sound and Vibration 
(Vol. 151, 1991, pp. 383-394) described a study investigating 
the relationship between noise exposure and hypertension. 
The following data are representative of those reported in the 
article. 


y| 1] O}] 1} 2]5]1]4] 6 2 3 


x | 601 63! 65! 70! 701701 80! 90 | 80 | 80 


(a) Draw a scatter diagram of y (blood pressure rise in 
millimeters of mercury) versus x (sound pressure level in 
decibels). Does a simple linear regression model seem 
reasonable in this situation? 

(b) Fit the simple linear regression model using least squares. 
Find an estimate of 0°. 

(c) Find the predicted mean rise in blood pressure level 
associated with a sound pressure level of 85 decibels. 
11-9. Anarticle in Wear (Vol. 152, 1992, pp. 171-181) pres- 
ents data on the fretting wear of mild steel and oil viscosity. 
Representative data follow, with x = oil viscosity and y = wear 

volume (10~* cubic millimeters). 


y 240 181 193 155 | 172 
x 1.6 9.4 15.5 20.0 | 22.0 
y 110 113 75 94 
x 35.5 | 43.0 | 40.5 33.0 
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(a) Construct a scatter plot of the data. Does a simple linear 
regression model appear to be plausible? 

(b) Fit the simple linear regression model using least squares. 
Find an estimate of 0. 

(c) Predict fretting wear when viscosity x = 30. 

(d) Obtain the fitted value of y when x = 22.0 and calculate 
the corresponding residual. 


11-10. An article in the Journal of Environmental 
Engineering (Vol. 115, No. 3, 1989, pp. 608-619) reported the 
results of a study on the occurrence of sodium and chloride in 
surface streams in central Rhode Island. The following data 
are chloride concentration y (in milligrams per liter) and 
roadway area in the watershed x (in percentage). 


y 4.4 6.6 9.7 10.6 | 10.8 10.9 
x 0.19 0.15 0.57 0.70 0.67 0.63 
y 11.8 12.1 14.3 14.7 | 15.0 17.3 
x 0.47 0.70 0.60 0.78 0.81 0.78 
y 19.2 23.1 27.4 27.7 | 31.8 39.5 
x 0.69 1.30 1.05 1.06 1.74 1.62 


(a) Draw a scatter diagram of the data. Does a simple linear 
regression model seem appropriate here? 

(b) Fit the simple linear regression model using the method of 
least squares. Find an estimate of 0”. 

(c) Estimate the mean chloride concentration for a watershed 
that has 1% roadway area. 

(d) Find the fitted value corresponding to x = 0.47 and the 
associated residual. 


11-11. A rocket motor is manufactured by bonding together 
two types of propellants, an igniter and a sustainer. The shear 
strength of the bond y is thought to be a linear function of the 
age of the propellant x when the motor is cast. Twenty obser- 
vations are shown in the following table. 

(a) Draw a scatter diagram of the data. Does the straight-line 
regression model seem to be plausible? 

(b) Find the least squares estimates of the slope and inter- 
cept in the simple linear regression model. Find an 
estimate of o°. 

(c) Estimate the mean shear strength of a motor made from 
propellant that is 20 weeks old. 

(d) Obtain the fitted values j; that correspond to each observed 
value y;. Plot }; versus y; and comment on what this plot 
would look like if the linear relationship between shear 
strength and age were perfectly deterministic (no error). 
Does this plot indicate that age is a reasonable choice of 
regressor variable in this model? 


Observation Strength y Age x 
Number (psi) (weeks) 

1 2158.70 15.50 
2 1678.15 23.75 
3 2316.00 8.00 
4 2061.30 17.00 
5 2207.50 5.00 
6 1708.30 19.00 
7 1784.70 24.00 
8 2575.00 2.50 
9 2357.90 7.50 
10 2277.70 11.00 
11 2165.20 13.00 
12 2399.55 3.75 
13 1779.80 25.00 
14 2336.75 9.75 
15 1765.30 22.00 
16 2053.50 18.00 
17 2414.40 6.00 
18 2200.50 12.50 
19 2654.20 2.00 
20 1753.70 21.50 


11-12. An article in the Journal of the American Ceramic 
Society [“Rapid Hot-Pressing of Ultrafine PSZ Powders” 
(1991, Vol. 74, pp. 1547—1553)] considered the microstructure 
of the ultrafine powder of partially stabilized zirconia as a 
function of temperature. The data are shown below: 


x = Temperature (°C): 1100 1200 1300 1100 1500 
1200 1300 

y = Porosity (%): 30.8 19.2 60 135 114 
7.7 3.6 


(a) Fit the simple linear regression model using the method of 


least squares. Find an estimate of 07. 
(b) Estimate the mean porosity for a temperature of 1400°C. 


(c) Find the fitted value corresponding to y = 11.4 and the 
associated residual. 

(d) Draw a scatter diagram of the data. Does a simple linear 
regression model seem appropriate here? Explain. 


11-13. An article in the Journal of the Environmental 
Engineering Division [“Least Squares Estimates of BOD 
Parameters” (1980, Vol. 106, pp. 1197—1202)] took a sample 
from the Holston River below Kingport, Tennessee, during 
August 1977. The biochemical oxygen demand (BOD) test is 
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conducted over a period of time in days. The resulting data are 
shown below: 


Time(days: 1 2 4 6 8 10 12 14° 16 


18 20 
BOD (mg/liter): 0.6 0.7 15 19 21 26 29 3.7 3.5 
3.7 3.8 


(a) Assuming that a simple linear regression model is appro- 
priate, fit the regression model relating BOD (y) to the 
time (x). What is the estimate of 07? 

(b) What is the estimate of expected BOD level when the time 
is 15 days? 

(c) What change in mean BOD is expected when the time 
changes by three days? 

(d) Suppose the time used is six days. Calculate the fitted 
value of y and the corresponding residual. 

(e) Calculate the fitted }; for each value of x; used to fit the 
model. Then construct a graph of }; versus the correspon- 
ding observed values y; and comment on what this plot 
would look like if the relationship between y and x was a 
deterministic (no random error) straight line. Does the 
plot actually obtained indicate that time is an effective 
regressor variable in predicting BOD? 

11-14. An article in Wood Science and Technology [Creep 
in Chipboard, Part 3: Initial Assessment of the Influence of 
Moisture Content and Level of Stressing on Rate of Creep 
and Time to Failure” (1981, Vol. 15, pp. 125—144)] studied 
the deflection (mm) of particleboard from stress levels of rel- 
ative humidity. Assume that the two variables are related ac- 
cording to the simple linear regression model. The data are 
shown below: 


x = Stress level (%): 54 54 61 61 68 
y = Deflection (mm): 16.473 18.693 14.305 15.121 13.505 


x = Stress level (%): 68 75 75 75 
y = Deflection (mm): 11.640 11.168 12.534 11.224 


(a) Calculate the least square estimates of the slope and inter- 
cept. What is the estimate of o”? Graph the regression 
model and the data. 

(b) Find the estimate of the mean deflection if the stress level 
can be limited to 65%. 

(c) Estimate the change in the mean deflection associated 
with a 5% increment in stress level. 

(d) To decrease the mean deflection by one millimeter, how 
much increase in stress level must be generated? 

(e) Given that the stress level is 68%, find the fitted value of 
deflection and the corresponding residual. 


11-15. In an article in Statistics and Computing [“An 
Iterative Monte Carlo Method for Nonconjugate Bayesian 
Analysis” (1991, pp. 119-128)] Carlin and Gelfand 
investigated the age (x) and length (y) of 27 captured dugongs 
(sea cows). 
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x = 1.0, 1.5, 1.5, 1.5, 2.5, 4.0, 5.0, 5.0, 7.0, 8.0, 8.5, 9.0, 9.5, 
9.5, 10.0, 12.0, 12.0, 13.0, 13.0, 14.5, 15.5, 15.5, 16.5, 
17.0, 22.5, 29.0, 31.5 


y = 1.80, 1.85, 1.87, 1.77, 2.02, 2.27, 2.15, 2.26, 2.47, 2.19, 
2.26, 2.40, 2.39, 2.41, 2.50, 2.32, 2.32, 2.43, 2.47, 2.56, 
2.65, 2.47, 2.64, 2.56, 2.70, 2.72, 2.57 


(a 


YS 


Find the least squares estimates of the slope and the inter- 
cept in the simple linear regression model. Find an esti- 
mate of 0. 

(b) Estimate the mean length of dugongs at age 11. 

(c) Obtain the fitted values j; that correspond to each ob- 
served value y;. Plot }; versus y,, and comment on what 
this plot would look like if the linear relationship between 
length and age were perfectly deterministic (no error). 
Does this plot indicate that age is a reasonable choice of 
regressor variable in this model? 


11-16. Consider the regression model developed in Ex- 

ercise 11-2. 

(a) Suppose that temperature is measured in °C rather than °F. 
Write the new regression model. 

(b) What change in expected pavement deflection is associ- 
ated with a 1°C change in surface temperature? 


11-17. Consider the regression model developed in Exercise 
11-6. Suppose that engine displacement is measured in cubic 
centimeters instead of cubic inches. 
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(a) Write the new regression model. 
(b) What change in gasoline mileage is associated with a 
1 cm* change is engine displacement? 


11-18. Show that in a simple linear regression model 
the point (x, y) lies exactly on the least squares regression line. 


11-19. Consider the simple linear regression model Y = By + 

B.x + €. Suppose that the analyst wants to use z = x — X as 

the regressor variable. 

(a) Using the data in Exercise 11-11, construct one scatter 
plot of the (x, ,) points and then another of the 
(z; = x; — X, y;) points. Use the two plots to intuitively 
explain how the two models, Y = By + Bix + € and 
Y = BO + Biz + e«, are related. 

(b) Find the least squares estimates of 8 and B* in the model 


Y = B + Biz + e. How do they relate to the least 
squares estimates By and B,? 
11-20. Suppose we wish to fit a regression model for which 


the true regression line passes through the point (0, 0). The ap- 

propriate model is Y = Bx + e. Assume that we have n pairs 

of data (x1, v1), (X2, V2), +++ 5 Ons Vn)- 

(a) Find the least squares estimate of B. 

(b) Fit the model Y = Bx + € to the chloride concentration- 
roadway area data in Exercise 11-10. Plot the fitted 
model on a scatter diagram of the data and comment on 
the appropriateness of the model. 


11-3 PROPERTIES OF THE LEAST SQUARES ESTIMATORS 


The statistical properties of the least squares estimators Bo and B, may be easily described. 
Recall that we have assumed that the error term € in the model Y = By + B,x + €is arandom 
variable with mean zero and variance o*. Since the values of x are fixed, Y is a random vari- 
able with mean yj, = Bo + B,x and variance o*. Therefore, the values of Bo and By depend 
on the observed y’s; thus, the least squares estimators of the regression coefficients may be 
viewed as random variables. We will investigate the bias and variance properties of the least 


squares estimators Bo and Bi. 


Consider first B 1. Because B, is a linear combination of the observations Y;,, we can use 
properties of expectation to show that the expected value of 8, is 


E(B) = By (11-15) 


Thus, 8, is an unbiased estimator of the true slope {. 

Now consider the variance of Bi. Since we have assumed that V(e;) = o”, it follows that 
V(Y,) = o*. Because B, is a linear combination of the observations Y,, the results in 
Section 5-5 can be applied to show that 


(11-16) 
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For the intercept, we can show in a similar manner that 
de os es ol, Bake 
E(Bo) = Bo and V(Bo) = 0° | 5 + o- (11-17) 


Thus, Bo is an unbiased estimator of the intercept By. The covariance of the random vari- 
ables Bo and B, is not zero. It can be shown (see Exercise 11-98) that cov(Ro, B,) = 
—0°X/Syx- 

The estimate of o? could be used in Equations 11-16 and 11-17 to provide estimates of 
the variance of the slope and the intercept. We call the square roots of the resulting variance 
estimators the estimated standard errors of the slope and intercept, respectively. 


Estimated 


Standard In simple linear regression the estimated standard error of the slope and the 
Errors estimated standard error of the intercept are 


AD) = 
se(B,) = oo and —-se(By) = *|t ae =| 


respectively, where G? is computed from Equation 11-13. 


The Minitab computer output in Table 11-2 reports the estimated standard errors of the slope 
and intercept under the column heading “SE coeff.” 


11-4 HYPOTHESIS TESTS IN SIMPLE LINEAR REGRESSION 


An important part of assessing the adequacy of a linear regression model is testing statistical 
hypotheses about the model parameters and constructing certain confidence intervals. Hypothesis 
testing in simple linear regression is discussed in this section, and Section 11-5 presents methods 
for constructing confidence intervals. To test hypotheses about the slope and intercept of the re- 
gression model, we must make the additional assumption that the error component in the 
model, ¢€, is normally distributed. Thus, the complete assumptions are that the errors are normally 
and independently distributed with mean zero and variance o°, abbreviated NID(0, 0”). 


11-4.1 Use of t-Tests 


Suppose we wish to test the hypothesis that the slope equals a constant, say, 8; 9. The appro- 
priate hypotheses are 


Ao: Bi = Bio 
A: By # Bio (11-18) 


where we have assumed a two-sided alternative. Since the errors €,; are NID(0, 07), it follows 
directly that the observations Y, are NID(By + B,x;, 07). Now 8, is a linear combination of 
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Test Statistic 


Test Statistic 


independent normal random variables, and consequently, 8; is N(G,, 07/S,,), using the bias 
and variance properties of the slope discussed in Section 11-3. In addition, (n — 2)67/o? has 
a chi-square distribution with n — 2 degrees of freedom, and B , is independent of 6. As a 
result of those properties, the statistic 


T= Bi — Bio (11-19) 


V6"7/Sy 


follows the ¢ distribution with n — 2 degrees of freedom under Hp: B; = Bio. We would reject 
Ay: By = Bio if 


|to| > bo/2,n—2 d 1-20) 


where ¢, is computed from Equation 11-19. The denominator of Equation 11-19 is the standard 
error of the slope, so we could write the test statistic as 


Bi — Bio 


To = 7 
se(Bi) 
A similar procedure can be used to test hypotheses about the intercept. To test 


Ho: Bo = Boo 
A: Bo # Boo (11-21) 


we would use the statistic 


qT, = Bo — Boo —__ Bo — Boo (11-22) 


: yal , Se se(Bo) 


and reject the null hypothesis if the computed value of this test statistic, fp, is such that 
|fo| > tay2,n—2- Note that the denominator of the test statistic in Equation 11-22 is just the stan- 
dard error of the intercept. 

A very important special case of the hypotheses of Equation 11-18 is 


A: By, = 0 
Ay: B, #0 (11-23) 


These hypotheses relate to the significance of regression. Failure to reject Hp: B, = 0 is 
equivalent to concluding that there is no linear relationship between x and Y. This situation is 
illustrated in Fig. 11-5. Note that this may imply either that x is of little value in explaining the 
variation in Y and that the best estimator of Y for any x is } = Y [Fig. 11-5(a)] or that the true 
relationship between x and Y is not linear [Fig. 11-5(6)]. Alternatively, if Hp: B; = 0 is re- 
jected, this implies that x is of value in explaining the variability in Y (see Fig. 11-6). Rejecting 
Ay: 8; = 0 could mean either that the straight-line model is adequate [Fig. 11-6(a)] or that, 
although there is a linear effect of x, better results could be obtained with the addition of 
higher order polynomial terms in x [Fig. 11-6(4)]. 
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Figure 11-5 The 
hypothesis Hy: B, = 0 re 
is not rejected. (a) 


EXAMPLE 11-2 Oxygen Purity Tests of Coefficients 


We will test for significance of regression using the model for 
the oxygen purity data from Example 11-1. The hypotheses are 


A: By =0 
A: By, #0 


and we will use a = 0.01. From Example 11-1 and Table 11-2 
we have 


B, = 14.947 n= 20, Sy, = 0.68088, 6? = 1.18 


so the f-statistic in Equation 10-20 becomes 


14.947 
V/1.18/0.68088 


1 
se(By) 
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(b) 


Practical Interpretation: Since the reference value of t is 
to005,18 = 2-88, the value of the test statistic is very far into the 
critical region, implying that Hy: B, = 0 should be rejected. 
There is strong evidence to support this claim. The P-value for 
this test is P ~ 1.23 X 10°. This was obtained manually 
with a calculator. 

Table 11-2 presents the Minitab output for this problem. 
Notice that the ¢-statistic value for the slope is computed as 
11.35 and that the reported P-value is P = 0.000. Minitab also 
reports the ¢-statistic for testing the hypothesis Hp: By = 0. 
This statistic is computed from Equation 11-22, with Boo = 0, 
as fy = 46.62. Clearly, then, the hypothesis that the intercept is 
zero is rejected. 


11-4.2 Analysis of Variance Approach to Test Significance of Regression 


A method called the analysis of variance can be used to test for significance of regression. 
The procedure partitions the total variability in the response variable into meaningful compo- 
nents as the basis for the test. The analysis of variance identity is as follows: 


Analysis of 
Variance le 
Identity Lory 


(11-24) 


Figure 11-6 The 
hypothesis Hy: B, = 0 x 
is rejected. (a) 


(b) 
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Test for 
Significance of 
Regression 


The two components on the right-hand-side of Equation 11-24 measure, respectively, the 
amount of variability in y, accounted for by the regression line and the residual variation left 
unexplained by the regression line. We usually call SS; = D7, (vy; — 9;)° the error sum of 
squares and SSp = >;_; (3; — y)* the regression sum of squares. Symbolically, Equation 
11-24 may be written as 


where SS; = >'_, (v; — y)? is the total corrected sum of squares of y. In Section 11-2 we 
noted that SS, = SS; — B,S,, (see Equation 11-14), so since SS; = B,5,, + SS;, we note that the 
regression sum of squares in Equation 11-25 is SSp = B,S,,. The total sum of squares SS; has 
n — | degrees of freedom, and SSp and SS; have 1 and n —2 degrees of freedom, respectively. 

We may show that E[SS,/(n — 2)] = 0, E(SSp) = 07 + BiS,, and that SS,/o? and 
SSp/o° are independent chi-square random variables with n — 2 and | degrees of freedom, re- 
spectively. Thus, if the null hypothesis Hp: 8, = 0 is true, the statistic 


_ SSp/1 | MSp 
SS;/(n — 2) MSz 


Fo (11-26) 


follows the F’, ,-, distribution, and we would reject Ho if fo > fa.1,-2- The quantities MSp = 
SS,/1 and MS; = SS;,/(n — 2) are called mean squares. In general, a mean square is always 
computed by dividing a sum of squares by its number of degrees of freedom. The test proce- 
dure is usually arranged in an analysis of variance table, such as Table 11-3. 


EXAMPLE 11-3 Oxygen Purity ANOVA 

We will use the analysis of variance approach to test for signifi- 
cance of regression using the oxygen purity data model from 
Example 11-1. Recall that SS; = 173.38, B; = 14.947, Sy = 
10.17744, and n = 20. The regression sum of squares is 


SSp = ByS,y = (14.947)10.17744 = 152.13 


and the error sum of squares is 


SS = SS 


SSp = 173.38 — 152.13 = 21.25 


The analysis of variance for testing Hy: 8; = 0 is sum- 
marized in the Minitab output in Table 11-2. The test statistic 
is fo = MSp/MS, = 152.13/1.18 = 128.86, for which we 
find that the P-value is P ~ 1.23 X 10°, so we conclude that 
B, is not zero. 

There are frequently minor differences in terminology 
among computer packages. For example, sometimes the re- 
gression sum of squares is called the “model” sum of squares, 
and the error sum of squares is called the “residual” sum of 
squares. 


Table 11-3 Analysis of Variance for Testing Significance of Regression 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square Fo 
Regression SSp = BiS.y 1 MSp MS;/MS, 
Error So 557 bis n-2 MS; 

Total SS n-1 


Note that MS, = 67. 
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Note that the analysis of variance procedure for testing for significance of regression is 
equivalent to the z-test in Section 11-4.1. That is, either procedure will lead to the same conclusions. 
This is easy to demonstrate by starting with the ¢test statistic in Equation 11-19 with B; 9 = 0, say 


T, = Pils (11-27) 


© V6? /Sex 


Squaring both sides of Equation 11-27 and using the fact that 6? = MS; results in 


7? BTS,» B xy MS 
°™ MS; MS, MS; 


Note that 75 in Equation 11-28 is identical to Fy in Equation 11-26. It is true, in general, that 
the square of a t random variable with v degrees of freedom is an F random variable, with one 
and v degrees of freedom in the numerator and denominator, respectively. Thus, the test using 
T, is equivalent to the test based on Fy. Note, however, that the ¢-test is somewhat more flexible 
in that it would allow testing against a one-sided alternative hypothesis, while the F-test is 
restricted to a two-sided alternative. 


(11-28) 


EXERCISES FOR SECTION 11-4 


11-21. Consider the computer output below. 


The regression equation is 
Y = 12.9 + 2.34x 


Predictor Coef SE Coef T 

Constant 12.857 1.032 ? ? 
xX 2.3445 0.1150 ? ? 
S = 1.48111 R-Sq = 98.1% R-Sq(adj) = 97.9% 
Analysis of Variance 

Source DF SS MS F 
Regression 1 912.43 912.43 ? ? 
Residual Error 8 17.55 ? 

Total 9 929.98 


(a) Fill in the missing information. You may use bounds for 
the P-values. 

(b) Can you conclude that the model defines a useful linear 
relationship? 

(c) What is your estimate of 0”? 


11-22. Consider the computer output below. 


The regression equation is 
Y = 26.8 + 1.48x 


Predictor Coef SE Coef T 

Constant 26.753 2.373 ? ? 
xX 1.4756 0.1063 ? ? 
S = 2.70040 R-Sq = 93.7% R-Sq (adj) = 93.2% 
Analysis of Variance 

Source DF SS MS F P 
Regression 1 ? ? ? ? 
Residual Error ? 94.8 73 

Total 15 1500.0 


(a) Fill in the missing information. You may use bounds for 
the P-values. 

(b) Can you conclude that the model defines a useful linear 
relationship? 

(c) What is your estimate of 07? 


11-23. Consider the data from Exercise 11-1 on x = 

compressive strength and y = intrinsic permeability of concrete. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. Can you conclude that the model 
specifies a useful linear relationship between these two 
variables? 

(b) Estimate o? and the standard deviation of 8. 

(c) What is the standard error of the intercept in this model? 


11-24. Consider the data from Exercise 11-2 on x = road- 

way surface temperature and y = pavement deflection. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. What conclusions can you draw? 

(b) Estimate the standard errors of the slope and intercept. 


11-25. Consider the National Football League data in 

Exercise 11-3. 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What conclusions can you draw? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hp: Bj = 10 versus Ay: B,; # 10 with a = 0.01. 
Would you agree with the statement that this is a test of 
the hypothesis that a one-yard increase in the average 
yards per attempt results in a mean increase of 10 rating 
points? 

11-26. Consider the data from Exercise 11-4 on y = sales 

price and x = taxes paid. 

(a) Test Ho: B, = O using the ¢-test; use a = 0.05. 

(b) Test Hp: 8; = Ousing the analysis of variance with a = 0.05. 
Discuss the relationship of this test to the test from part (a). 
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(c) Estimate the standard errors of the slope and intercept. 
(d) Test the hypothesis that By = 0. 


11-27. Consider the data from Exercise 11-5 on y = steam 

usage and x = average temperature. 

(a) Test for significance of regression using a = 0.01. What 
is the P-value for this test? State the conclusions that 
result from this test. 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test the hypothesis Hy: 8; = 10 versus H,: B, ~ 10 using 
a = 0.01. Find the P-value for this test. 

(d) Test Ho: By = 0 versus H;: By ¥ 0 using a = 0.01. Find 
the P-value for this test and draw conclusions. 


11-28. Consider the data from Exercise 11-6 ony = highway 

gasoline mileage and x = engine displacement. 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What conclusions can you 
reach? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hy: B; = —0.05 versus H,: B, < —0.05 using a = 
0.01 and draw conclusions. What is the P-value for this test? 

(d) Test the hypothesis Hj: By = 0 versus H;: By ~ O using 
a = 0.01. What is the P-value for this test? 


11-29. Consider the data from Exercise 11-7 on y = green 

liquor Na,S concentration and x = production in a paper mill. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hp: By = 0 versus H,: By ¥ 0 using a = 0.05. What 
is the P-value for this test? 

11-30. Consider the data from Exercise 11-8 on y = blood 

pressure rise and x = sound pressure level. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hp: By = 0 versus H;: By # O using a = 0.05. Find 
the P-value for this test. 

11-31. Consider the data from Exercise 11-11, on y = shear 

strength of a propellant and x = propellant age. 

(a) Test for significance of regression with a = 0.01. Find the 
P-value for this test. 

(b) Estimate the standard errors of Bo and Bi. 

(c) Test Hp: 8B; = —30 versus H,: B; ~ —30 using a = 0.01. 
What is the P-value for this test? 

(d) Test Hp: By = 0 versus H,: By ¥ 0 using a = 0.01. What 
is the P-value for this test? 

(e) Test Hp: By = 2500 versus H,: By > 2500 using a = 
0.01. What is the P-value for this test? 

11-32. Consider the data from Exercise 11-10 on y = chloride 

concentration in surface streams and x = roadway area. 

(a) Test the hypothesis Hy: B; = 0 versus H,: B, ~ 0 using 
the analysis of variance procedure with a = 0.01. 

(b) Find the P-value for the test in part (a). 

(c) Estimate the standard errors of 8, and Bo. 
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(d) Test Hp: Bo = 0 versus H): By ~ 0 using a = 0.01. What 
conclusions can you draw? Does it seem that the model 
might be a better fit to the data if the intercept were removed? 


11-33. Consider the data in Exercise 11-13 on y= 

oxygen demand and x = time. 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What conclusions can you draw? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test the hypothesis that By = 0. 


11-34. Consider the data in Exercise 11-14 on y = deflection 

and x = stress level. 

(a) Test for significance of regression using a = 0.01. What 
is the P-value for this test? State the conclusions that result 
from this test. 

(b) Does this model appear to be adequate? 

(c) Estimate the standard errors of the slope and intercept. 


11-35. An article in The Journal of Clinical Endocrinology 
and Metabolism [‘Simultaneous and Continuous 24-Hour 
Plasma and Cerebrospinal Fluid Leptin Measurements: 
Dissociation of Concentrations in Central and Peripheral 
Compartments” (2004, Vol. 89, pp. 258—-265)] studied the 
demographics of simultaneous and continuous 24-hour 
plasma and cerebrospinal fluid leptin measurements. The data 
follow: 


y = BMI (kg/m’): 19.92 20.59 29.02 20.78 25.97 


20.39 23.29 17.27 35.24 
x = Age (yr): 45.5 346 40.6 32.9 28.2 30.1 
52.1 33.3 47.0 


(a) Test for significance of regression using a = 0.05. Find the 
P-value for this test. Can you conclude that the model speci- 
fies a useful linear relationship between these two variables? 

(b) Estimate o? and the standard deviation of Bi. 

(c) What is the standard error of the intercept in this model? 


11-36. Suppose that each value of x; is multiplied by a pos- 
itive constant a, and each value of y; is multiplied by another 
positive constant b. Show that the ¢-statistic for testing 
Ay: B,; = 0 versus H,: 8, # 0 is unchanged in value. 


11-37. The type II error probability for the t-test for 
Ay: By = By can be computed in a similar manner to the 
t-tests of Chapter 9. If the true value of B, is B), the value 
d =|Bi9 — Bil/((oV(n — 1)/S, is calculated and used as 
the horizontal scale factor on the operating characteristic curves 
for the ¢-test (Appendix Charts VIIe through VIIA) and the type 
II error probability is read from the vertical scale using the curve 
for n — 2 degrees of freedom. Apply this procedure to the foot- 
ball data of Exercise 11-3, using o = 5.5 and B; = 12.5, where 
the hypotheses are Hp: B, = 10 versus H;: B, # 10. 

11-38. Consider the no-intercept model Y= Bx + € 
with the e’s NID(O, 07). The estimate of co? is s* = 
Dia (Vi — Bx;)°/(n — 1) and V(B) = 0° / Dinix}. 

(a) Devise a test statistic for Hy: 8 = 0 versus H,: B # 0. 

(b) Apply the test in (a) to the model from Exercise 11-20. 
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11-5 CONFIDENCE INTERVALS 

11-5.1 Confidence Intervals on the Slope and Intercept 
In addition to point estimates of the slope and intercept, it is possible to obtain confidence 
interval estimates of these parameters. The width of these confidence intervals is a measure of 


the overall quality of the regression line. If the error terms, €;, in the regression model are 
normally and independently distributed, 


=2 
(By — Bi) /V67/S,. and (Bo — Bo)/ a 4 =| 


are both distributed as ¢ random variables with n — 2 degrees of freedom. This leads to the 
following definition of 100(1 — «)% confidence intervals on the slope and intercept. 


Confidence 


Intervals on Under the assumption that the observations are normally and independently distributed, 
Parameters a 100(1 — «)% confidence interval on the slope B, in simple linear regression is 


“ & 6? 


~ o 
Bi — tay2,n-2 Tee Bi = Bi + toprn-2 3. (11-29) 


Similarly, a 10011 — «)% confidence interval on the intercept Bo is 


i ex 
= By = Bo + tenn-2,/07] - + — (11-30) 
0 08 Fa/2,n-2\fF | Tg 


EXAMPLE 11-4 Oxygen Purity Confidence Interval on the Slope 


| eo 
Bo — ba/2,n-2 a is aa | 
xx 


We will find a 95% confidence interval on the slope of the re- This simplifies to 
gression line using the data in Example 11-1. Recall that 
a = = a2 
By = 14.947, Syx = 0.68088, and G~ = 1.18 (see Table 11-2). 12.181 = By <= 17.713 
Then, from Equation 11-29 we find 
: & . & Practical Interpretation: This CI does not include zero, so 
Bi — to.025,18 s. = Bi = Bi + (025,18 5 there is strong evidence (at a = 0.05) that the slope is not zero. 
sa i The Cl is reasonably narrow (+2.766) because the error vari- 
or ance is fairly small. 
14.947 — 2.101 Ee B, = 14.947 
0.68088" 
1.18 
+ 2.101 
0.68088 


11-5.2 Confidence Interval on the Mean Response 


A confidence interval may be constructed on the mean response at a specified value of x, say, 
Xo. This is a confidence interval about E(Y|xo) = Myx, and is often called a confidence interval 
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about the regression line. Since E(Y|xo) = Ly|x, = Bo + BX, we may obtain a point estimate 
of the mean of Y at x = x(py|x,) from the fitted model as 


ey] xp = Bo + Bixo 


Now ftyjx, is an unbiased point estimator of py|,,, since Bo and B, are unbiased estimators of 
Bo and B,. The variance of jLy),, is 


V (frp) = aE . oe 


This last result follows from the fact that iy), = ¥ + Bi (xo — xX) and cov (Y, B1) = (0. The zero 
covariance result is left as a mind-expanding exercise. Also, y|x, is normally distributed, because 
8, and By are normally distributed, and if we 6? use as an estimate of o°, it is easy to show that 


Py |xy LY |xo 

i Gig= xy? 

a2 a eee 
\é E 2 


has a ¢ distribution with n — 2 degrees of freedom. This leads to the following confidence 
interval definition. 


Confidence 


Interval on the A 100(1 — a)% confidence interval about the mean response at the value of 
Mean Response X = Xo, SAY [y)x,9 18 given by 


I Gp =x) 
A a2 
PY|xo — ane E ar ce 
1 (xo = iF 


P Ve 
= By |x) = Bypxy T tawnje|} os Ss. (11-31) 


where fLy|x, = By + Bixo is computed from the fitted regression model. 


Note that the width of the CI for py),, is a function of the value specified for xo. The interval 
width is a minimum for xy = X and widens as |xg — X| increases. 


EXAMPLE 11-5 Oxygen Purity Confidence Interval on the Mean Response 


We will construct a 95% confidence interval about the mean and the 95% confidence interval is 
response for the data in Example 11-1. The fitted model is 
Ply|x, = 74.283 + 14.947x9, and the 95% confidence interval 1 (1.00 — 1.1960) 
On Ly|x, 18 found from Equation 11-31 as 89.23 + 2.1014/ 1.18 EB ; 0.68088 
; 1 (% — 1.1960)? or 
Pey|xy = sory 1s| 0.68088 
89.23 + 0.75 


Suppose that we are interested in predicting mean oxygen 
purity when x) = 1.00%. Then Therefore, the 95% CI on Wy|1,00 is 


Pvy xq = 74-283 + 14.947(1.00) = 89.23 88.48 = py) 100 = 89.98 
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Figure 11-7 Scatter § 
diagram of oxygen 
purity data from 90 


Example 11-1 with 
fitted regression line 
and 95 percent 
confidence limits on 


BY |x9° 


1.07 


1.27 
x 


This is a reasonable narrow CI. 

Minitab will also perform these calculations. Refer to 
Table 11-2. The predicted value of y at x = 1.00 is shown 
along with the 95% CI on the mean of y at this level of x. 

By repeating these calculations for several different val- 
ues for x9, we can obtain confidence limits for each correspon- 
ding value of My|x,. Figure 11-7 displays the scatter diagram 
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1.47 


Hydrocarbon level (%) 


with the fitted model and the corresponding 95% confidence 
limits plotted as the upper and lower lines. The 95% confi- 
dence level applies only to the interval obtained at one value 
of x and not to the entire set of x-levels. Notice that the width 
of the confidence interval on Wy|x, increases as |xo — | 
increases. 


11-6 PREDICTION OF NEW OBSERVATIONS 


An important application of a regression model is predicting new or future observations Y 
corresponding to a specified level of the regressor variable x. If xp is the value of the regressor 


variable of interest, 


Yo = Bo + Bixo (11-32) 


is the point estimator of the new or future value of the response Yo. 

Now consider obtaining an interval estimate for this future observation Yo. This new 
observation is independent of the observations used to develop the regression model. 
Therefore, the confidence interval for jy), in Equation 11-31 is inappropriate, since it is based 
only on the data used to fit the regression model. The confidence interval about py;,, refers to 
the true mean response at x = Xp (that is, a population parameter), not to future observations. 

Let Yy be the future observation at x = xo, and let Yo given by Equation 11-32 be the 
estimator of Yo. Note that the error in prediction 


&p = Yo — Y% 


is a normally distributed random variable with mean zero and variance 


- 1 
= =~ ia 
V(% — %) = 07/1 +5 + 


(j= %) 


Syx 
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Prediction 
Interval 


because Yj is independent of Yo. If we use G’ to estimate o”, we can show that 
Yo — Yo 
=\2 
1 x5 x 
‘| Ppp hae ee 
a Syx 


has a ¢ distribution with n — 2 degrees of freedom. From this we can develop the following 
prediction interval definition. 


A 10001 — a) % prediction interval on a future observation Yo at the value xo is 
given by 


VS as 
Ol a2 be Bon ea 
yo aa 1 SP n ar ce 


1 
<Y%y=fyt tannay/t|1 ae 


Cy =e 
Sey 


(11-33) 


The value is computed from the regression model fy = By + BiXo. 


Notice that the prediction interval is of minimum width at x9 = x and widens as |xy — x| 
increases. By comparing Equation 11-33 with Equation 11-31, we observe that the prediction 
interval at the point x, is always wider than the confidence interval at x). This results because 
the prediction interval depends on both the error from the fitted model and the error associated 


with future observations. 


EXAMPLE 11-6 Oxygen Purity Prediction Interval 
To illustrate the construction of a prediction interval, suppose 
we use the data in Example 11-1 and find a 95% prediction in- 
terval on the next observation of oxygen purity atx) = 1.00%. 
Using Equation 11-33 and recalling from Example 11-5 that 
Jo = 89.23, we find that the prediction interval is 


89.23 2.1011 18) noes aero” 
: , "20 0.68088 


1 (1.00 —1.1960)° 
< Y¥) = 89.23 + 2.101.4/1.18/1+—4 
20 0.68088 


which simplifies to 


86.83 < wy < 91.63 


This is a reasonably narrow prediction interval. 

Minitab will also calculate prediction intervals. Refer to 
the output in Table 11-2. The 95% PI on the future observation 
at X) = 1.00 is shown in the display. 

By repeating the foregoing calculations at different levels 
of x9, we may obtain the 95% prediction intervals shown 
graphically as the lower and upper lines about the fitted re- 
gression model in Fig. 11-8. Notice that this graph also shows 
the 95% confidence limits on [-y|x, calculated in Example 11-5. 
It illustrates that the prediction limits are always wider than 
the confidence limits. 
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EXERCISES FOR SECTIONS 11-5 AND 11-6 


11-39. Refer to the data in Exercise 11-1 on y = intrinsic 

permeability of concrete and x = compressive strength. Find 

a 95% confidence interval on each of the following: 

(a) Slope  (b) Intercept 

(c) Mean permeability when x = 2.5 

(d) Find a 95% prediction interval on permeability when 
x = 2.5. Explain why this interval is wider than the 
interval in part (c). 

11-40. Exercise 11-2 presented data on roadway surface 

temperature x and pavement deflection y. Find a 99% confi- 

dence interval on each of the following: 

(a) Slope  (b) Intercept 

(c) Mean deflection when temperature x = 85°F 

(d) Find a 99% prediction interval on pavement deflection 
when the temperature is 90°F. 


11-41. Refer to the NFL quarterback ratings data in 

Exercise 11-3. Find a 95% confidence interval on each of the 

following: 

(a) Slope 

(b) Intercept 

(c) Mean rating when the average yards per attempt is 8.0 

(d) Find a 95% prediction interval on the rating when the 
average yards per attempt is 8.0. 


11-42. Refer to the data on y = house selling price and 

x = taxes paid in Exercise 11-4. Find a 95% confidence inter- 

val on each of the following: 

(a) B, (db) Bo 

(c) Mean selling price when the taxes paid are x = 7.50 

(d) Compute the 95% prediction interval for selling price 
when the taxes paid are x = 7.50. 


11-43. Exercise 11-5 presented data on y = steam usage 
and x = monthly average temperature. 


(a) Find a 99% confidence interval for B,. 

(b) Find a 99% confidence interval for By. 

(c) Find a 95% confidence interval on mean steam usage 
when the average temperature is 55°F. 

(d) Find a 95% prediction interval on steam usage when tem- 
perature is 55°F. Explain why this interval is wider than 
the interval in part (c). 


11-44. Exercise 11-6 presented gasoline mileage perfor- 

mance for 21 cars, along with information about the engine 

displacement. Find a 95% confidence interval on each of the 

following: 

(a) Slope  (b) Intercept 

(c) Mean highway gasoline mileage when the engine dis- 
placement is x = 150 in? 

(d) Construct a 95% prediction interval on highway gasoline 
mileage when the engine displacement is x = 150 in’, 


11-45. Consider the data in Exercise 11-7 on y = green 

liquor Na,S concentration and x = production in a paper 

mill. Find a 99% confidence interval on each of the following: 

(a) Bi (6) Bo 

(c) Mean Na,S concentration when production x = 910 
tons/day 

(d) Find a 99% prediction interval on Na,S concentration 
when x = 910 tons/day. 


11-46. Exercise 11-8 presented data on y = blood pressure 

rise and x = sound pressure level. Find a 95% confidence 

interval on each of the following: 

(a) B:  (b) Bo 

(c) Mean blood pressure rise when the sound pressure level is 
85 decibels 

(d) Find a 95% prediction interval on blood pressure rise 
when the sound pressure level is 85 decibels. 
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11-47. Refer to the data in Exercise 11-9 on y = wear 

volume of mild steel and x = oil viscosity. Find a 95% confi- 

dence interval on each of the following: 

(a) Intercept (b) Slope 

(c) Mean wear when oil viscosity x = 30 

11-48. Exercise 11-10 presented data on chloride concentra- 

tion y and roadway area x on watersheds in central Rhode 

Island. Find a 99% confidence interval on each of the following: 

(a) B, (db) Bo 

(c) Mean chloride concentration when roadway areax = 1.0% 

(d) Find a 99% prediction interval on chloride concentration 
when roadway area x = 1.0%. 

11-49. Refer to the data in Exercise 11-11 on rocket motor 

shear strength y and propellant age x. Find a 95% confidence 

interval on each of the following: 

(a) Slope B,; (b) Intercept By 

(c) Mean shear strength when age x = 20 weeks 


(d) Find a 95% prediction interval on shear strength when age 
x = 20 weeks. 

11-50. Refer to the data in Exercise 11-12 on the mi- 

crostructure of zirconia. Find a 95% confidence interval on 

each of the following: 

(a) Slope (b) Intercept 

(c) Mean length when x = 1500 

(d) Find a 95% prediction interval on length when x = 1500. 
Explain why this interval is wider than the interval in 
part (c). 

11-51. Refer to the data in Exercise 11-13 on oxygen de- 

mand. Find a 99% confidence interval on each of the 

following: 

(a) By 

(b) Bo 

(c) Find a 95% confidence interval on mean BOD when the 
time is 8 days. 


11-7 ADEQUACY OF THE REGRESSION MODEL 


Fitting a regression model requires several assumptions. Estimation of the model parameters 
requires the assumption that the errors are uncorrelated random variables with mean zero and 
constant variance. Tests of hypotheses and interval estimation require that the errors be nor- 
mally distributed. In addition, we assume that the order of the model is correct; that is, if we 
fit a simple linear regression model, we are assuming that the phenomenon actually behaves in 
a linear or first-order manner. 

The analyst should always consider the validity of these assumptions to be doubtful and 
conduct analyses to examine the adequacy of the model that has been tentatively entertained. 
In this section we discuss methods useful in this respect. 


11-7.1 Residual Analysis 


The residuals from a regression model are e; = y; — J;,i = 1, 2,...,n, where y; is an actual 
observation and j; is the corresponding fitted value from the regression model. Analysis of the 
residuals is frequently helpful in checking the assumption that the errors are approximately 
normally distributed with constant variance, and in determining whether additional terms in 
the model would be useful. 

As an approximate check of normality, the experimenter can construct a frequency his- 
togram of the residuals or a normal probability plot of residuals. Many computer programs 
will produce a normal probability plot of residuals, and since the sample sizes in regression 
are often too small for a histogram to be meaningful, the normal probability plotting method 
is preferred. It requires judgment to assess the abnormality of such plots. (Refer to the discus- 
sion of the “fat pencil” method in Section 6-6). 

We may also standardize the residuals by computing d; = e;/ Ve, i= 1,2,...,n. If 
the errors are normally distributed, approximately 95% of the standardized residuals should 
fall in the interval (—2, +2). Residuals that are far outside this interval may indicate the 
presence of an outlier, that is, an observation that is not typical of the rest of the data. Various 
rules have been proposed for discarding outliers. However, outliers sometimes provide 
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(c) (d) 
Figure 11-9 Patterns for residual plots. (a) Satisfactory, (b) Funnel, 


(c) Double bow, (d) Nonlinear. [Adapted from Montgomery, Peck, and 
Vining (2006). ] 


important information about unusual circumstances of interest to experimenters and should 
not be automatically discarded. For further discussion of outliers, see Montgomery, Peck, and 
Vining (2006). 

It is frequently helpful to plot the residuals (1) in time sequence (if known), (2), against 
the ,;, and (3) against the independent variable x. These graphs will usually look like one of 
the four general patterns shown in Fig. 11-9. Pattern (a) in Fig. 11-9 represents the ideal situ- 
ation, while patterns (b), (c), and (d) represent anomalies. If the residuals appear as in (b), the 
variance of the observations may be increasing with time or with the magnitude of y,or x;. Data 
transformation on the response y is often used to eliminate this problem. Widely used vari- 
ance-stabilizing transformations include the use of Vy, In y, or 1/y as the response. See 
Montgomery, Peck, and Vining (2006) for more details regarding methods for selecting an ap- 
propriate transformation. Plots of residuals against ; and x; that look like (c) also indicate in- 
equality of variance. Residual plots that look like (d) indicate model inadequacy; that is, 
higher order terms should be added to the model, a transformation on the x-variable or the 
y-variable (or both) should be considered, or other regressors should be considered. 


EXAMPLE 11-7 Oxygen Purity Residuals 

The regression model for the oxygen purity data in Example 
11-1 is P = 74.283 + 14.947x. Table 11-4 presents the ob- 
served and predicted values of y at each value of x from this 
data set, along with the corresponding residual. These values 
were computed using Minitab and show the number of deci- 
mal places typical of computer output. A normal probability 


plot of the residuals is shown in Fig. 11-10. Since the residu- 
als fall approximately along a straight line in the figure, we 
conclude that there is no severe departure from normality. The 
residuals are also plotted against the predicted value J; in Fig. 
11-11 and against the hydrocarbon levels x; in Fig. 11-12. 
These plots do not indicate any serious model inadequacies. 
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Table 11-4 Oxygen Purity Data from Example 11-1, Predicted Values, and Residuals 


Hydrocarbon 
Level, x 
0.99 
1.02 
1.15 
1.29 
1.46 
1.36 
0.87 
1.23 
1.55 
1.40 


OmAANIAHDUN PWN 


= 
j=) 


Oxygen Predicted Residual Hydrocarbon Oxygen Predicted Residual 

Purity,y Value.p e=y-—yp Level, x Purity,y Value,py e=y—yp 
90.01 89.081 0.929 11 1.19 93.54 92.071 1.469 
89.05 89.530 —0.480 12 1.15 92.52 91.473 1.047 
91.43 91.473 —0.043 13 0.98 90.56 88.932 1.628 
93.74 93.566 0.174 14 1.01 89.54 89.380 0.160 
96.73 96.107 0.623 15 1.11 89.85 90.875 —1.025 
94.45 94.612 —0.162 16 1.20 90.39 92.220 — 1.830 
87.59 87.288 0.302 17 1.26 93.25 93.117 0.133 
91.77 92.669 —0.899 18 1.32 93.41 94.014 —0.604 
99.42 97.452 1.968 19 1.43 94.98 95.658 —0.678 
93.65 95.210 —1.560 20 0.95 87.33 88.483 —1.153 


11-7.2 Coefficient of Determination (R’) 


Cumulative normal probability 


Figure 11-10 
Example 11-7. 


A widely used measure for a regression model is the following ratio of sum of squares. 


R2 


The coefficient of determination is 


Ra =1- (11-34) 


The coefficient is often used to judge the adequacy of a regression model. Subsequently, we 
will see that in the case where X and Y are jointly distributed random variables, R is the square 
of the correlation coefficient between X and Y. From the analysis of variance identity in 
Equations 11-24 and 11-25, 0 = R? < 1. We often refer loosely to R* as the amount of vari- 
ability in the data explained or accounted for by the regression model. For the oxygen purity 
regression model, we have R= SSp/SS7 = 152.13/173.38 = 0.877; that is, the model ac- 
counts for 87.7% of the variability in the data. 


Residuals 


-0.9 0.1 ii 2.1 87 89 91 93 95 97 99 
Residuals Predicted values, 9 


Normal probability plot of residuals, Figure 11-11 Plot of residuals versus predicted oxygen 
purity y, Example 11-7. 


s 


Figure 11-12 Plot of 
residuals versus hydro- 
carbon level x, 
Example 11-8. 
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Hydrocarbon level (%) 
x 


1.47 1.67 


The statistic R? should be used with caution, because it is always possible to make R? 
unity by simply adding enough terms to the model. For example, we can obtain a “perfect” fit 
to n data points with a polynomial of degree n — 1. In addition, R? will always increase if we 
add a variable to the model, but this does not necessarily imply that the new model is superior 
to the old one. Unless the error sum of squares in the new model is reduced by an amount 
equal to the original error mean square, the new model will have a larger error mean square 
than the old one, because of the loss of one error degree of freedom. Thus, the new model will 
actually be worse than the old one. 

There are several misconceptions about R*. In general, R* does not measure the magni- 
tude of the slope of the regression line. A large value of R* does not imply a steep slope. 
Furthermore, R? does not measure the appropriateness of the model, since it can be artificially 
inflated by adding higher order polynomial terms in x to the model. Even if y and x are related 
in a nonlinear fashion, R* will often be large. For example, R? for the regression equation in 
Fig. 11-6(b) will be relatively large, even though the linear approximation is poor. Finally, 
even though R? is large, this does not necessarily imply that the regression model will provide 
accurate predictions of future observations. 


EXERCISES FOR SECTION 11-7 


11-52. Refer to the compressive strength data in Exercise 
11-1. Use the summary statistics provided to calculate R? and 
provide a practical interpretation of this quantity. 


11-53. Refer to the NFL quarterback ratings data in 

Exercise 11-3. 

(a) Calculate R* for this model and provide a practical inter- 
pretation of this quantity. 

(b) Prepare a normal probability plot of the residuals from 
the least squares model. Does the normality assumption 
seem to be satisfied? 

(c) Plot the residuals versus the fitted values and against x. 
Interpret these graphs. 


11-54. Refer to the data in Exercise 11-4 on house selling 

price y and taxes paid x. 

(a) Find the residuals for the least squares model. 

(b) Prepare a normal probability plot of the residuals and in- 
terpret this display. 


(c) Plot the residuals versus j and versus x. Does the assump- 
tion of constant variance seem to be satisfied? 

(d) What proportion of total variability is explained by the 
regression model? 


11-55. Refer to the data in Exercise 11-5 on y = steam 

usage and x = average monthly temperature. 

(a) What proportion of total variability is accounted for by the 
simple linear regression model? 

(b) Prepare a normal probability plot of the residuals and 
interpret this graph. 

(c) Plot residuals versus and x. Do the regression assump- 
tions appear to be satisfied? 


11-56. Refer to the gasoline mileage data in Exercise 11-6. 

(a) What proportion of total variability in highway gaso- 
line mileage performance is accounted for by engine 
displacement? 

(b) Plot the residuals versus » and x, and comment on the graphs. 
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(c) Prepare a normal probability plot of the residuals. Does 
the normality assumption appear to be satisfied? 


11-57. Exercise 11-9 presents data on wear volume y and 

oil viscosity x. 

(a) Calculate R? for this model. Provide an interpretation of 
this quantity. 

(b) Plot the residuals from this model versus j and versus x. 
Interpret these plots. 

(c) Prepare a normal probability plot of the residuals. Does 
the normality assumption appear to be satisfied? 


11-58. Refer to Exercise 11-8, which presented data on 

blood pressure rise y and sound pressure level x. 

(a) What proportion of total variability in blood pressure rise 
is accounted for by sound pressure level? 

(b) Prepare a normal probability plot of the residuals from 
this least squares model. Interpret this plot. 

(c) Plot residuals versus and versus x. Comment on these plots. 


11-59. Refer to Exercise 11-10, which presented data on 

chloride concentration y and roadway area x. 

(a) What proportion of the total variability in chloride con- 
centration is accounted for by the regression model? 

(b) Plot the residuals versus # and versus x. Interpret these plots. 

(c) Prepare a normal probability plot of the residuals. Does 
the normality assumption appear to be satisfied? 


11-60. An article in the Journal of the American Statistical 
Association [“Markov Chain Monte Carlo Methods for 


Compressive Compressive 
Strength Density Strength Density 
3040 29.2 3840 30.7 
2470 24.7 3800 32.7 
3610 32.3 4600 32.6 
3480 31.3 1900 22.1 
3810 31.5 2530 25.3 
2330 24.5 2920 30.8 
1800 19.9 4990 38.9 
3110 27.3 1670 22.1 
3160 27.1 3310 29.2 
2310 24.0 3450 30.1 
4360 33.8 3600 31.4 
1880 21.5 2850 26.7 
3670 sy) 1590 22M 
1740 22.5 3770 30.3 
2250 Dale) 3850 32.0 
2650 25.6 2480 23.2 
4970 34.5 3570 30.3 
2620 26.2 2620 29.9 
2900 26.7 1890 20.8 
1670 21.1 3030 33.2 
2540 24.1 3030 28.2 
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Computing Bayes Factors: A Comparative Review” (2001, 

Vol. 96, pp. 1122—1132)] analyzed the tabulated data on com- 

pressive strength parallel to the grain versus resin-adjusted 

density for specimens of radiata pine. 

(a) Fit a regression model relating compressive strength to 
density. 

(b) Test for significance of regression with a = 0.05. 

(c) Estimate o” for this model. 

(d) Calculate R? for this model. Provide an interpretation of 
this quantity. 

(e) Prepare a normal probability plot of the residuals and in- 
terpret this display. 

(f) Plot the residuals versus j and versus x. Does the assump- 
tion of constant variance seem to be satisfied? 


11-61. Consider the rocket propellant data in Exercise 11-11. 

(a) Calculate R? for this model. Provide an interpretation of 
this quantity. 

(b) Plot the residuals on a normal probability scale. Do any 
points seem unusual on this plot? 

(c) Delete the two points identified in part (b) from the 
sample and fit the simple linear regression model to the re- 
maining 18 points. Calculate the value of R* for the new 
model. Is it larger or smaller than the value of R? com- 
puted in part (a)? Why? 

(d) Did the value of G? change dramatically when the two 
points identified above were deleted and the model fit to 
the remaining points? Why? 

11-62. Consider the data in Exercise 11-7 on y = green 

liquor Na,S concentration and x = paper machine production. 

Suppose that a 14th sample point is added to the original data, 

where y,4 = 59 and x,4 = 855. 

(a) Prepare a scatter diagram of y versus x. Fit the simple lin- 
ear regression model to all 14 observations. 

(b) Test for significance of regression with a = 0.05. 

(c) Estimate o? for this model. 

(d) Compare the estimate of o” obtained in part (c) above with 
the estimate of o* obtained from the original 13 points. 
Which estimate is larger and why? 

(e) Compute the residuals for this model. Does the value of 
€14 appear unusual? 

(f) Prepare and interpret a normal probability plot of the 
residuals. 

(g) Plot the residuals versus and versus x. Comment on 
these graphs. 


11-63. Consider the rocket propellant data in Exercise 11-11. 
Calculate the standardized residuals for these data. Does this 
provide any helpful information about the magnitude of the 
residuals? 


11-64. Studentized Residuals. Show that the variance of 
the ith residual is 


Hint: 


The ith studentized residual is defined as 


= fi 
ot) 


(a) Explain why 7; has unit standard deviation. 

(b) Do the standardized residuals have unit standard deviation? 

(c) Discuss the behavior of the studentized residual when the 
sample value x; is very close to the middle of the range of x. 
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(d) Discuss the behavior of the studentized residual when the 
sample value x; is very near one end of the range of x. 


11-65. Show that an equivalent way to define the test for 
significance of regression in simple linear regression is to base 
the test on R? as follows: to test Hp: B; = 0 versus H,: B, ~ 0, 
calculate 


R°(n — 2) 


Fy = 
ae 


and to reject Hy: 8, = 0 if the computed value fy > f..1,-2- 

Suppose that a simple linear regression model has been fit to 

n = 25 observations and R* = 0.90. 

(a) Test for significance of regression at a = 0.05. 

(b) What is the smallest value of R? that would lead to the 
conclusion of a significant regression if a = 0.05? 


Our development of regression analysis has assumed that x is a mathematical variable, mea- 
sured with negligible error, and that Y is a random variable. Many applications of regression 
analysis involve situations in which both X and Y are random variables. In these situations, it 
is usually assumed that the observations (X;, Y;), i = 1, 2,..., are jointly distributed random 
variables obtained from the distribution f(x, y). 

For example, suppose we wish to develop a regression model relating the shear strength 
of spot welds to the weld diameter. In this example, weld diameter cannot be controlled. We 
would randomly select n spot welds and observe a diameter (X;) and a shear strength (Y;) for 
each. Therefore (X;, Y;) are jointly distributed random variables. 

We assume that the joint distribution of X; and Y;, is the bivariate normal distribution pre- 
sented in Chapter 5, and wy and o} are the mean and variance of Y, wy and %¥x are the mean 
and variance of X, and p is the correlation coefficient between Y and_X. Recall that the corre- 


lation coefficient is defined as 


p= (11-35) 


where o-yy is the covariance between Y and_X. 
The conditional distribution of Y for a given value of X = x is 


where 


and the variance of the conditional distribution of Y given X = x is 


1/y¥ — Bo - en)! 
= 11-36 
V 270 |, exp 2 ( Oy|x ( ) 
Oy 
Bo = by — BxPGy (11-37) 
Oy 
Bi = GP (11-38) 
OF|x = Ty(1 — p’) (11-39) 
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That is, the conditional distribution of Y given X = x is normal with mean 


E(Y|x) = Bo + Bix (11-40) 


and variance OF|x- Thus, the mean of the conditional distribution of Y given X = x is a 
simple linear regression model. Furthermore, there is a relationship between the correlation 
coefficient p and the slope B;. From Equation 11-38 we see that if p = 0, then 8, = 0, which 
implies that there is no regression of Y on _X. That is, knowledge of X does not assist us in 
predicting Y. 

The method of maximum likelihood may be used to estimate the parameters By and B,. It 
can be shown that the maximum likelihood estimators of those parameters are 


Bo = Y- Bix (11-41) 
and 
¥(X; — X) 
a i=1 Syy 
B= = (11-42) 


(y,- xp 


i=1 


We note that the estimators of the intercept and slope in Equations 11-41 and 11-42 are 
identical to those given by the method of least squares in the case where X was assumed to be 
a mathematical variable. That is, the regression model with Y and_X jointly normally distrib- 
uted is equivalent to the model with X considered as a mathematical variable. This follows 
because the random variables Y given XY = x are independently and normally distributed with 
mean B, + 6,x and constant variance OF |x. These results will also hold for any joint distribu- 
tion of Y and_X such that the conditional distribution of Y given X is normal. 

It is possible to draw inferences about the correlation coefficient p in this model. The 
estimator of p is the sample correlation coefficient 


_ i=l = Syy 
© [Sar Si a ae oe 
X, — X Y¥,;- Y 
i=l i=l 
Note that 
es SS7\l/2 
B, = (=) R (11-44) 


so the slope By is just the sample correlation coefficient R multiplied by a scale factor that is 
the square root of the “spread” of the Y values divided by the “spread” of the X values.Thus, 
B, and R are closely related, although they provide somewhat different information. The 
sample correlation coefficient R measures the linear association between Y and_X, while By 
measures the predicted change in the mean of Y for a unit change in_X. In the case of a math- 
ematical variable x, R has no meaning because the magnitude of R depends on the choice of 
spacing of x. We may also write, from Equation 11-44, 


ea p25 - BiSxy _ SSp 
LSS Sp SS 


Test Statistic 
for Zero 
Correlation 


Confidence 
Interval for 
a Correlation 
Coefficient 
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which is just the coefficient of determination. That is, the coefficient of determination R? is 
just the square of the correlation coefficient between Y and X. 
It is often useful to test the hypotheses 
FA: p= 0 
A: p #0 (11-45) 


The appropriate test statistic for these hypotheses is 


poo (11-46) 


V1 — R? 


which has the ¢ distribution with n — 2 degrees of freedom if Hy: p = 0 is true. Therefore, we 

would reject the null hypothesis if |fo| > t, /2.n-2- This test is equivalent to the test of the hypothesis 

Ay: B, = 0 given in Section 11-5.1. This equivalence follows directly from Equation 11-46. 
The test procedure for the hypotheses 


Ho: P = Po 
Hy: p # po (11-47) 


where py ¥ 0 is somewhat more complicated. For moderately large samples (say, n = 25), the 
statistic 


1+R 


fa (11-48) 


1 
Z = arctanh R = > In 


is approximately normally distributed with mean and variance 


1 Ilt+p > 1 
Hz = arctanh p = = In 5 and rr 


respectively. Therefore, to test the hypothesis Hp: p = pp, we may use the test statistic 


Zo = (arctanh R — arctanh pp)(n — 3)!” (11-49) 


and reject Hy: p = fp if the value of the test statistic in Equation 11-49 is such that |zo| > z, /2» 
It is also possible to construct an approximate 100(1 — «)% confidence interval for p, using 
the transformation in Equation 11-48. The approximate 100(1 — «)% confidence interval is 


Zo/2 
Vin = 3 


Zo 
) <p < tanh (vera r+ 4 (11-50) 


tanh (arctan P= 


where tanh u = (e” — e“)/(e" + e-"). 
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Figure 11-13 Scatter 
plot of wire bond 
strength versus wire 0 5 10 


length, Example 11-8. Wire length 


EXAMPLE 11-8 Wire Bond Pull Strength 

In Chapter 1 (Section 1-3) an application of regression analysis 
is described in which an engineer at a semiconductor assembly 
plant is investigating the relationship between pull strength of a 
wire bond and two factors: wire length and die height. In this ex- 
ample, we will consider only one of the factors, the wire length. 
A random sample of 25 units is selected and tested, and the wire 
bond pull strength and wire length are observed for each unit. 
The data are shown in Table 1-2. We assume that pull strength 
and wire length are jointly normally distributed. 

Figure 11-13 shows a scatter diagram of wire bond 
strength versus wire length. We have used the Minitab option 
of displaying box plots of each individual variable on the scat- 
ter diagram. There is evidence of a linear relationship between 
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15 20 


Now S,. = 698.56 and S,,, = 2027.7132, and the sample 
correlation coefficient is 


Sxy 2027.7132 


” [S..S8S7]!2 — [(698.560)(6105.9) ] 


i = 0.9818 


Note that r? = (0.9818)’ = 0.9640 (which is reported in the 
Minitab output), or that approximately 96.40% of the variability 
in pull strength is explained by the linear relationship to wire 
length. 

Now suppose that we wish to test the hypotheses 


; Alo: p= 0 
the two variables. 
The Minitab output for fitting a simple linear regression Ay p #0 
model to the data is shown below. 
Regression Analysis: Strength versus Length 
The regression equation is 
Strength = 5.11 + 2.90 Length 
Predictor Coef SE Coef T Pp 
Constant 5.115 1.146 4.46 0.000 
Length 2.9027 0.1170 24.80 0.000 
S = 3.093 R-Sq = 96.4% R-Sq(adj) = 96.2% 


PRESS = 272.144 


Analysis of Variance 


Source DF 
Regression 1 
Residual Error 23 
Total 24 


R-Sq(pred) = 95.54% 


SS MS 
5885.9 5885.9 
220.1 9.6 


6105.9 


F P 
615.08 0.000 


with a = 0.05. We can compute the ¢-statistic of Equation 
11-46 as 


4 = PMB =2 _ _0.9818-V23 one 
Oo VWt—-r2 VI — 0.9640 


This statistic is also reported in the Minitab output as a test of 
Ay: By = 0. Because 9925.23 = 2.069, we reject Hy and con- 
clude that the correlation coefficient p # 0. 
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Finally, we may construct an approximate 95% confi- 
dence interval on p from Equation 11-50. Since arctanh r = 
arctanh 0.9818 = 2.3452, Equation 11-50 becomes 


t h( 2.3452 as) = <tonn(2 3452 + 5) 
an. a: = | = ta i = 
V2)” V22 


which reduces to 


0.9585 = p = 0.9921 


11-66. Suppose data is obtained from 20 pairs of (x, y) and 

the sample correlation coefficient is 0.8. 

(a) Test the hypothesis that Hp: p = 0 against H,: p # O with 
a = 0.05. Calculate the P-value. 

(b) Test the hypothesis that 1): p = 0.5 against H): p # 0.5 
with a = 0.05. Calculate the P-value. 

(c) Construct a 95% two-sided confidence interval for the 
correlation coefficient. Explain how the questions in parts 
(a) and (b) could be answered with a confidence interval. 


11-67. Suppose data are obtained from 20 pairs of (x, y) and 

the sample correlation coefficient is 0.75. 

(a) Test the hypothesis that Ho: p = 0 against H,: p > O with 
a = 0.05. Calculate the P-value. 

(b) Test the hypothesis that 1): p = 0.5 against H): p > 0.5 
with a = 0.05. Calculate the P-value. 

(c) Construct a 95% one-sided confidence interval for the 
correlation coefficient. Explain how the questions in 
parts (a) and (b) could be answered with a confidence 
interval. 


11-68. Arandom sample of n = 25 observations was made 

on the time to failure of an electronic component and the tem- 

perature in the application environment in which the compo- 

nent was used. 

(a) Given that r = 0.83, test the hypothesis that p = 0, using 
a = 0.05. What is the P-value for this test? 

(b) Find a 95% confidence interval on p. 

(c) Test the hypothesis Hp: p = 0.8 versus H;: p ¥ 0.8, using 
a = 0.05. Find the P-value for this test. 

11-69. A random sample of 50 observations was made on 

the diameter of spot welds and the corresponding weld shear 

strength. 

(a) Given that r = 0.62, test the hypothesis that p = 0, using 
a = 0.01. What is the P-value for this test? 

(b) Find a 99% confidence interval for p. 

(c) Based on the confidence interval in part (b), can you con- 
clude that p = 0.5 at the 0.01 level of significance? 


11-70. The following data gave X = the water content of 
snow on April 1 and Y = the yield from April to July (in 
inches) on the Snake River watershed in Wyoming for 1919 to 
1935. (The data were taken from an article in Research Notes, 
Vol. 61, 1950, Pacific Northwest Forest Range Experiment 
Station, Oregon.) 


x we x wa 
Pal 10.5 37.9 22.8 
32.8 16.7 30.5 14.1 
31.8 18.2 25.1 12.9 
32.0 17.0 12.4 8.8 
30.4 16.3 35.1 17.4 
24.0 10.5 31.5 14.9 
Bo) 23.1 21.1 10.5 
24.2 12.4 27.6 16.1 
5255) 24.9 


(a) Estimate the correlation between Y and_X. 

(b) Test the hypothesis that p = 0, using a = 0.05. 

(c) Fit a simple linear regression model and test for signifi- 
cance of regression using a = 0.05. What conclusions 
can you draw? How is the test for significance of regres- 
sion related to the test on p in part (b)? 

(d) Analyze the residuals and comment on model adequacy. 


11-71. The final test and exam averages for 20 randomly 

selected students taking a course in engineering statistics and a 

course in operations research follow. Assume that the final 

averages are jointly normally distributed. 

(a) Find the regression line relating the statistics final average 
to the OR final average. Graph the data. 

(b) Test for significance of regression using a = 0.05. 

(c) Estimate the correlation coefficient. 

(d) Test the hypothesis that p = 0, using a = 0.05. 

(e) Test the hypothesis that p = 0.5, using a = 0.05. 

(f) Construct a 95% confidence interval for the correlation 


coefficient. 

Statistics OR _ Statistics OR Statistics OR 
86 80 86 81 83 81 
75 81 71 76 75 70 
69 75 65 2, 71 73 
75 81 84 85 76 72 
90 92 71 2 84 80 
94 95 62 65 97 98 
83 80 90 93 


s 
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11-72. The weight and systolic blood pressure of 26 ran- 
domly selected males in the age group 25 to 30 are shown in 
the following table. Assume that weight and blood pressure 


_ are jointly normally distributed. 


Systolic Systolic 
Subject Weight BP Subject Weight BP 
1 165 130 14 172 153 
2 167 133 15 159 128 
3 180 150 16 168 132 
4 155 128 17 174 149 
5 212 151 18 183 158 
6 175 146 19 215 150 
7 190 150 20 195 163 
8 210 140 21 180 156 
9 200 148 DD) 143 124 
10 149 125 23 240 170 
11 158 133 24 235) 165 
12 169 135 25 192 160 
13 170 150 26 187 159 


(a) Find a regression line relating systolic blood pressure to 
weight. 

(b) Test for significance of regression using a = 0.05. 

(c) Estimate the correlation coefficient. 

(d) Test the hypothesis that p = 0, using a = 0.05. 

(e) Test the hypothesis that p = 0.6, using a = 0.05. 

(f) Construct a 95% confidence interval for the correlation 
coefficient. 


11-73. Inanarticle in JEEE Transactions on Instrumentation 
and Measurement (2001, Vol. 50, pp. 986-990), researchers 
studied the effects of reducing current draw in a magnetic core 
by electronic means. They measured the current in a magnetic 
winding with and without the electronics in a paired experi- 
ment. Data for the case without electronics are provided in the 
following table. 


Current Without 

Supply Voltage Electronics (mA) 
0.66 WD 
1.32 12.22 
1.98 16.34 
2.64 23.66 
3.3 28.06 
3.96 33.39 
4.62 34.12 
3.28 39.21 
5.94 44.21 


6.6 47.48 


(a) Graph the data and fit a regression line to predict cur- 
rent without electronics to supply voltage. Is there 
a significant regression at a = 0.05? What is the 
P-value? 

(b) Estimate the correlation coefficient. 

(c) Test the hypothesis that p = 0 against the alternative 
p # 0 with a = 0.05. What is the P-value? 

(d) Compute a 95% confidence interval for the correlation 
coefficient. 

11-74. The monthly absolute estimate of global (land 

and ocean combined) temperature indexes (degrees C) in 

2000 and 2001 are (source: http://www.ncdc.noaa.gov/ 

oa/climate/): 


2000: 12.28, 12.63, 13.22, 14.21, 15.13, 15.82, 16.05, 
16.02, 15.29, 14.29, 13.16, 12.47 

2001: 12.44, 12.55, 13.35, 14.22, 15.28, 15.99, 16.23, 
16.17, 15.44, 14.52, 13.52, 12.61 


(a) Graph the data and fit a regression line to predict 2001 
temperatures from those in 2000. Is there a significant 
regression at a = 0.05? What is the P-value? 

(b) Estimate the correlation coefficient. 

(c) Test the hypothesis that p = 0.9 against the alternative 
p # 0.9 with a = 0.05. What is the P-value? 

(d) Compute a 95% confidence interval for the correlation 
coefficient. 

11-75 Refer to the NFL quarterback ratings data in 

Exercise 11-3. 

(a) Estimate the correlation coefficient between the rat- 
ings and the average yards per attempt. 

(b) Test the hypothesis Hp: p = 0 versus H,: p # 0 using 
a = 0.05. What is the P-value for this test? 

(c) Construct a 95% confidence interval for p. 

(d) Test the hypothesis Ho: p = 0.7 versus H\: p # 0.7 us- 
ing a = 0.05. Find the P-value for this test. 

11-76. Consider the following (x, y) data. Calculate 

the correlation coefficient. Graph the data and comment 

on the relationship between x and y. Explain why the 
correlation coefficient does not detect the relationship 

between x and y. 


x yy x y 
=4 0 0 —4 
=3 —2.65 1 3.87 
=) 2.65 1 oe Ou 
=2 —3.46 2 3.46 
=) 3.46 2 —3.46 
—1 —3.87 3 2.65 
=i 3.87 3 = 253) 

0 4 4 0 
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11-9 REGRESSION ON TRANSFORMED VARIABLES 


We occasionally find that the straight-line regression model Y = By + B,x + € 1s inappropriate 
because the true regression function is nonlinear. Sometimes nonlinearity is visually deter- 
mined from the scatter diagram, and sometimes, because of prior experience or underlying the- 
ory, we know in advance that the model is nonlinear. Occasionally, a scatter diagram will exhibit 
an apparent nonlinear relationship between Y and x. In some of these situations, a nonlinear 
function can be expressed as a straight line by using a suitable transformation. Such nonlinear 
models are called intrinsically linear. 

As an example of a nonlinear model that is intrinsically linear, consider the exponential 
function 


Y= Boe?%e 


This function is intrinsically linear, since it can be transformed to a straight line by a logarithmic 
transformation 


In Y = InBp + B,x + Ine 


This transformation requires that the transformed error terms In € are normally and indepen- 
dently distributed with mean 0 and variance o”. 
Another intrinsically linear function is 


1 
y= By +B(+) +6 
By using the reciprocal transformation z = 1/x, the model is linearized to 


Y= £0 + Biz te 


Sometimes several transformations can be employed jointly to linearize a function. For ex- 
ample, consider the function 


1 
exp(By + Bx + €) 


Letting Y* = 1/Y, we have the linearized form 


InY* = Bo + Bix +e 


For examples of fitting these models, refer to Montgomery, Peck, and Vining (2006) or 
Myers (1990). 

Transformations can be very useful in many situations where the true relationship 
between the response Y and the regressor x is not well approximated by a straight line. The 
utility of a transformation is illustrated in the following example. 


EXAMPLE 11-9 Windmill Power 


A research engineer is investigating the use of a windmill to data. The regression model is 
generate electricity and has collected data on the DC output 
from this windmill and the corresponding wind velocity. The y = 0.1309 + 0.2411x 
data are plotted in Figure 11-14 and listed in Table 11-5 (p.439). 
Inspection of the scatter diagram indicates that the rela- The summary statistics for this model are R* = 0.8745, 


tionship between DC output Y and wind velocity (x) may be MS, = 6° = 0.0557, and Fy = 160.26 (the P-value is 
nonlinear. However, we initially fit a straight-line model to the <0.0001). 


438 


3.0 
; e oe 
a 2.0 ‘ ._ 2° 8 
3 Pe 
fo} 
B 10 gs 
e 
ee 
0.0 -y/ 2 
0 2 4 6 8 10 


Wind velocity, x 


Figure 11-14 Plot of DC output y versus wind velocity x 
for the windmill data. 


A plot of the residuals versus J; is shown in Figure 11-15. 
This residual plot indicates model inadequacy and implies that 
the linear relationship has not captured all of the information 
in the wind speed variable. Note that the curvature that was ap- 
parent in the scatter diagram of Figure 11-14 is greatly 
amplified in the residual plots. Clearly some other model form 
must be considered. 

We might initially consider using a quadratic model such as 


y = Bo + Bix + Box? + € 


to account for the apparent curvature. However, the scatter di- 
agram of Figure 11-14 suggests that as wind speed increases, 
DC output approaches an upper limit of approximately 2.5. 
This is also consistent with the theory of windmill operation. 
Since the quadratic model will eventually bend downward as 
wind speed increases, it would not be appropriate for these 
data. A more reasonable model for the windmill data that in- 
corporates an upper asymptote would be 


1 
y= By +B:(+) +6 


3.0 
Hoe 
ee 
a 2.0 te 
e 
=| a 
fo} 
B 10 > te 
e 
ee 
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0.10 0.20 0.30 0.40 0.50 


Figure 11-16 
windmill data. 


Plot of DC output versus x’ = 1/x for the 
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0.4 
0.2 ° m 


0.0 . 


-0.4 


-0.6 ° 


0.4 0.8 1.2 1.6 2.0 2.4 


Figure 11-15 Plot of residuals e; versus fitted 
values y; for the windmill data. 


Figure 11-16 is a scatter diagram with the transformed variable 
x’ = 1/x. This plot appears linear, indicating that the reciprocal 
transformation is appropriate. The fitted regression model is 


} = 2.9789 — 6.9345.’ 


The summary statistics for this model are R? = 0.9800, 
MS, = 6” = 0.0089, and Fy = 1128.43 (the P value is 
<0.0001). 

A plot of the residuals from the transformed model ver- 
sus ) is shown in Figure 11-17. This plot does not reveal any 
serious problem with inequality of variance. The normal prob- 
ability plot, shown in Figure 11-18, gives a mild indication 
that the errors come from a distribution with heavier tails than 
the normal (notice the slight upward and downward curve at 
the extremes). This normal probability plot has the z-score 
value plotted on the horizontal axis. Since there is no strong 
signal of model inadequacy, we conclude that the transformed 
model is satisfactory. 


0.4 
e e 
0.2 2 é e ar 
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e 
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Figure 11-17 Plot of residuals versus 
fitted values }; for the transformed model 
for the windmill data. 
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oe Observation Wind Velocity DC Output, 
Par | Number, 7 (mph), x; Vj 
- wooo’ 4 2.70 0.500 
eee? 5 10.00 2.236 
? or" 6 9.70 2.386 
e oe 7 9.55 2.294 
-0.2 o** 8 3.05 0.558 
: 9 8.15 2.166 
-0.4 . 10 6.20 1.866 
° 11 2.90 0.653 
-0.6 12 6.35 1.930 
2 -1 6) 1 2 
a 13 4.60 1.562 
Figure 11-18 Normal probability plot of os a0 ei 
the residuals for the transformed model for le ew une 
the windmill data. 16 3.60 1.137 
Table 11-5 Observed Values y; and Regressor Variable x; a aise ot 
for Example 11-9 — oe 
19 7.00 1.800 
Observation Wind Velocity DC Output, 20 5.45 1.501 
Numbers Enh Yi 21 9.10 2.303 
1 5.00 1.582 22 10.20 2.310 
2 6.00 1.822 23 4.10 1.194 
3 3.40 1.057 24 3.95 1.144 
continued 25 2.45 0.123 
EXERCISES FOR SECTION 11-9 
11-77. Determine if the following models are intrinsically (b) Fit a simple linear regression model to these data. — 
linear. If yes, determine the appropriate transformation to (c) Test for significance of regression using a = 0.05. What 5 
generate the linear model. conclusions can you draw? 

_ 8, _ 3+ 5x (d) Plot the residuals from the simple linear regression model 
P= pees (hy = es versus 3;. What do you conclude about model adequacy? 
(c) ¥ = BoBie (d) Y= x (e) The Clauss C lapeyron eens states ae In(P,)*— 7, 

Box + B, + xe where P,, is the vapor pressure of water. Repeat parts 
11-78. The vapor pressure of water at various temperatures (a)—(d). using an appropriate transformation. 
follows: 11-79. Anelectric utility is interested in developing a model 
relating peak hour demand (y in kilowatts) to total monthly 
Observation Vapor pressure energy usage during the month (x, in kilowatt hours). Data for 
Number, i Temperature (K) Cane 50 residential customers are shown in the following table. 
1 ils} 4.6 
a cle a2 Customer x y Customer x y 
3 293 eS 
rr 303 31.8 1 679 ~=—0.79 26 1434 0.31 
5 313 553 2 292 ~=-0..44 27 837 = 4.20 
6 323 92.5 3 1012 0.56 28 1748 4.88 
7 333 149.4 4 493 0.79 29 1381 3.48 
8 343 233.7 5) 582 2.70 30 1428 = =7.58 
9 353 355.1 6 1156 = 3.64 31 1255 2.63 
a re a a 
a a co 9 1097 = 5.34 34 2316 =8.19 
10 2078 6.85 35 1130 = 4.79 


(a) Draw a scatter diagram of these data. What type of 
relationship seems appropriate in relating y to x? continued 
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(a) Draw a scatter diagram of y versus x. 


uso a y (CUS a y (b) Fit the simple linear regression model. 
11 1818 5.84 36 463 0.51 (c) Test for significance of regression using a = 0.05. 
12 1700 5.21 37 7710 1.74 (d) Plot the residuals versus $; and comment on the underly- 
13 T47 = (3.25 38 724 4.10 ing regression assumptions. Specifically, does it seem that 
14 2030 84.43 39 808 3.94 the equality of variance assumption is satisfied? 
15 1643 3.16 40 790 0.96 (e) Find a simple linear regression model using Vy as the 
16 414. 0.50 41 733 3.29 response. Does this transformation on y stabilize the in- 
17 354. 0.17 42 406 0.44 equality of variance problem noted in part (d) above? 
18 1276 =: 11.88 43 1242 3.24 
19 745 (0.77 44 658 2.14 
20 795 3.70 45 1746 5.71 
21 540 0.56 46 895 4.12 
22 874 —-1.56 47 1114 1.90 
28 1543, 5.28 48 413 0.51 
24 1029 0.64 49 1787 8.33 
DS 710 ~=4.00 50 3560 =14.94 


11-10 LOGISTIC REGRESSION 


Linear regression often works very well when the response variable is quantitative. We now 
consider the situation where the response variable takes on only two possible values, 0 and 
1. These could be arbitrary assignments resulting from observing a qualitative response. 
For example, the response could be the outcome of a functional electrical test on a semi- 
conductor device for which the results are either a “success,” which means the device works 
properly, or a “failure,” which could be due to a short, an open, or some other functional 
problem. 
Suppose that the model has the form 


Y; = Bo + Bix, + (11-51) 


and the response variable Y; takes on the values either 0 or 1. We will assume that the response 
variable Y; is a Bernoulli random variable with probability distribution as follows: 


Y, Probability 
1 PY=l=q, 
0 P(Y; =0) =1-7,; 


Now since E(e;) = 0, the expected value of the response variable is 


E(Y,) = 1(m) + 011 — wi) 


This implies that 


12 
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This means that the expected response given by the response function E(Y;) = By + Bx; is 
just the probability that the response variable takes on the value 1. 

There are some substantive problems with the regression model in Equation 11-51. First, 
note that if the response is binary, the error terms €; can only take on two values, namely, 


€; = 1 — (Bo + By x) when Y; = | 
—(Bo + Bix) when Y; = 0 


é; 


Consequently, the errors in this model cannot possibly be normal. Second, the error variance 
is not constant, since 


oy, = E{Y, — E(Y)P 


(1 — war; + (0 — m)°(1 — a) 
a@{1 — 1) 


Notice that this last expression is just 


since E(Y;) = Bo + Bix; = 7;. This indicates that the variance of the observations (which is 
the same as the variance of the errors because e; = Y; — 7, and 7; is a constant) is a function 


L 


of the mean. Finally, there is a constraint on the response function, because 
O=E(Y) = 7,51 


This restriction can cause serious problems with the choice of a linear response function, as 
we have initially assumed in Equation 11-51. It would be possible to fit a model to the data for 
which the predicted values of the response lie outside the 0, 1 interval. 

Generally, when the response variable is binary, there is considerable empirical evidence 
indicating that the shape of the response function should be nonlinear. A monotonically 
increasing (or decreasing) S-shaped (or reverse S-shaped) function, such as shown in Figure 11-19, 
is usually employed. This function is called the logit response function, and has the form 


exp(Bo + Bix) 
1 + exp(Bo + By, x) 


AY)= (11-52) 


1.2 


Figure 11-19 Examples of the logistic response function. (a) E(Y) = 1/(1 + e7©°7!), (b) E(Y) = 1/(1 + e769F 1%), 
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or equivalently, 


1 
1 + exp[—(Bo + Bix)] 


E(Y) (11-53) 


In logistic regression we assume that E(Y) is related to x by the logit function. It is easy to 
show that 


EY) 
P= BY) = %P(Bo + Bx) (11-54) 
The quantity exp( By + Bx) on the right-hand side of Equation 11-54 is called the odds ratio. 
It has a straightforward interpretation: If the odds ratio is 2 for a particular value of x, it means 
that a success is twice as likely as a failure at that value of the regressor x. Notice that the 
natural logarithm of the odds ratio is a linear function of the regressor variable. Therefore the 
slope B, is the change in the log odds that results from a one-unit increase in x. This means that 
the odds ratio changes by e*' when x increases by one unit. 

The parameters in this logistic regression model are usually estimated by the method of 
maximum likelihood. For details of the procedure, see Montgomery, Peck, and Vining 
(2006). Minitab will fit logistic regression models and provide useful information on the 
quality of the fit. 

We will illustrate logistic regression using the data on launch temperature and O-ring fail- 
ure for the 24 space shuttle launches prior to the Challenger disaster of January 1986. There 
are six O-rings used to seal field joints on the rocket motor assembly. The table below presents 
the launch temperatures. A | in the “O-Ring Failure” column indicates that at least one O-ring 
failure had occurred on that launch. 


O-Ring O-Ring O-Ring 
Temperature Failure Temperature Failure Temperature Failure 
33} 1 68 0 WS 0 
56 1 69 0 13 1 
37/ 1 70 0 76 0 
63 0 70 1 76 0 
66 0 70 1 78 0 
67 0 70 1 719 0 
67 0 YZ 0 80 0 
67 0 73 0 81 0 


Figure 11-20 is a scatter plot of the data. Note that failures tend to occur at lower temperatures. 
The logistic regression model fit to this data from Minitab is shown in the following boxed 
display. 

The fitted logistic regression model is 


: 1 
* 1 + exp[—(10.875 — 0.17132x)] 
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Binary Logistic Regression: O-Ring Failure versus Temperature 


Link Function: Logit 
Response Information 
Variable Value Count 
O-Ring F 1 7 (Event) 

0 17 

Total 24 
Logistic Regression Table 

Odds 95% CI 

Predictor Coef SE Coef Z P Ratio Lower — Upper 
Constant 10.875 5.703 1.91 0.057 
Temperat —0.17132 0.08344 -—2.05 0.040 0.84 0.72 0.99 


Log-Likelihood = —11.515 
Test that all slopes are zero: G = 5.944, DF = 1, P-Value = 0.015 


The standard error of the slope B, is se(By) = 0.08344. For large samples, B, has an 
approximate normal distribution, and so Bi / se(B;) can be compared to the standard normal 
distribution to test Hy: B,; = 0. Minitab performs this test. The P-value is 0.04, indicating that 
temperature has a significant effect on the probability of O-ring failure. The odds ratio is 0.84, 
so every one degree increase in temperature reduces the odds of failure by 0.84. Figure 11-21 
shows the fitted logistic regression model. The sharp increase in the probability of O-ring 
failure is very evident in this graph. The actual temperature at the Challenger launch was 31°F. 
This is well outside the range of other launch temperatures, so our logistic regression model is 
not likely to provide highly accurate predictions at that temperature, but it is clear that a launch 
at 31°F is almost certainly going to result in O-ring failure. 

It is interesting to note that all of these data were available prior to launch. However, 
engineers were unable to effectively analyze the data and use them to provide a convincing 
argument against launching Challenger to NASA managers. Yet a simple regression analysis 
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Figure 11-20 Scatter plot of O-ring failures 
versus launch temperature for 24 space shuttle 
flights. 
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Figure 11-21 Probability of O-ring failure versus 
launch temperature (based on a logistic regression 
model). 
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of the data would have provided a strong quantitative basis for this argument. This is one of 
the more dramatic instances that points out why engineers and scientists need a strong 
background in basic statistical techniques. 


EXERCISES FOR SECTION 11-10 


11-80 A study was conducted attempting to relate home 
ownership to family income. Twenty households were selected 
and family income was estimated, along with information con- 
cerning home ownership (y = 1 indicates yes and y = 0 
indicates no). The data are shown below. 


Home 
Ownership 
Household Income Status 
1 38,000 0 
2 51,200 1 
3 39,600 0 
4 43,400 1 
5 47,700 0 
6 53,000 0 
7 41,500 1 
8 40,800 0 
9 45,400 1 
10 52,400 1 
11 38,700 1 
12 40,100 0 
13 49,500 1 
14 38,000 0 
15 42,000 1 
16 54,000 1 
17 51,700 1 
18 39,400 0 
19 40,900 0 
20 52,800 1 


(a) Fit a logistic regression model to the response variable y. 
Use a simple linear regression model as the structure for 
the linear predictor. 

(b) Is the logistic regression model in part (a) adequate? 

(c) Provide an interpretation of the parameter , in this model. 


11-81 The compressive strength of an alloy fastener used 
in aircraft construction is being studied. Ten loads were se- 
lected over the range 2500- 4300 psi and a number of fasten- 
ers were tested at those loads. The numbers of fasteners failing 
at each load were recorded. The complete test data follow. 


Load, x (psi) | Sample Size, n Number Failing, r 
2500 50 10 
2700 70 17 
2900 100 30 
3100 60 21 
3300 40 18 
3500 85 43 
3700 90 54 
3900 50 33 
4100 80 60 
4300 65 51 


(a) Fit a logistic regression model to the data. Use a simple 
linear regression model as the structure for the linear pre- 
dictor. 

(b) Is the logistic regression model in part (a) adequate? 


11-82 The market research department of a soft drink man- 
ufacturer is investigating the effectiveness of a price discount 
coupon on the purchase of a two-liter beverage product. A 
sample of 5500 customers was given coupons for varying 
price discounts between 5 and 25 cents. The response variable 
was the number of coupons in each price discount category re- 
deemed after one month. The data are shown below. 


Number 
Discount, x Sample Size, n Redeemed, r 
5 500 100 
7 500 122 
9 500 147 
11 500 176 
13 500 211 
15 500 244 
17 500 DHT 
19 500 310 
21 500 343 
23 500 372 
25 500 391 


(a) Fit a logistic regression model to the data. Use a simple lin- 
ear regression model as the structure for the linear predictor. 

(b) Is the logistic regression model in part (a) adequate? 

(c) Draw a graph of the data and the fitted logistic regression 
model. 


SS 


(d) Expand the linear predictor to include a quadratic term. Is 
there any evidence that this quadratic term is required in 
the model? 

(e) Draw a graph of this new model on the same plot that you pre- 
pared in part (c). Does the expanded model visually provide a 
better fit to the data than the original model from part (a)? 


11-83 A study was performed to investigate new automobile 
purchases. A sample of 20 families was selected. Each family was 
surveyed to determine the age of their oldest vehicle and their to- 
tal family income. A follow-up survey was conducted six months 
later to determine if they had actually purchased a new vehicle 
during that time period (y = 1 indicates yes and y = 0 indicates 
no). The data from this study are shown in the following table. 


Income, x, Age,x, y | Income,x, Age,x, y 
45,000 2 0 37,000 5 1 
40,000 4 0 31,000 q 1 
60,000 3 1 40,000 4 1 
50,000 2 1 75,000 2 0 
55,000 2 0 43,000 g 1 
50,000 5 1 49,000 2 0 
35,000 T 1 37,500 4 1 
65,000 2 1 71,000 1 0 
53,000 2 0 34,000 3) 0 
48,000 1 0 27,000 6 0 


(a) Fit a logistic regression model to the data. 

(b) Is the logistic regression model in part (a) adequate? 

(c) Interpret the model coefficients 8B, and Bp». 

(d) What is the estimated probability that a family with an in- 
come of $45,000 and a car that is five years old will pur- 
chase a new vehicle in the next six months? 

(e) Expand the linear predictor to include an interaction term. 
Is there any evidence that this term is required in the model? 


Supplemental Exercises 


11-84. Show that, for the simple linear regression model, 
the following statements are true: 


@ BOi-H=0 — &) Yl -F)xi=0 


12 _ 
© BIAS 


11-85. Anarticle in the JEEE Transactions on Instrumenta- 
tion and Measurement [“Direct, Fast, and Accurate Measure- 
ment of V; and K of MOS Transistor Using V;-Sift Circuit” 
(1991, Vol. 40, pp. 951-955)] described the use of a simple 
linear regression model to express drain current y (in 
milliamperes) as a function of ground-to-source voltage x 
(in volts). The data are as follows: 
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JP x Je x 
0.734 1.1 1.50 1.6 
0.886 1.2 1.66 1.7 
1.04 1.3 1.81 1.8 
1.19 1.4 1.97 1.9 
1.35 1.5 2.12 2.0 


(a) Draw a scatter diagram of these data. Does a straight-line 
relationship seem plausible? 

(b) Fit a simple linear regression model to these data. 

(c) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(d) Find a 95% confidence interval estimate on the slope. 

(e) Test the hypothesis Hy: By = 0 versus H,: By ~ 0 using 
a = 0.05. What conclusions can you draw? 

11-86. The strength of paper used in the manufacture of card- 

board boxes (y) is related to the percentage of hardwood con- 

centration in the original pulp (x). Under controlled conditions, a 

pilot plant manufactures 16 samples, each from a different batch 

of pulp, and measures the tensile strength. The data are shown 

in the table that follows: 


y 101.4 117.4 117.1 106.2 
x 1.0 1.5 1.5 1.5 
y 131.9 146.9 146.8 133.9 
x 2.0 2.0 2.2 2.4 
y 111.0 123.0 125.1 145.2 
x 2.5 2.5 2.8 2.8 
y 134.3 | 144.5 143.7 146.9 
x 3.0 | 3.0 3.2 3:3 


(a) Fit a simple linear regression model to the data. 

(b) Test for significance of regression using a = 0.05. 

(c) Construct a 90% confidence interval on the slope PB. 

(d) Construct a 90% confidence interval on the intercept Bp. 

(e) Construct a 95% confidence interval on the mean strength 
atx = 2.5. 

(f) Analyze the residuals and comment on model adequacy. 

11-87. Consider the following data. Suppose that the 

relationship between Y and x is hypothesized to be 

Y = (Bo + B,x + €) |. Fit an appropriate model to the data. 

Does the assumed model form seem reasonable? 


cl eae cal ea 


y lot | 0.13 | 0.09 | 0.15 | 0.20 | 0.21 | 0.18 | 0.24 


—< 


s 


> 
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11-88. The following data, adapted from Montgomery, 
Peck, and Vining (2006), present the number of certified men- 
tal defectives per 10,000 of estimated population in the United 
Kingdom (y) and the number of radio receiver licenses issued 
(x) by the BBC (in millions) for the years 1924 through 1937. 
Fit a regression model relating y and x. Comment on the 
model. Specifically, does the existence of a strong correlation 
imply a cause-and-effect relationship? 


Year y x Year y x 

1924 8 1.350 1931 16 4.620 
1925 8 1.960 1932 18 5.497 
1926 g) 2.270 1933 19 6.260 
1927 10 2.483 1934 20 7.012 
1928 11 2.730 1935 21 7.618 
1929 11 3.091 1936 22 8.131 
1930 12 3.674 1937 23 8.593 

11-89. Consider the weight and blood pressure data in 


Exercise 11-72. Fit a no-intercept model to the data, and com- 
pare it to the model obtained in Exercise 11-70. Which model 
is superior? 

11-90. An article in Air and Waste [“Update on Ozone 
Trends in California’s South Coast Air Basin” (Vol. 43, 1993)] 
studied the ozone levels on the South Coast air basin of 
California for the years 1976-1991. The author believes that the 
number of days that the ozone level exceeds 0.20 parts per 
million depends on the seasonal meteorological index (the sea- 
sonal average 850 millibar temperature). The data follow: 


Year Days Index Year Days Index 
1976 91 16.7 1984 81 18.0 
1977 105 17.1 1985 65 17.2 
1978 106 18.2 1986 61 16.9 
1979 108 18.1 1987 48 17.1 
1980 88 17.2 1988 61 18.2 
1981 91 18.2 1989 43 17.3 
1982 58 16.0 1990 33 17.5 
1983 82 17.2 1991 36 16.6 


(a) Construct a scatter diagram of the data. 

(b) Fit a simple linear regression model to the data. Test for 
significance of regression. 

(c) Find a 95% Cl on the slope B). 

(d) Analyze the residuals and comment on model adequacy. 


11-91. Anarticle in the Journal of Applied Polymer Science 
(Vol. 56, pp. 471-476, 1995) studied the effect of the mole 
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ratio of sebacic acid on the intrinsic viscosity of copolyesters. 
The data follow: 


Mole ratio 

x 1.0 | 0.9 | 0.8 | 0.7 | 0.6 | 0.5 | 0.4 | 0.3 
Viscosity 

y 0.45 | 0.20 | 0.34 | 0.58 | 0.70 | 0.57 | 0.55 | 0.44 


(a) Construct a scatter diagram of the data. 

(b) Fit a simple linear repression model. 

(c) Test for significance of regression. Calculate R? for the 
model. 

(d) Analyze the residuals and comment on model adequacy. 


11-92. Two different methods can be used for measuring 
the temperature of the solution in a Hall cell used in aluminum 


an 


smelting, a thermocouple implanted in the cell and an indirect { ] 


measurement produced from an IR device. The indirect 
method is preferable because the thermocouples are even- 
tually destroyed by the solution. Consider the following 10 
measurements: 


Thermocouple 921 935 916 920 940 
IR 918 934 924 | 921 945 
Thermocouple 936 925 940 | 933 927 
IR 930 919 943 | 932 935 


(a) Construct a scatter diagram for these data, letting x = 
thermocouple measurement and y = IR measurement. 

(b) Fit a simple linear regression model. 

(c) Test for significance a regression and calculate R?. What 
conclusions can you draw? 

(d) Is there evidence to support a claim that both devices pro- 
duce equivalent temperature measurements? Formulate 
and test an appropriate hypothesis to support this claim. 

(e) Analyze the residuals and comment on model adequacy. 


11-93. The grams of solids removed from a material (y) is 
thought to be related to the drying time. Ten observations 
obtained from an experimental study follow: 


y | 4.3 | 1.5 | 1.8 | 4.9 | 4.2 | 4.8 | 5.8 | 6.2 | 7.0 | 7.9 


x 125 | 3.0 | 3.5 | 4.0 | 4.5 | 5.0 | 5.5 | 6.0 | 6.5 | 7.0 


(a) Construct a scatter diagram for these data. 

(b) Fit a simple linear regression model. 

(c) Test for significance of regression. 

(d) Based on these data, what is your estimate of the mean grams 
of solids removed at 4.25 hours? Find a 95% confidence in- 
terval on the mean. 

(e) Analyze the residuals and comment on model adequacy. 


i) 


11-94. Cesium atoms cooled by laser light could be used 
to build inexpensive atomic clocks. In a study in JEEE 
Transactions on Instrumentation and Measurement (2001, 
Vol. 50, pp. 1224-1228), the number of atoms cooled by lasers 
of various powers were counted. 


Number of 
Power (mW) Atoms (X10E9) 

11 0 

12 0.02 

18 0.08 

21 0.13 

22, 0.15 

24 0.18 

28 0.31 

32 0.4 

37 0.49 

39 0.57 

41 0.64 

46 0.71 

48 0.79 

50 0.82 

51 0.83 


(a) Graph the data and fit a regression line to predict the num- 
ber of atoms from laser power. Comment on the adequacy 
of a linear model. 

(b) Is there a significant regression at a = 0.05? What is the 
P-value? 

(c) Estimate the correlation coefficient. 

(d) Test the hypothesis that p = 0 against the alternative 
p # 0 with a = 0.05. What is the P-value? 

(e) Compute a 95% confidence interval for the slope coefficient. 


11-95. The following data related diamond carats to pur- 


oe chase prices. It appeared in Singapore’s Business Times, 


February 18, 2000. 


Carat Price Carat Price 
0.3 1302 0.33 1327 
0.3 1510 0.33 1098 
0.3 1510 0.34 1693 
0.3 1260 0.34 1551 
0.31 1641 0.34 1410 
0.31 1555 0.34 1269 
0.31 1427 0.34 1316 
0.31 1427 0.34 1222 


0.31 1126 0.35 1738 
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Carat Price Carat Price 
0.31 1126 0.35 1593 
0.32 1468 0.35 1447 
0.32 1202 0.35 1255 
0.36 1635 0.45 1572 
0.36 1485 0.46 2942 
0.37 1420 0.48 2532 
0.37 1420 0.5 3501 
0.4 1911 0.5 3501 
0.4 1525 0.5 3501 
0.41 1956 0.5 3293 
0.43 1747 0.5 3016 


(a) Graph the data. What is the relation between carat and 
price? Is there an outlier? 

(b) What would you say to the person who purchased the 
diamond that was an outlier? 

(c) Fit two regression models, one with all the data and the 
other with unusual data omitted. Estimate the slope coeffi- 
cient with a 95% confidence interval in both cases. 
Comment on any difference. 

11-96. The following table shows the population and the 
average count of wood storks sighted per sample period for 
South Carolina from 1991 to 2004. Fit a regression line 
with population as the response and the count of wood 
storks as the predictor. Such an analysis might be used to 
evaluate the relationship between storks and babies. Is re- 
gression significant at « = 0.05? What do you conclude about 
the role of regression analysis to establish a cause-and-effect 
relationship? 


Year Population Stork Count 
1991 3,559,470 0.342 
1992 3,600,576 0.291 
1993 3,634,507 0.291 
1994 3,666,456 0.291 
1995 3,699,943 0.291 
1996 =. 3,738,974 0.509 
1997 3,790,066 0.294 
1998 3,839,578 0.799 
1999 =. 3,885,736 0.542 
2000 4,012,012 0.495 
2001 4,061,209 0.859 
2002 =4,105,848 0.364 
2003 4,148,744 0.501 
2004 4,198,068 0.656 
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MIND-EXPANDING EXERCISES 


11-97. Suppose that we have n pairs of observations 
(x; y;) such that the sample correlation coefficient r is 
unity (approximately). Now let z; = y?and consider the 
sample correlation coefficient for the n-pairs of data 
(x; z;). Will this sample correlation coefficient be ap- 
proximately unity? Explain why or why not. 


11-98. Consider the simple linear regression model 
Y = By + Bix + €, with E(e) = 0, Ve) = o”, and the 
errors € uncorrelated. 

(a) Show that cov(Bo, 8;) = —x07/S,,,. 

(b) Show that cov(Y, 8) = 0. 

11-99. Consider the simple linear regression model 
Y = By + Bix + €, with E(e) = 0, Ve) = o”, and the 
errors € uncorrelated. 

(a) Show that E(é?) = E(MS;,) = o°. 

(b) Show that E(MS,) = o? + B7S,,. 

11-100. Suppose that we have assumed the straight- 
line regression model 


Y= Bo + Bix, + € 


but the response is affected by a second variable x» such 
that the true regression function is 


E(Y) = Bo + Bix, + Box» 


Is the estimator of the slope in the simple linear regres- 
sion model unbiased? 

11-101. Suppose that we are fitting a line and we 
wish to make the variance of the regression coefficient 
8, as small as possible. Where should the observations 
x; i = 1, 2,..., n, be taken so as to minimize Y(B,)? 
Discuss the practical implications of this allocation of 
the x;. 

11-102. Weighted Least Squares. Suppose that we 
are fitting the line Y = By + Byx + e, but the variance 


IMPORTANT TERMS AND CONCEPTS 


of Y depends on the level of x; that is, 


WY, |x) = 0? = 


where the w, are constants, often called weights. Show 
that for an objective function in which each squared 
residual is multiplied by the reciprocal of the variance of 
the corresponding observation, the resulting weighted 
least squares normal equations are 


n n n 
Bo Sw; + By D> wixi = > Wii 
= i=l = 
n n n 
A A lees 
Bo >» wx; + By > WX; = > WiXi Vj 
i=l i=l i=l 


Find the solution to these normal equations. The solutions 
are weighted least squares estimators of By and B,. 


11-103. 


Consider a situation where both Y and_X are 


random variables. Let s, and s, be the sample standard 
deviations of the observed x’s and y’s, respectively. 
Show that an alternative expression for the fitted simple 
linear regression model } = Bo te Bix is 


Ss 


y 
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11-104. 


Suppose that we are interested in fitting a 


simple linear regression model Y = By + Byx + «€, 


where the intercept, Bo, is known. 


(a) Find the least squares estimator of B;. 
(b) What is the variance of the estimator of the slope in 


part (a)? 


(c) Find an expression for a 100(1 — «)% confidence in- 
terval for the slope 8). Is this interval longer than the 
corresponding interval for the case where both the in- 
tercept and slope are unknown? Justify your answer. 


Analysis of variance 
test in regression 
Confidence interval 
on mean response 
Correlation 
coefficient 
Empirical model 


Confidence intervals on 
model parameters 
Intrinsically linear model 
Least squares estimation 
of regression model 

parameters 
Logistic regression 


Model adequacy checking 

Odds ratio 

Prediction interval on a 
future observation 

Regression analysis 

Residual plots 

Residuals 


Scatter diagram 
Significance of regression 
Simple linear regression 
model standard errors 
Statistical tests on 
model parameters 
Transformations 
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Multiple Linear Regression 


This chapter generalizes the simple linear regression to a situation where there is more 
than one predictor or regressor variable. This situation occurs frequently in science and 
engineering; for example, in Chapter 1 we provided data on the pull strength of a wire 
bond on a semiconductor package and illustrated its relationship to the wire length and 
the die height. Understanding the relationship between strength and the other two vari- 
ables may provide important insight to the engineer when the package is designed, or to 
the manufacturing personnel who assemble the die into the package. We used a multiple 
linear regression model to relate strength to wire length and die height. There are many 
examples of such relationships: The life of a cutting tool is related to the cutting speed and 
the tool angle; patient satisfaction in a hospital is related to patient age, type of procedure 
performed, and length of stay; and the fuel economy of a vehicle is related to the type of 
vehicle (car versus truck), engine displacement, horsepower, type of transmission, and 
vehicle weight. Multiple regression models give insight into the relationships between 
these variables that can have important practical implications. 

This chapter shows how to fit multiple linear regression models, perform the statis- 
tical tests and confidence procedures that are analogous to those for simple linear 
regression, and check for model adequacy. We also show how models that have polyno- 
mial terms in the regressor variables are just multiple linear regression models. We also 
discuss some aspects of building a good regression model from a collection of candidate 
regressors. 


CHAPTER OUTLINE 

12-1 MULTIPLE LINEAR REGRESSION 12-1.3 Matrix Approach to Multiple 
MODEL Linear Regression 
12-1.1 Introduction 12-1.4 Properties of the Least Squares 


12-1.2 Least Squares Estimation of Estimators 


the Parameters 
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12-2. HYPOTHESIS TESTS INMULTIPLE 12-4 PREDICTION OF NEW 
LINEAR REGRESSION OBSERVATIONS 
12-2.1 Test for Significance of 12-5 MODEL ADEQUACY CHECKING 
Regression 


12-2.2 Tests on Individual Regression 
Coefficients and Subsets of 


12-5.1 Residual Analysis 


12-5.2 Influential Observations 


Gach aes 12-6 ASPECTS OF MULTIPLE 
REGRESSION MODELING 
12-3. CONFIDENCE INTERVALS 
IN MULTIPLE LINEAR 12-6.1 Polynomial Regression Models 
REGRESSION 


12-3.1 Confidence Intervals on 
Individual Regression 


12-6.2 Categorical Regressors and 
Indicator Variables 


12-6.3 Selection of Variables and 


Coefficients 


Model Building 


12-3.2 Confidence Interval on 12-6.4 Multicollinearity 


the Mean Response 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 


1. Use multiple regression techniques to build empirical models to engineering and scientific 
data 


Understand how the method of least squares extends to fitting multiple regression models 
Assess regression model adequacy 


Test hypotheses and construct confidence intervals on the regression coefficients 


nN & WN 


Use the regression model to estimate the mean response and to make predictions and to construct 
confidence intervals and prediction intervals 


6. Build regression models with polynomial terms 


7. Use indicator variables to model categorical regressors 


8. Use stepwise regression and other model building techniques to select the appropriate set of vari- 
ables for a regression model 


12-1 MULTIPLE LINEAR REGRESSION MODEL 
12-1.1 Introduction 


Many applications of regression analysis involve situations in which there are more than one 

regressor or predictor variable. A regression model that contains more than one regressor vari- 
able is called a multiple regression model. 

As an example, suppose that the effective life of a cutting tool depends on the cutting speed 

and the tool angle. A multiple regression model that might describe this relationship is 

Y = Bo + Bix, + Box. + € (12-1) 

where Y represents the tool life, x, represents the cutting speed, x, represents the tool angle, 

and € is a random error term. This is a multiple linear regression model with two regressors. 

The term linear is used because Equation 12-1 is a linear function of the unknown parameters 


Bo, B,, and Bo. 


Figure 12-1 
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(a) 


(b) 
(a) The regression plane for the model E(Y) = 50 + 10x, + 7x. (b) The contour plot. 


The regression model in Equation 12-1 describes a plane in the three-dimensional space 
of Y, x,, and x». Figure 12-1(a) shows this plane for the regression model 


E(Y) = 50+ 10x, + 7X9 


where we have assumed that the expected value of the error term is zero; that is E(e) = 0. The 
parameter By is the intercept of the plane. We sometimes call B, and 8, partial regression 
coefficients, because B, measures the expected change in Y per unit change in x, when x, is 
held constant, and B, measures the expected change in Y per unit change in x, when x, is held 
constant. Figure 12-1(b) shows a contour plot of the regression model—that is, lines of con- 
stant E(Y) as a function of x, and x). Notice that the contour lines in this plot are straight lines. 

In general, the dependent variable or response Y may be related to k independent or 
regressor variables. The model 


Y = Bo + By + Bory ++ + yxy + € (12-2) 


is called a multiple linear regression model with k regressor variables. The parameters 8,, 
j = 0, 1,...,k, are called the regression coefficients. This model describes a hyperplane in 
the k-dimensional space of the regressor variables {x;}. The parameter B; represents the 
expected change in response Y per unit change in x; when all the remaining regressors x; i # j) 
are held constant. 

Multiple linear regression models are often used as approximating functions. That is, the 
true functional relationship between Y and x), x5, ... , x, iS unknown, but over certain ranges 
of the independent variables the linear regression model is an adequate approximation. 

Models that are more complex in structure than Equation 12-2 may often still be analyzed 
by multiple linear regression techniques. For example, consider the cubic polynomial model 
in one regressor variable. 


Y=By) + Bix t+ Box? + Bax te (12-3) 


If we let x, = x, x) = x’, x; = x°, Equation 12-3 can be written as 
Y = Bo + Bixy + Boro + Bsx3 + € (12-4) 


which is a multiple linear regression model with three regressor variables. 
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Models that include interaction effects may also be analyzed by multiple linear regres- 
sion methods. An interaction between two variables can be represented by a cross-product 
term in the model, such as 


Y = Bo + Bix + Boro + Birr. + € (12-5) 
If we let x, = x,x, and B, = Bj, Equation 12-5 can be written as 
Y = Bo + Bix + Box. + Bgx3 + € 


which is a linear regression model. 
Figure 12-2(a) and (b) shows the three-dimensional plot of the regression model 


Y=50+ 10x, + 7X + 5x 1X2 


and the corresponding two-dimensional contour plot. Notice that, although this model is a 
linear regression model, the shape of the surface that is generated by the model is not linear. 
In general, any regression model that is linear in parameters (the B’s) is a linear regression 
model, regardless of the shape of the surface that it generates. 

Figure 12-2 provides a nice graphical interpretation of an interaction. Generally, interaction 
implies that the effect produced by changing one variable (x,, say) depends on the level of the 
other variable (x,). For example, Fig. 12-2 shows that changing x, from 2 to 8 produces a much 
smaller change in E(Y) when x, = 2 than when x, = 10. Interaction effects occur frequently in 
the study and analysis of real-world systems, and regression methods are one of the techniques 
that we can use to describe them. 

As a final example, consider the second-order model with interaction 


Y = Bo + Bixy + Boxe + Brixt + Borx3 + Bioxixy + € (12-6) 


If we let x, = x7, x4 = x3,X5 = X1%, Bs = By, Bs = Bo», and B; = B,, Equation 12-6 can be 
written as a multiple linear regression model as follows: 


Y = Bo + Bix + Box. + B3x3 + Bax + Bsx5 + € 
Figure 12-3(a) and (b) show the three-dimensional plot and the corresponding contour plot for 
E(Y) = 800 + 10x, + 7x. — 8.5x7 — 5x3 + 4x x, 
These plots indicate that the expected change in Y when x, is changed by one unit (say) is a 
function of both x, and x,. The quadratic and interaction terms in this model produce a mound- 
shaped function. Depending on the values of the regression coefficients, the second-order 


model with interaction is capable of assuming a wide variety of shapes; thus, it is a very 
flexible regression model. 


12-1.2 Least Squares Estimation of the Parameters 


The method of least squares may be used to estimate the regression coefficients in the mul- 
tiple regression model, Equation 12-2. Suppose that n > k observations are available, and let 
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Figure 12-2 (a) Three-dimensional plot of the regression model Figure 12-3 (a) Three-dimensional plot of the regression 

E(Y) = 50 + 10x; + 7x2 + 5x,x. (b) The contour plot. model E(Y) = 800 + 10x, + 7x, — 8.5x7 — 5x3 + 4x42). 
(b) The contour plot. 


X;; denote the ith observation or level of variable x; The observations are 
(Xj 1 Xi25 +++» Xike Vis i= 1, 25 sec Kt and n>k 


It is customary to present the data for multiple regression in a table such as Table 12-1. 
Each observation (%;1, X;2, ---, Xi, ¥;), Satisfies the model in Equation 12-2, or 


Vi = Bo + Brxin + Borin Foe + Bex + & 


=Bot DS Bxyte §=1,2,...,n (12-7) 
1 


y XxX X2 noo Xx 
yi X11 Xi2 Xk 
y2 X21 X22 X2K 
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The least squares function is 


n 


n k 2 
=> == @ =py= > 8x, ) (12-8) 
i=1 j=l 


i=1 


We want to minimize L with respect to By, B,,..., By. The least squares estimates of Bo, 
B,,..., B, must satisfy 
OL n : ko 
OBo ls « = -2 >) (9: - Bo — > Bexy ) = 0 (12-9a) 
Bo Bos Bi.--- Be i=l j=l 
and 
OL n . k m : 
aa lee ag em 7 Bo De Btw ay = 90 F= 12-4 (2-96) 
Bj Bo.Bi.---s By i=1 j=l 


Simplifying Equation 12-9, we obtain the least squares normal equations 
~ ~ n oe a n n 
nmBo + Bi xn + Box t+ Be De = Dy 
i=1 i=1 i=1 


n n n n n 

P r : 7 é 

Bo ari +B Sxh + Bo Daze to + Be DS xe = Yomi 
i= i=1 i=1 1=1 i=1 


mn n i n ee n z n n 
Bo it + Bi > xxi + Bo Sot t+ Be x = Dixyy;, (12-10) 
i= =I =i = =I 


Note that there are p = k + 1 normal equations, one for each of the unknown regression 
coefficients. The solution to the normal equations will be the least squares estimators of the 
regression coefficients, Bo, By, ..., By. The normal equations can be solved by any method 


appropriate for solving a system of linear equations. 


EXAMPLE 12-1 Wire Bond Strength 
In Chapter 1, we used data on pull strength of a wire bond ina 
semiconductor manufacturing process, wire length, and die 
height to illustrate building an empirical model. We will use 
the same data, repeated for convenience in Table 12-2, and 
show the details of estimating the model parameters. A three- 
dimensional scatter plot of the data is presented in Fig. 1-15. 
Figure 12-4 shows a matrix of two-dimensional scatter plots of 
the data. These displays can be helpful in visualizing the 
relationships among variables in a multivariable data set. For 
example, the plot indicates that there is a strong linear 
relationship between strength and wire length. 

Specifically, we will fit the multiple linear regression 
model 


Y= Bo + Bix, + Box + € 


where Y = pull strength, x, = wire length, and x, = die 
height. From the data in Table 12-2 we calculate 


25 
n= 25, >) yj = 725.82 
i=1 


25 25 
Dd x1 = 206, S x2 = 8,294 
i=1 i=1 
25 25 
SD xh = 2,396, Sixz = 3,531,848 
i=1 i=1 


25 25 
>) xn%2 = 77,177, >) xy; = 8,008.47, 
i=1 i=1 


25 
SD x29; = 274,816.71 
i=1 
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Table 12-2 Wire Bond Data for Example 12-1 


Observation Pull Strength Wire Length Die Height Observation Pull Strength Wire Length Die Height 
Number y xy Xz Number y xy Xp 
1 9195 2 50 14 11.66 2 360 
2 24.45 8 110 15 21.65 4 205 
3 31.75 11 120 16 17.89 4 400 
4 35.00 10 550 17 69.00 20 600 
5) 25,02 8 295 18 10.30 1 585 
6 16.86 4 200 19 34.93 10 540 
7 14.38 2 375 20 46.59 15 250 
8 9.60 2 52 21 44.88 15 290 
g 24.35 9 100 22 54.12 16 510 
10 27.50 8 300 23 56.63 17 590 
11 17.08 4 412 24 22.13 6 100 
12 37.00 11 400 25 21.15 5 400 
13 41.95 12 500 
For the model Y = By + B,x, + B2x2 + €, the normal equa- Inserting the computed summations into the normal equa- 
tions 12-10 are tions, we obtain 


ll 


nBo + By pri + Bo SD x2 >); 25R) + 2068, + 82948, = 725.82 
i= i=] i=l 


2068) + 23968, +  77,177B> = 8,008.47 


n n n n 
Bo > sat 6, Sa + Bo Die = Dray - is m 
| a a a 82948) + 77,1778; + 3,531,8488, = 274,816.71 


n n n n 

a 4 % , 

Bo S x12 + Bi DS xxi + Bo Mx = DS xiv; 
I a I I 
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Figure 12-4 Matrix of scatter plots (from Minitab) for the wire bond pull 
strength data in Table 12-2. 
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The solution to this set of equations is Practical Interpretation: This equation can be used to 


By = 2.26379, 


Therefore, the fitted regression equation is 


B, = 2.74427, B, = 0.01253 


predict pull strength for pairs of values of the regressor vari- 
ables wire length (x,) and die height (x2). This is essentially 
the same regression model given in Section 1-3. Figure 1-16 
shows a three-dimensional plot of the plane of predicted val- 


¥ = 2.26379 + 2.74427x, + 0.01253x, ues j generated from this equation. 


12-1.3 Matrix Approach to Multiple Linear Regression 


Normal 
Equations 


In fitting a multiple regression model, it is much more convenient to express the mathemati- 
cal operations using matrix notation. Suppose that there are k regressor variables and n ob- 
servations, (Xj, Xj2,--->Xixs Ys i = 1, 2,..., and that the model relating the regressors to the 
response is 


Yi = Bo + Bit + Boxji2 +7 + Byxin + E; i= 1,2,...57 


This model is a system of n equations that can be expressed in matrix notation as 


y=Xpte (12-11) 
where 
1 Lox x2 Xk Bo €1 
y2 1 x x ae E> 
y= xX- 21 22 2k B= By Raat iene 
Yn 1 Xnt = Xn2 sie Xnk By En 


In general, y is an (n X 1) vector of the observations, X is an (n X p) matrix of the levels 
of the independent variables (assuming that the intercept is always multiplied by a constant 
value—unity), B is a(p X 1) vector of the regression coefficients, and € is a (n X 1) vector 
of random errors. The X matrix is often called the model matrix. 

We wish to find the vector of least squares estimators, B, that minimizes 


L= > e; = €’e = (y — XB)'(y — XB) 


The least squares estimator B is the solution for B in the equations 


oL 
ak 


We will not give the details of taking the derivatives above; however, the resulting equations 
that must be solved are 


X’XB = X’y (12212) 


Least Square 
Estimate of B 
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Equations 12-12 are the least squares normal equations in matrix form. They are identical to 
the scalar form of the normal equations given earlier in Equations 12-10. To solve the normal 
equations, multiply both sides of Equations 12-12 by the inverse of X’X. Therefore, the least 
squares estimate of B is 


B= x) xy (12-13) 


Note that there are p = k + 1 normal equations in p = k + 1 unknowns (the values of 
Bo, Bi, aks B,). Furthermore, the matrix X’X is always nonsingular, as was assumed above, 
so the methods described in textbooks on determinants and matrices for inverting these ma- 
trices can be used to find (X’X)~!. In practice, multiple regression calculations are almost 
always performed using a computer. 

It is easy to see that the matrix form of the normal equations is identical to the scalar form. 
Writing out Equation 12-12 in detail, we obtain 


n n n 
n Dx Sx oe S xin a“ . 
i=l i=l i=l 
n n n n 
ya >i Drive a Dra 6 Su. 


n n n n , ; 
: aX; cenin: «oe 2 e n 
»» Xik »» NikXil »» Nik Xi2 »» Nik By Ss Rav: 


If the indicated matrix multiplication is performed, the scalar form of the normal equations 
(that is, Equation 12-10) will result. In this form it is easy to see that X’X is a (p X p) sym- 
metric matrix and X’y is a(p X 1) column vector. Note the special structure of the X’X ma- 
trix. The diagonal elements of X’X are the sums of squares of the elements in the columns of 
X, and the off-diagonal elements are the sums of cross-products of the elements in the 
columns of X. Furthermore, note that the elements of X’y are the sums of cross-products of 
the columns of X and the observations {y;}. 
The fitted regression model is 


5=Bot DSBx, §=1,2,...,0 (12-14) 
1 
In matrix notation, the fitted model is 
y = Xp 


The difference between the observation y, and the fitted value $; is a residual, say, 
e; = y; — y;. The (n X 1) vector of residuals is denoted by 


e=y-j (12-15) 
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EXAMPLE 12-2 Wire Bond Strength with Matrix Notation 


In Example 12-1, we illustrated fitting the multiple regression 
model 


y = Bo + Bix, + Box. + € 


where y is the observed pull strength for a wire bond, x, is the 
wire length, and x, is the die height. The 25 observations are in 
Table 12-2. We will now use the matrix approach to fit the re- 
gression model above to these data. The model matrix X and y 
vector for this model are 


1 2 50 9.95 
1 8 110 24.45 
1 11 120 31.75 
1 10 550 35.00 
1 8 295 25.02 
1 4 200 16.86 
1 2 375 14.38 
1 2 382 9.60 
1 9 100 24.35 
1 8 300 27.50 
1 4 412 17.08 
1 11 400 37.00 
X=] 1 12 500 | y=| 41.95 
1 2 360 11.66 
1 4 205 21.65 
1 4 400 17.89 
1 20 600 69.00 
1 1 585 10.30 
1 10 540 34.93 
1 15 250 46.59 
1 15 290 44.88 
1 16 510 54.12 
1 17 590 56.63 
1 100 22.13 
1 5 400 21.15 
The X’X matrix is 
ea ee. 2 50 
X’X=|2 8 - 5 : — 
L50 110 --- 400 i. apo 
| 25 206 8,294 
=| 206 2,396 77,177 
| 8,294 77,177 3,531,848 


and the X’y vector is 


ae CT a | 725.82 
24.45 
Xy=(2 8 -- 5 >” | =| 8,008.47 
50 110 -- 400]} | ° 274,816.71 
21.15 


The least squares estimates are found from Equation 12-13 as 


B = (X'X)'X’y 
or 
Bo 25 206 8,294 ]~![ 725.82 
6,} =| 206 2,396 77,177 8,008.37 
Bo 8,294 77,177 3,531,848 274,811.31 
0.214653 —0.007491 —0.000340 725.82 
= | —0.007491 0.001671 —0.000019 8,008.47 
| —0.000340 —0.000019 +0.0000015 } |. 274,811.31 
[ 2.26379143 
= | 2.74426964 
| 0.01252781 


Therefore, the fitted regression model with the regression 
coefficients rounded to five decimal places is 


¥ = 2.26379 + 2.74427x, + 0.01253x, 


This is identical to the results obtained in Example 12-1. 

This regression model can be used to predict values of 
pull strength for various values of wire length (x,) and die 
height (x2). We can also obtain the fitted values }; by substi- 
tuting each observation (x;,, x;2), i= 1, 2,..., n, into the 
equation. For example, the first observation has x,, = 2 and 
Xj. = 50, and the fitted value is 


$1 = 2.26379 + 2.74427x,, + 0.01253x19 
= 2.26379 + 2.74427(2) + 0.01253(50) 
= 8.38 


The corresponding observed value is y, = 9.95. The residual 
corresponding to the first observation is 


e=y-h 
= 9.95 — 8.38 
= 1.57 


Table 12-3 displays all 25 fitted values ¥; and the correspon- 
ding residuals. The fitted values and residuals are calculated to 
the same accuracy as the original data. 
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Table 12-3 Observations, Fitted Values, and Residuals for Example 12-2 


Observation Observation 
Number Ji di = yi — Yi Number Ji Sj = Si wy 
1 9.95 8.38 1.57 14 11.66 12.26 —0.60 
2 24.45 25.60 —1.15 15 21.65 15.81 5.84 
3 31.75 83395) 20) 16 17.89 18.25 —0.36 
4 35.00 36.60 —1.60 17 69.00 64.67 4.33 
5) 25.02 27.91 —2.89 18 10.30 12.34 —2.04 
6 16.86 15.75 1.11 19 34.93 36.47 —1.54 
7 14.38 12.45 1.93 20 46.59 46.56 0.03 
8 9.60 8.40 1.20 21 44.88 47.06 —2.18 
9 24.35 28.21 —3.86 22 54.12 52.56 1.56 
10 27.50 27.98 —0.48 23 56.63 56.31 0.32 
11 17.08 18.40 = 1l3v2 24 22 MB) 19.98 2.15 
12 37.00 37.46 —0.46 25 21.15 21.00 0.15 
13 41.95 41.46 0.49 
Computers are almost always used in fitting multiple regression models. Table 12-4 pre- 
sents some annotated output from Minitab for the least squares regression model for wire bond 
pull strength data. The upper part of the table contains the numerical estimates of the regres- 
sion coefficients. The computer also calculates several other quantities that reflect important 
information about the regression model. In subsequent sections, we will define and explain the 
quantities in this output. 
Estimating 07 
Just as in simple linear regression, it is important to estimate o”, the variance of the error term 
€, ina multiple regression model. Recall that in simple linear regression the estimate of o” was 
obtained by dividing the sum of the squared residuals by n — 2. Now there are two parame- 
ters in the simple linear regression model, so in multiple linear regression with p parameters a 
logical estimator for o7 is 
Estimator 
of Variance u 


ge == (12-16) 


This is an unbiased estimator of o’. Just as in simple linear regression, the estimate of 0” is usu- 
ally obtained from the analysis of variance for the regression model. The numerator of Equation 
12-16 is called the error or residual sum of squares, and the denominator n — p is called the 
error or residual degrees of freedom. 

We can find a computing formula for SS, as follows: 


SSp = Yi ie 5) = de =e'e 
i=l i=l 
Substituting e = y — y=y— xp into the above, we obtain 
SSp = y'y — B'X'y 
= 27,178.5316 — 27,063.3581 = 115.174 (12-17) 
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Table 12-4 Minitab Multiple Regression Output for the Wire Bond Pull Strength Data 


Regression Analysis: Strength versus Length, Height 


The regression equation is 
Strength = 2.26 + 2.74 Length + 0.0125 Height 


Predictor Coef SE Coef T P VIF 
Constant By 2.264 1.060 2.14 0.044 
Length 6, > 2.74427 0.09352 29.34 0.000 «1.2 


Height 8, 0.012528 0.002798 448 0.000 12 


S = 2.288 R-Sq = 98.1% R-Sq (adj) = 97.9% 
PRESS = 156.163 R-Sq (pred) = 97.44% 


Analysis of Variance 


Source DF SS MS F P 
Regression 2 5990.8 2995.4 572.17 0.000 
Residual Error 22 115.2 5.2< 6 

Total 24 6105.9 

Source DF Seq SS 

Length 1 5885.9 

Height 1 104.9 


Predicted Values for New Observations 


New Obs Fit SE Fit 95.0% CI 95.0% PI 
1 27.663 0.482 (26.663, 28.663) (22.814, 32.512) 


Values of Predictors for New Observations 


New Obs Length Height 
1 8.00 275 


Table 12-4 shows that the estimate of o? for the wire bond pull strength regression model 
is 6? = 115.2/22 = 5.2364. The Minitab output rounds the estimate to 6? = 5.2. 


12-1.4 Properties of the Least Squares Estimators 
The statistical properties of the least squares estimators Bo, B,, ..., 8, may be easily found, 
under certain assumptions on the error terms €,, €5,..., €,,, in the regression model. Paralleling 
the assumptions made in Chapter 11, we assume that the errors €; are statistically independent 
with mean zero and variance o”. Under these assumptions, the least squares estimators 
Bo, B;,..., 8, are unbiased estimators of the regression coefficients Bo, B,, ... , By. This 
property may be shown as follows: 


[(X'X) 'X’Y] 

E[(X’X)'X'(XB + €)] 

= E[(X'X)!X’XB + (X'X)!X’e] 
B 


= 

wo 
II 
eI 


since E(é) = 0 and (X'X) 'X’X =I, the identity matrix. Thus, B is an unbiased estimator of B. 
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The variances of the B’s are expressed in terms of the elements of the inverse of the X’X 
matrix. The inverse of X’X times the constant o7 represents the covariance matrix of the 
regression coefficients B. The diagonal elements of o” (X'x)! are the variances of Bo, 
Bi. or Bis and the off-diagonal elements of this matrix are the covariances. For example, if we 
have k = 2 regressors, such as in the pull-strength problem, 


Coo Cor Cor 
C=(X'X)7=]Cy Cu Cp 
Coo Cr Cr 
which is symmetric (Cyy = Co}, Cop = Cop, and C,, = C,2) because (X'X)~! is symmetric, 


and we have 


V(b) =0C;,  jf=0,1,2 
cov(B;, Bj) = oC; i Fj 


In general, the covariance matrix of pB isa(p X p) symmetric matrix whose jjth element is the 
variance of B; and whose i, jth element is the covariance between ; and §,, that is, 


cov(B) = 


The estimates of the wananees of these regression coefficients are obtained by replacing 
o” with an estimate. When o? is replaced by its estimate G7, the square root of the estimated 
variance of the jth regression coefficient is called the estimated standard error of B; or 
Se (6) = V 6°Cj. . These standard errors are a useful measure of the precision of estimation 
for the deprestion coefficients; small standard errors imply good precision. 

Multiple regression computer programs usually display these standard errors. For 
example, the Minitab output in Table 12-4 reports se(Bo) = 1.060, se(B,) = 0.09352, and 
se(Bo) = = 0.002798. The intercept estimate is about twice the magnitude of its standard error, and 
8, and £5 are considerably larger than se(f,) and se({5). This implies reasonable precision 
of estimation, although the parameters B, and 8, are much more precisely estimated than the 


o°(X’X) | =0°C 


intercept (this is not unusual in multiple regression). 


EXERCISES FOR SECTION 12-1 


12-1. A study was performed to investigate the shear 

strength of soil (y) as it related to depth in feet (x,) and % 

moisture content (x). Ten observations were collected, and the 

following summary quantities obtained: n = 10, Sx; = 223, 

Yxp = 553, Sy; = 1,916, Sx}, = 5,200.9, Dx% = 31,729, 

LDxpX = 12,352, Vx; = 43,550.8, Yxjy; = 104,736.8, 

and Sy? = 371,595.6. 

(a) Set up the least squares normal equations for the model 
Y= Bo + Bit + Box) + €. 

(b) Estimate the parameters in the model in part (a). 

(c) What is the predicted strength when x, = 18 feet and 
Xy = 43%? 

12-2. A regression model is to be developed for predicting 

the ability of soil to absorb chemical contaminants. Ten obser- 

vations have been taken on a soil absorption index (y) and two 

regressors: x, = amount of extractable iron ore and x, = 


amount of bauxite. We wish to fit the model Y = By + Bx, + 
Box, + €. Some necessary quantities are: 


. 1.17991 ~7,30982 E-3 7.3006 E-4 
(X’X)"! = |-7.30982 E-3 7.9799E-5 —1.23713E-4|, 
| 7.3006 E-4 —1.23713E-4 4.6576 E-4 
r 220 
X’y = | 36,768 
| 9,965 


(a) Estimate the regression coefficients in the model specified 
above. 

(b) What is the predicted value of the absorption index y 
when x, = 200 and x, = 50? 

12-3. A chemical engineer is investigating how the amount 

of conversion of a product from a raw material (y) depends on 
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reaction temperature (x,) and the reaction time (x,). He has de- 
veloped the following regression models: 
1. § = 100 + 2x, + 4x, 
2.5=95 4 
Both models have been built over the range 0.5 = x, = 10. 
(a) What is the predicted value of conversion when x, = 2? 
Repeat this calculation for x, = 8. Draw a graph of the 
predicted values for both conversion models. Comment 
on the effect of the interaction term in model 2. 
(b) Find the expected change in the mean conversion for a 
unit change in temperature x, for model 1 when x, = 5. 


1.5x, + 3x + 2x,x5 


17 58 61 1 4.6 Sy 
18 60 71 1 J 43 
19 62 62 0 VD 46 
20 68 38 0 7.8 56 
21 70 41 1 7.0 59 
22 79 66 1 6.2 26 
2 63 31 1 4.1 52 
24 39 42 0 3.5 83 
DS 49 40 1 2.1 75 


Does this quantity depend on the specific value of reac- 
tion time selected? Why? 

(c) Find the expected change in the mean conversion for a 
unit change in temperature x, for model 2 when x, = 5. 
Repeat this calculation for x, = 2 and x, = 8. Does the 
result depend on the value selected for x,? Why? 


12-4. You have fit a multiple linear regression model and 
the (X’X)~! matrix is: 


0.893758 —0.0282448 —0.0175641 
(X’X)' = | —0.028245 0.0013329 0.0001547 
—0.017564 0.0001547 0.0009108 


(a) How many regressor variables are in this model? 

(b) If the error sum of squares is 307 and there are 15 obser- 
vations, what is the estimate of 07? 

(c) What is the standard error of the regression coefficient Bi? 


12-5. Data from a patient satisfaction survey in a hospital 
are shown in the following table: 


Obser- Satis- 
vation Age Severity Surg-Med Anxiety faction 
1 55 50 0 Dell 68 
2 46 24 1 2.8 77 
3 30 46 1 3.3) 96 
4 35 48 1 4.5 80 
5 59 58 0 2.0 43 
6 61 60 0 5.1 44 
7 74 65 1 So) 26 
8 38 42 1 3.2 88 
9 27 42 0 3.1 75 
10 51 50 1 2.4 57 
11 53 38 1 DD, 56 
12 41 30 0 2.1 88 
13 37 31 0 1.9 88 
14 24 34 0 3.1 102 
15 42 30 0 3.0 88 
16 50 48 1 4.2 70 


The regressor variables are the patient’s age, an illness sever- 

ity index (larger values indicate greater severity), an indicator 

variable denoting whether the patient is a medical patient (0) 

or a surgical patient (1), and an anxiety index (larger values in- 

dicate greater anxiety). 

(a) Fit a multiple linear regression model to the satisfaction 
response using age, illness severity, and the anxiety index 
as the regressors. 

(b) Estimate o7. 

(c) Find the standard errors of the regression coefficients. 

(d) Are all of the model parameters estimated with nearly the 
same precision? Why or why not? 


12-6. The electric power consumed each month by a chem- 
ical plant is thought to be related to the average ambient 
temperature (x,), the number of days in the month (x), the 
average product purity (x3), and the tons of product produced 
(x4). The past year’s historical data are available and are pre- 
sented in the following table: 


v x1 2 *3 4 
240 25 24 91 100 
236 31 21 90 95 
270 45 24 88 110 
274 60 25 87 88 
301 65 Ds) 91 94 
316 72 26 94 99 
300 80 Ds) 87 97 
296 84 25 86 96 
267 VD 24 88 110 
276 60 25 91 105 
288 50 DS) 90 100 
261 38 23 89 98 


(a) Fit a multiple linear regression model to these data. 
(b) Estimate o7. 


=, 


Table 12-5 DaimlerChrysler Fuel Economy and Emissions 
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mfr carline car/truck cid rhp trns drv od etw cmp axle n/v a/c he co co2 mpg 
20  300C/SRT-8 Cc Palby Pésyek ILS) 4 2 4500 99 3.07 30.9 Y 0.011 0.09 288 30.8 
20 CARAVAN 2WD T 201 180 L4 F 2 4500 9.3 249 32.3 Y 0.014 O11 274 32.5 
20 CROSSFIRE ROADSTER C 196 168 LS5 R 2 3375 10 3.27 37.1 Y 0.001 0.02 250 35.4 
20 DAKOTA PICKUP 2WD T 226 210 L4 R 2 4500 9.2 3.55 29.6 Y 0.012 0.04 316 28.1 
20 DAKOTA PICKUP 4WD T 226 210 L4 4 2 5000 92 3.55 296 Y 0.011 0.05 365 24.4 
20 DURANGO 2WD T 348 345 LS R 2 5250 86 3.55 27.2 Y 0.023 0.15 367 24.1 
20 GRAND CHEROKEE 2WD T 226 210 L4 R 2 4500 9.2 3.07 304 Y 0.006 0.09 312 28.5 
20 GRAND CHEROKEE 4WD T 348 230 LS 4 2 5000 9 3.07 247 Y 0.008 0.11 369 24.2 
20 LIBERTY/CHEROKEE 2WD T 148 150 M6 R 2 4000 95 4.1 41 Y 0.004 0.41 270 32.8 
20 LIBERTY/CHEROKEE 4WD T 226 210 L4 4 2 4250 92 3.73 31.2 Y 0.003 0.04 317 28 

20 NEON/SRT-4/SX 2.0 C 122 132 L4 F 2 3000 9.8 2.69 39.2 Y 0.003 0.16 214 41.3 
20 PACIFICA 2WD T 215 249 L4 F 2 4750 9.9 2.95 35.3. Y 0.022 0.01 295 30 

20 PACIFICA AWD T 215 249 L4 4 2 5000 99 2.95 35.3 Y 0.024 0.05 314 28.2 
20 PT CRUISER T 148 220 L4 F 2 3625 95 2.69 37.33 Y 0.002 0.03 260 34.1 
20 RAM 1500 PICKUP 2WD T 500 500 M6 2 5250 96 41 223 Y O01 O1 474 187 
20 RAM 1500 PICKUP 4WD T 348 345 LS 4 2 6000 86 3.92 29 Y O 0 0 20.3 
20 SEBRING 4-DR C 165 200 L4 F 2 3625 9.7 269 368 Y 0.011 0.12 252 35.1 
20 STRATUS 4-DR Cc 148 167 L4 F 2 3500 95 2.69 36.8 Y 0.002 0.06 233 37.9 
20 TOWN & COUNTRY 2WD T 148 150 L4 F 2 4250 94 269 349 Y 0O 0.09 262 33.8 
20 VIPER CONVERTIBLE Cc 500 501 M6 R 2 3750 9.6 3.07 19.4 Y 0.007 0.05 342 25.9 
20 WRANGLER/TJ 4WD ali 148 150 M6 4 2 3625 95 3.73 40.1 Y 0.004 0.43 337 264 


mfr-mfr code 

carline-car line name (test vehicle model name) 
car/truck-‘C’ for passenger vehicle and “T” for truck 
cid-cubic inch displacement of test vehicle 
rhp-rated horsepower 

trns-transmission code 

drv-drive system code 

od-overdrive code 

etw-equivalent test weight 


(c) Compute the standard errors of the regression coeffi- 

cients. Are all of the model parameters estimated with the 

same precision? Why or why not? 

(d) Predict power consumption for a month in which 
xX, = 75°F, x, = 24 days, x; = 90%, and x, = 98 tons. 


12-7. Table 12-5 provides the highway gasoline mileage test 


G results for 2005 model year vehicles from DaimlerChrysler. The 


full table of data (available on the book’s Web site) contains the 

same data for 2005 models from over 250 vehicles from many 

manufacturers (source: Environmental Protection Agency Web 

site www.epa.gov/ otaq/cert/mpg/testcars/database). 

(a) Fit a multiple linear regression model to these data to esti- 
mate gasoline mileage that uses the following regressors: 
cid, rhp, etw, cmp, axle, n/v. 


cmp-compression ratio 

axle-axle ratio 

n/y-n/v ratio (engine speed versus vehicle speed at 50 mph) 
a/c-indicates air conditioning simulation 

he-HC(hydrocarbon emissions) Test level composite results 
co-CO(carbon monoxide emissions) Test level composite results 
co2-CO2(carbon dioxide emissions) Test level composite results 
mpg-mpg(fuel economy, miles per gallon) 


(b) Estimate o” and the standard errors of the regression co- 
efficients. 

(c) Predict the gasoline mileage for the first vehicle in the 
table. 


12-8. The pull strength of a wire bond is an important char- 

acteristic. The following table gives information on pull 

strength (y), die height (x), post height (x2), loop height (x3), 

wire length (x,), bond width on the die (x5), and bond width on 

the post (%6). 

(a) Fit a multiple linear regression model using x,, x3, x4, and 
Xs as the regressors. 

(b) Estimate o”. 

(c) Find the se(B)). How precisely are the regression coeffi- 
cients estimated, in your opinion? 
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v 4 x2 x3 x4 X5 6 
8.0 De 19.6 29.6 94.9 2.1 m3) 
8.3 5.2 19.8 32.4 89.7 2.1 1.8 
8.5 5.8 19.6 31.0 96.2 2.0 2.0 
8.8 6.4 19.4 32.4 95.6 2.2 2.1 
9.0 5.8 18.6 28.6 86.5 2.0 1.8 
9.3 5.2 18.8 30.6 84.5 2.1 2.1 
3) 5.6 20.4 32.4 88.8 DP) 1.9 
9.5 6.0 19.0 32.6 85.7 2.1 1.9 
9.8 De 20.8 B32 93.6 23) Dal 
10.0 5.8 19.9 31.8 86.0 2.1 1.8 
10.3 6.4 18.0 32.6 87.1 2.0 1.6 
10.5 6.0 20.6 33.4 93.1 2.1 2.1 
10.8 6.2 20.2 31.8 83.4 D2, Dal 
11.0 6.2 20.2 32.4 94.5 2.1 1.9 
11.3 6.2 19.2 31.4 83.4 1.9 1.8 
11.5 5.6 17.0 33.2 85.2 2.1 2.1 
11.8 6.0 19.8 35.4 84.1 2.0 1.8 
12.3 5.8 18.8 34.0 86.9 2.1 1.8 
12.5 5.6 18.6 34.2 83.0 1.9 2.0 


(d) Use the model from part (a) to predict pull strength when 

Xy = 20, x3 = 30, x, = 90, and x; = 2.0. 
12-9. An engineer at a semiconductor company wants to 
model the relationship between the device HFE (y) and three 
parameters: Emitter-RS (x,), Base-RS (x7), and Emitter-to-Base 
RS (43). The data are shown in the following table. 


x} x2 X3 yy, 
Emitter-RS Base-RS E-B-RS HFE-1M-5V 
14.620 226.00 7.000 128.40 
15.630 220.00 3.375 52.62 
14.620 217.40 6.375 113.90 
15.000 220.00 6.000 98.01 
14.500 226.50 7.625 139.90 
15.250 224.10 6.000 102.60 
16.120 220.50 3.375 48.14 
15.130 223.50 6.125 109.60 
15.500 217.60 5.000 82.68 
15.130 228.50 6.625 112.60 
15.500 230.20 5.750 97.52 
16.120 226.50 3.750 59.06 
15.130 226.60 6.125 111.80 
15.630 225.60 5.375 89.09 
15.380 229.70 5.875 101.00 
14.380 234.00 8.875 171.90 


15.500 
14.250 
14.500 
14.620 


230.00 
224.30 
240.50 
223.70 


4.000 
8.000 
10.870 
7.375 


66.80 
157.10 
208.40 
133.40 


(a) Fit a multiple linear regression model to the data. 


(b) Estimate o7. 


(c) Find the standard errors se( B)). Are all of the model para- 
meters estimated with the same precision? Justify your answer. 

(d) Predict HFE when x, = 14.5, x, = 220, and x3 = 5.0. 

12-10. Heat treating is often used to carburize metal parts, 

such as gears. The thickness of the carburized layer is consid- —=> 
ered a crucial feature of the gear and contributes to the overall 
reliability of the part. Because of the critical nature of this fea- 

ture, two different lab tests are performed on each furnace 
load. One test is run on a sample pin that accompanies each 
load. The other test is a destructive test, where an actual part 

is cross-sectioned. This test involves running a carbon analy- 

sis on the surface of both the gear pitch (top of the gear tooth) 

and the gear root (between the gear teeth). Table 12-6 shows 

the results of the pitch carbon analysis test for 32 parts. 

The regressors are furnace temperature (TEMP), carbon 
concentration and duration of the carburizing cycle 
(SOAKPCT, SOAKTIME), and carbon concentration and 
duration of the diffuse cycle (DIFFPCT, DIFFTIME). 

(a) Fita linear regression model relating the results of the pitch 
carbon analysis test (PITCH) to the five regressor variables. 


(b) Estimate o”. 


(c) Find the standard errors se(f;). 

(d) Use the model in part (a) to predict PITCH when 
TEMP = 1650, SOAKTIME = 1.00, SOAKPCT = 1.10, 
DIFFTIME = 1.00, and DIFFPCT = 0.80. 

12-11. An article in Electronic Packaging and Production 

(2002, Vol. 42) considered the effect of X-ray inspection of 

integrated circuits. The rads (radiation dose) were studied as a 

function of current (in milliamps) and exposure time (in minutes). 


Rads mAmps Exposure Time 
74 10 0.25 (_] 
14.8 10 0.5 
29.6 10 1 
59.2 10 pi] 
88.8 10 3 
296 10 10 
444 10 15 
592 10 20 
11.1 15 0.25 
22.2 15 0.5 
44.4 15 1 
continued 


Rads mAmps Exposure Time 
88.8 15 2) 
133.2 15 3 
444 15 10 
666 15 15 
888 15 20 
14.8 20 0.25 
29.6 20 0.5 
59.2 20 1 
118.4 20 2 
177.6 20 3 
592 20 10 
888 20 15 
1184 20 20 
22.2 30 0.25 
44.4 30 0.5 
88.8 30 1 
177.6 30 D 
266.4 30 3 
888 30 10 
1332 30 15 
1776 30 20 
29.6 40 0.25 
59.2 40 0.5 
118.4 40 1 
236.8 40 2 
355.2 40 3 
1184 40 10 
1776 40 15 
2368 40 20 


(a) Fit a multiple linear regression model to these data with 
rads as the response. 
(b) Estimate o? and the standard errors of the regression co- 
efficients. 
(c) Use the model to predict rads when the current is 15 mil- 
liamps and the exposure time is 5 seconds. 
12-12. Anarticle in Cancer Epidemiology, Biomarkers and 
Prevention (1996, Vol. 5, pp. 849-852) conducted a pilot 
study to assess the use of toenail arsenic concentrations as an 
indicator of ingestion of arsenic-containing water. Twenty-one 
participants were interviewed regarding use of their private 
(unregulated) wells for drinking and cooking, and each pro- 
vided a sample of water and toenail clippings. The table below 
showed the data of age (years), sex of person (1 = male, 2 = 
female), proportion of times household well used for drinking 
( = 1/44, 2 = 1/44, 3 = 1/2, 4 = 3/4, 5 = 3/4), proportion of 
times household well used for cooking (1 = 1/4,2 = 1/4,3 = 
1/2, 4 = 3/4, 5 = 3/4), arsenic in water (ppm), and arsenic in 
toenails (ppm) respectively. 
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Drink Cook Arsenic Arsenic 
Age Sex Use Use Water Nails 
44 2 5 5 0.00087 0.119 
45 2 4 5 0.00021 0.118 
44 1 5 5 0 0.099 
66 2 3 5 0.00115 0.118 
37 1 2 5 0 0.277 
45 2 5 5 0 0.358 
47 1 5 5 0.00013 0.08 
38 2 4 5 0.00069 0.158 
41 2, 3 2 0.00039 0.31 
49 2 4 5 0 0.105 
72) 2 5 5 0 0.073 
45 2 1 5 0.046 0.832 
53} 1 5 5 0.0194 0.517 
86 2 5 5 0.137 2.252 
8 2 D D 0.0214 0.851 
32 2 5 5 0.0175 0.269 
44 1 5 5 0.0764 0.433 
63 2 Se) 5 0 0.141 
42 1 5 5 0.0165 0.275 
62 1 5 5 0.00012 0.135 
36 1 5 3S 0.0041 0.175 


(a) Fit a multiple linear regression model using arsenic con- 
centration in nails as the response and age, drink use, cook 
use, and arsenic in the water as the regressors. 

(b) Estimate o” and the standard errors of the regression co- 
efficients. 

(c) Use the model to predict the arsenic in nails when the age 
is 30, the drink use is category 5, the cook use is category 
5, and arsenic in the water is 0.135 ppm. 

12-13. Inanarticle in JEEE Transactions on Instrumentation 

and Measurement (2001, Vol. 50, pp. 2033-2040) powdered 

mixtures of coal and limestone were analyzed for permittivity. 

The errors in the density measurement was the response. 


Density Dielectric Constant Loss Factor 
0.749 2.05 0.016 
0.798 2.15 0.02 
0.849 225 0.022 
0.877 2.3 0.023 
0.929 2.4 0.026 
0.963 2.47 0.028 
0.997 2.54 0.031 
1.046 2.64 0.034 
1.133 2.85 0.039 
1.17 2.94 0.042 
1.215 3.05 0.045 


nt 


S 
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Table 12-6 

TEMP SOAKTIME SOAKPCT DIFFTIME DIFFPCT PITCH 
1650 0.58 1.10 0.25 0.90 0.013 
1650 0.66 1.10 0.33 0.90 0.016 
1650 0.66 1.10 0.33 0.90 0.015 
1650 0.66 1.10 0.33 0.95 0.016 
1600 0.66 1.15 0.33 1.00 0.015 
1600 0.66 1.15 0.33 1.00 0.016 
1650 1.00 1.10 0.50 0.80 0.014 
1650 1.17 1.10 0.58 0.80 0.021 
1650 IL IL7 1.10 0.58 0.80 0.018 
1650 1.17 1.10 0.58 0.80 0.019 
1650 1.17 1.10 0.58 0.90 0.021 
1650 1.17 1.10 0.58 0.90 0.019 
1650 1.17 1.15 0.58 0.90 0.021 
1650 1.20 1.15 1.10 0.80 0.025 
1650 2.00 1.15 1.00 0.80 0.025 
1650 2.00 1.10 1.10 0.80 0.026 
1650 2.20 1.10 1.10 0.80 0.024 
1650 2.20 1.10 1.10 0.80 0.025 
1650 2.20 1.15 1.10 0.80 0.024 
1650 2.20 1.10 1.10 0.90 0.025 
1650 2.20 1.10 1.10 0.90 0.027 
1650 2.20 1.10 1.50 0.90 0.026 
1650 3.00 1.15 1.50 0.80 0.029 
1650 3.00 1.10 1.50 0.70 0.030 
1650 3.00 1.10 1.50 0.75 0.028 
1650 3.00 1.15 1.66 0.85 0.032 
1650 3.33 1.10 1.50 0.80 0.033 
1700 4.00 1.10 1.50 0.70 0.039 
1650 4.00 1.10 1.50 0.70 0.040 
1650 4.00 1.15 1.50 0.85 0.035 
1700 12.50 1.00 1.50 0.70 0.056 
1700 18.50 1.00 1.50 0.70 0.068 


(a) Fit a multiple linear regression model to these data with 
the density as the response. 

(b) Estimate o? and the standard errors of the regression 
coefficients. 

(c) Use the model to predict the density when the dielectric 
constant is 2.5 and the loss factor is 0.03. 

12-14. An article in Biotechnology Progress (2001, Vol. 

17, pp. 366-368) reported on an experiment to investigate 


and optimize nisin extraction in aqueous two-phase systems 
(ATPS). The nisin recovery was the dependent variable (y). 
The two regressor variables were concentration (%) of PEG 
4000 (denoted as x,) and concentration (%) of Na,SO4 (de- 
noted as x). 

(a) Fit a multiple linear regression model to these data. 

(b) Estimate o? and the standard errors of the regression 

coefficients. 


al x2 Y 

13 11 62.8739 
15 11 76.1328 
13 13 87.4667 
15 13 102.3236 
14 12 76.1872 
14 12 77.5287 
14 12 76.7824 
14 12 774381 
14 12 78.7417 


(c) Use the model to predict the nisin recovery when 
x; = 14.5 and x, = 12.5. 

12-15. An article in Optical Engineering [Operating 
Curve Extraction of a Correlator’s Filter” (2004, Vol. 43, pp. 
2775-2779)] reported on use of an optical correlator to per- 
form an experiment by varying brightness and contrast. The 
resulting modulation is characterized by the useful range of 
gray levels. The data are shown below: 


Brightness (%): 54 61 65 100 100 100 50 57 54 
Contrast (%): 56 80 70 50 65 80 25 35 26 
Useful range (ng): 96 50 50 112 96 80 155 144 255 


(a) Fit a multiple linear regression model to these data. 

(b) Estimate o°. 

(c) Compute the standard errors of the regression coefficients. 

(d) Predict the useful range when brightness = 80 and 
contrast = 75. 


12-16. An article in Technometrics (1974, Vol. 16, pp. 
523-531) considered the following stack-loss data from a 
plant oxidizing ammonia to nitric acid. Twenty-one daily re- 
sponses of stack loss y (the amount of ammonia escaping) 
were measured with air flow x,, temperature x, and acid con- 
centration x3. 


Stack loss y = 42, 37, 37, 28, 18, 18, 19, 20, 15, 14, 14, 13, 
11, 12, 8,7, 8, 8, 9, 15, 15 

x, = 80, 80, 75, 62, 62, 62, 62, 62, 58, 58, 58, 58, 58, 58, 50, 
50, 50, 50, 50, 56, 70 

X_ = 27, 27, 25, 24, 22, 23, 24, 24, 23, 18, 18, 17, 18, 19, 18, 
18, 19, 19, 20, 20, 20 

Xx; = 89, 88, 90, 87, 87, 87, 93, 93, 87, 80, 89, 88, 82, 93, 89, 
86, 72, 79, 80, 82, 91 


(a) Fit a linear regression model relating the results of the 
stack loss to the three regressor varilables. 

(b) Estimate o”. 

(c) Find the standard error se(B,)). 

(d) Use the model in part (a) to predict stack loss when x, 
X> = 26, and x3 = 85. 


= 60, 


12-1 MULTIPLE LINEAR REGRESSION MODEL 467 


12-17. Table 12-7 presents quarterback ratings for the 2008 

National Football League season (source: The Sports Network). 

(a) Fit a multiple regression model to relate the quarterback 
rating to the percentage of completions, the percentage of 
TDs, and the percentage of interceptions. 

(b) Estimate o7. 

(c) What are the standard errors of the regression coefficients? 

(d) Use the model to predict the rating when the percentage of 
completions is 60%, the percentage of TDs is 4%, and the 
percentage of interceptions is 3%. 

12-18. Table 12-8 presents statistics for the National Hockey 

League teams from the 2008-2009 season (source: The Sports 

Network). Fit a multiple linear regression model that relates 

Wins to the variables GF through FG. Because teams play 82 

games W = 82 — L — T — OTL, but such a model does not 

help build a better team. Estimate o7 and find the standard 

errors of the regression coefficients for your model. 

12-19. A study was performed on wear of a bearing y and its 

relationship to x, = oil viscosity and x. = load. The following 

data were obtained. 


av x] x2 
293 1.6 851 
230 15.5 816 
172 DARD 1058 

91 43.0 1201 
113 33.0 1357 
125 40.0 1115 


(a) Fit a multiple linear regression model to these data. 

(b) Estimate o” and the standard errors of the regression 
coefficients. 

(c) Use the model to predict wear when x, = 25 and x, = 1000. 

(d) Fit a multiple linear regression model with an interaction 
term to these data. 

(e) Estimate o7 and se(,) for this new model. How did these 
quantities change? Does this tell you anything about the 
value of adding the interaction term to the model? 

(f) Use the model in (d) to predict when x, = 25 and x, = 
1000. Compare this prediction with the predicted value 
from part (c) above. 


12-20. Consider the linear regression model 


Y¥; = Bo + BiG — %1) + Bole 

x;,/n and X) = Sx;2/n. 

(a) Write out the least squares normal equations for this model. 

(b) Verify that the least squares estimate of the intercept in 
this model is Bh = Sjy;/n = j. 

(c) Suppose that we use y; — y as the response variable in the 
model above. What effect will this have on the least 
squares estimate of the intercept? 


Xp) rv &; 


where x; = 
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Table 12-7 Quarterback Ratings for the 2008 National Football League Season 
Pct Yds per Pct Pct Rating 

Player Team Att Comp Comp Yds Att TD TD Lng Int’ Int Pts 
Philip Rivers SD 478 312 65.3 4,009 8.39 34 7.1 67 LI 230535 
Chad Pennington MIA 476 = 321 67.4 3,653 7.67 19 40 80 7 15 97.4 
Kurt Warner ARI 598 401 67.1 4,583 7.66 30 50 79 14. 2.3 96.9 
Drew Brees NO 635 413 65 5,069 7.98 34 S54 84 17. 2.7 96.2 
Peyton Manning IND 555 371 66.8 4,002 WAN 27 49 75 (2 eS) 
Aaron Rodgers GB 536 = 341 63.6 4,038 7.53 28 5.2 71 13. 24 93.8 
Matt Schaub HOU 380 251 66.1 3,043 8.01 15 39 65 10 2.6 Oe 
Tony Romo DAL 450 276 61.3 3,448 7.66 26 5.8 75 14 3.1 91.4 
Jeff Garcia TB 376 =. 244 64.9 eile: 7.21 We 3A Til 6 16 90.2 
Matt Cassel NE 516 = 327 63.4 3,693 7.16 21 4.1 76 11 2.1 89.4 
Matt Ryan ATL 434 265 61.1 3,440 193 16 3.7 70 il 28) 87.7 
Shaun Hill SF 288 181 62.8 2,046 7.10 13 45 48 8 28 87.5 
Seneca Wallace SEA 242 141 58.3 1,532 6.33 11 45 90 3 We 7 
Eli Manning NYG 479 289 60.3 3,238 6.76 21 44 48 10 2.1 86.4 
Donovan McNabb PHI 571 345 60.4 3,916 6.86 23 «64.0 90 11 1.9 86.4 
Jay Cutler DEN 616 384 62.3 4,526 7.39 25. 4.1 93 18 2.9 86 
Trent Edwards BUF 374 245 65.5 2,699 VLD 11 29 65 10. 2.7 85.4 
Jake Delhomme CAR 414 246 59.4 3,288 7.94 15 36 65 12 2.9 84.7 
Jason Campbell WAS 506 315 62.3 3,245 6.41 13 26 67 6 12 84.3 
David Garrard JAC 535 335 62.6 3,620 6.77 15 28 41 13. 24 81.7 
Brett Favre NYJ 522 343 65.7 3,472 6.65 22 42 56 22 42 ~ 81 
Joe Flacco BAL 428 257 60 2,971 6.94 14 33 70 12 2.8 80.3 
Kerry Collins TEN 415 242 58.3 2,676 6.45 12 29 56 ley 80.2 
Ben Roethlisberger PIT 469 281 59.9 3,301 7.04 17 36 865 15: 32 80.1 
Kyle Orton CHI 465 272 58.5 DMD 6.39 18 3.9 65 12 2.6 79.6 
JaMarcus Russell OAK 368 198 53.8 2,423 6.58 13° 3.5 84 8 2.2 771 
Tyler Thigpen KC 420 230 54.8 2,608 6.21 18 43 75 1 DBO TKS) 
Gus Frerotte MIN 301 178 59.1 2,157 TAT 12 40 99 15 5.0 73.7 
Dan Orlovsky DET 255 143 56.1 1,616 6.34 8 3.1 96 8 3.1 72.6 
Marc Bulger STL 440 251 57 2,720 6.18 11 2.5 80 13. 3.0 71.4 
Ryan Fitzpatrick CIN 372 221 59.4 1,905 5.12 8 2S) 9 24 70 
Derek Anderson CLE 283 142 50.2 1,615 5.71 9 3.2 70 8 28 66.5 
Att Attempts (number of pass attempts) 
Comp Completed passes 
Pct Comp Percentage of completed passes 
Yds Yards gained passing 
Yds per Att Yards gained per pass attempt 
TD Number of touchdown passes 
Pct TD Percentage of attempts that are touchdowns 
Long Longest pass completion 
Int Number of interceptions 
Pct Int Percentage of attempts that are interceptions 


Rating Pts 


Rating points 
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Team W L OTL PTS GF GA ADV PPGF PCTG PEN BMI AVG SHT PPGA PKPCT SHGF SHGA FG 
Anaheim 42 33 7 91 238 235 309 73 23.6 1418 8 174 385 78 79.7 6 6 43 
Atlanta 35 41 6 76 250 279 357 69 19.3 1244 12 15.3 366 88 76 = 13 9 39 
Boston 53 19 10 116 270 190 313 74 23.6 1016 12 12.5 306 54 82.4 8 7 47 
Buffalo 41 32 9 91 242 229 358 75 21 1105 16 13.7 336 61 81.8 7 4 44 
Carolina 45 30 7 97 236 221 374 70 18.7 786 16 9.8 301 59 80.4 8 7 39 
Columbus 41 31 10 92 220 223 322 41 12.7 1207 20 15 346 62 82.1 8 9 41 
Calgary 46 30 6 98 251 246 358 61 17 1281 18 15.8 349 58 83.4 6 13 37 
Chicago 46 24 12 104 260 209 363 70 19.3 1129 28 14.1 330 64 80.6 10 5 43 
Colorado 32 45 5 69 190 253 318 50 15.7 1044 18 13 318 64 79.9 4 5 31 
Dallas 36 35 11 83 224 251 351 54 15.4 1134 10 14 327 70 78.6 2 2 38 
Detroit 51 21 10 112 289 240 353 90 25.5 810 14 10 327 71 78.3 6 4 46 
Edmonton 38 35 9 85 228 244 354 60 17. 1227 20 15.2 338 76 77.5 3 8 39 
Florida 41 30 11 93 231 223 308 51 166 884 16 11 311 54 82.6 7 6 39 
Los Angeles 34 37 11 79 202 226 360 69 192 1191 16 14.7 362 62 82.9 4 7 39 
Minnesota 40 33 9 89 214 197 328 66 20.1 869 20 108 291 36 87.6 9 6 39 
Montreal 41 30 11 93 242 240 374 72 19.2 1223 6 15 370 65 82.4 10 10 38 
New Jersey 51 27 4 106 238 207 307 58 18.9 1038 20 12.9 324 65 Mo if 3 44 
Nashville 40 34 8 88 207 228 318 50 15.7 982 12 12.1 338 59 82.5 9 8 41 
NI Islanders 26 47 9 61 198 274 320 54 16.9 1198 18 148 361 73 Ms Wa 5 37) 
NY Rangers 43 30 9 95 200 212 346 48 13.9 1175 24 146 329 40 87.8 9 13 42 
Ottawa 36 35 11 83 213 231 339 66 195 1084 14 134 346 64 81.5 8 5 46 
Philadelphia 44 27 11 99 260 232 316 71 22.5 1408 26 17.5 393 67 83 16 1 43 
Phoenix 36 39 67) 79 «2205 249 344 50 14.5 1074 18 13.3 293 68 76.8 S 4 36 
Pittsburgh 45 28 9 99 258 233 360 62 172 1106 8 13.6 347 60 82.7 7 11 46 
San Jose 53 18 11 117 251 199 360 87 24.2 1037 16 12.8 306 51 83.3 12 10 46 
St. Louis 41 31 10 92 227 227 351 72 205 1226 22 15.2 357 58 83.8 10 8 35 
Tampa Bay 24 40 18 66 207 269 343 61 17.8 1280 26 15.9 405 89 78 4 8 34 
Toronto 34 35 13 81 244 286 330 62 18.8 1113 12 13.7 308 78 74.7 6 7 40 
Vancouver 45 27 10 100 243 213 357 67 18.8 1323 28 16.5 371 69 81.4 7 5 47 
Washington 50 24 8 108 268 240 337 85 25.2 1021 20 12.7 387 75 80.6 7 9 45 
W Wins PEN Total penalty minutes including bench minutes 
L Losses during regular time BMI Total bench minor minutes 
OTL Overtime losses AVG Average penalty minutes per game 
PTS Points. Two points for winning a game, one point for SHT Total times short-handed. Measures opponent 
a tie or losing in overtime, zero points for losing in opportunities. 
regular time. PPGA Power-play goals against 
GF Goals for PKPCT Penalty killing percentage. Measures a team’s 
GA Goals against ability to prevent goals while its opponent is ona 
ADV Total advantages. Power play opportunities. power play. Opponent opportunities minus power 
PPGF  Power-play goals for. Goals scored while on power play goals divided by opponent’s opportunities. 
play. SHGF Short-handed goals for 
PCTG Power play percentage. Power-play goals divided by SHGA Short-handed goals against 
total advantages. FG Games scored first 
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12-2 HYPOTHESIS TESTS IN MULTIPLE LINEAR REGRESSION 


In multiple linear regression problems, certain tests of hypotheses about the model parameters 
are useful in measuring model adequacy. In this section, we describe several important 
hypothesis-testing procedures. As in the simple linear regression case, hypothesis testing 
requires that the error terms e; in the regression model are normally and independently dis- 
tributed with mean zero and variance o”. 


12-2.1 Test for Significance of Regression 


Hypotheses 
for ANOVA Test 


Test Statistic 
for ANOVA 


The test for significance of regression is a test to determine whether a linear relationship exists 
between the response variable y and a subset of the regressor variables x), x2, ..., x, The 
appropriate hypotheses are 


Ho Ri = B2 > — B;, — 0 

H,:B;#0 for at least one j (12-18) 
Rejection of Hp: B} = Bo = -:- = B, = O implies that at least one of the regressor variables 
X, X9,..., X, contributes significantly to the model. 


The test for significance of regression is a generalization of the procedure used in simple 
linear regression. The total sum of squares SS; is partitioned into a sum of squares due to the 
model or to regression and a sum of squares due to error, say, 


SSp = SSpt SSp 


Now if Ho: B; = Bx = ++: = By = 0 is true, SSp/o” is a chi-square random variable with k de- 
grees of freedom. Note that the number of degrees of freedom for this chi-square random vari- 
able is equal to the number of regressor variables in the model. We can also show that the 
SS,,/o7 is a chi-square random variable with n — p degrees of freedom, and that SS, and SSp 
are independent. The test statistic for Hp: 8B} = Bo = --: = B, = 0 is 


SSp/k MSp 
= (12-19) 


F. — 
°” SSi/(n -p) MSz 


We should reject Hy if the computed value of the test statistic in Equation 12-19, fo, is greater than 
Sokn-p The procedure is usually summarized in an analysis of variance table such as Table 12-9. 

A computational formula for SSz may be found easily. Now since SS; = D'_, y7— 
(Si y)°/n = y'y — (27, y)?/n, we may rewrite Equation 12-19 as 


SSe = y'y — 
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Table 12-9 Analysis of Variance for Testing Significance of Regression in Multiple Regression 


Source of Degrees of 
Variation Sum of Squares Freedom Mean Square Fo 
Regression SSp k MSz MS;/MS;, 
Error or residual SS n-p MS; 
Total SSy n-1 
or 
SSr = SSr — SSp 
Therefore, the regression sum of squares is 
n 2 
(3) 
= p’x’y - — (12-21) 


EXAMPLE 12-3 Wire Bond Strength ANOVA 


We will test for significance of regression (with a = 0.05) us- 
ing the wire bond pull strength data from Example 12-1. The 
total sum of squares is 


( SJ G52) 


SSr = y'y n = 27,178.5316 — 95 


= 6105.9447 


The regression or model sum of squares is computed from 
Equation 12-20 as follows: 


n 2 

(3) 
i=1 
n 


= 5990.7712 


(725.82) 


SSp = B'X’y a 


= 27,063.3581 — 


and by subtraction 
SS~ = SS — SSp = y'y — B'X'y = 115.1716 


The analysis of variance is shown in Table 12-10. To test 
Ay: By = Bo = 0, we calculate the statistic 


MSp — 2995.3856 
A= = = 572.17 
MS; ‘5.2352 


Since fy > fo,05,2.22 = 3.44 (or since the P-value is consider- 
ably smaller than a = 0.05), we reject the null hypothesis and 
conclude that pull strength is linearly related to either wire 
length or die height, or both. 

Practical Interpretation: Rejection of Hy does not neces- 
sarily imply that the relationship found is an appropriate 
model for predicting pull strength as a function of wire length 
and die height. Further tests of model adequacy are required 
before we can be comfortable using this model in practice. 


Most multiple regression computer programs provide the test for significance of regression 
in their output display. The middle portion of Table 12-4 is the Minitab output for this example. 
Compare Tables 12-4 and 12-10 and note their equivalence apart from rounding. The P-value is 
rounded to zero in the computer output. 


Table 12-10 Test for Significance of Regression for Example 12-3 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square ho P-value 
Regression 5990.7712 2 2995 .3856 Sealy 1.08E-19 
Error or residual 115.1735 22, 5.2352 


Total 6105.9447 24 
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Adjusted 
R2 


R’ and Adjusted R? 
We may also use the coefficient of multiple determination R’ as a global statistic to assess 
the fit of the model. Computationally, 


R= =l]- (12-22) 


For the wire bond pull strength data, we find that R? = SSp/SS; = 5990.7712/6105.9447 = 
0.9811. Thus the model accounts for about 98% of the variability in the pull strength response 
(refer to the Minitab output in Table 12-4). The R? statistic is somewhat problematic as a 
measure of the quality of the fit for a multiple regression model because it never decreases 
when a variable is added to a model. 

To illustrate, consider the model fit to the wire bond pull strength data in Example 11-8. 
This was a simple linear regression model with x, = wire length as the regressor. The value of 
R° for this model is R? = 0.9640. Therefore, adding x. = die height to the model increases R? 
by 0.9811 — 0.9640 = 0.0171, a very small amount. Since R* can never decrease when a 
regressor is added, it can be difficult to judge whether the increase is telling us anything useful 
about the new regressor. It is particularly hard to interpret a small increase, such as observed 
in the pull strength data. 

Many regression users prefer to use an adjusted R° statistic: 


Ria = 1 - (12-23) 


Because SS;,/(n — p) is the error or residual mean square and SS7/(n — 1) is a constant, Req 
will only increase when a variable is added to the model if the new variable reduces the error 
mean square. Note that for the multiple regression model for the pull strength data Ria = 
0.979 (see the Minitab output in Table 12-4), whereas in Example 11-8 the adjusted R? for the 
one-variable model is Riaj = 0.962. Therefore, we would conclude that adding x, = die 
height to the model does result in a meaningful reduction in unexplained variability in the 
response. 

The adjusted R* statistic essentially penalizes the analyst for adding terms to the 
model. It is an easy way to guard against overfitting, that is, including regressors that are 
not really useful. Consequently, it is very useful in comparing and evaluating competing 
regression models. We will use Riai for this when we discuss variable selection in regres- 
sion in Section 12-6.3. 


12-2.2 Tests on Individual Regression Coefficients 
and Subsets of Coefficients 


We are frequently interested in testing hypotheses on the individual regression coefficients. 
Such tests would be useful in determining the potential value of each of the regressor variables 
in the regression model. For example, the model might be more effective with the inclusion of 
additional variables or perhaps with the deletion of one or more of the regressors presently in 
the model. 
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The hypothesis to test if an individual regression coefficient, say 8; equals a value Bj is 


Ho: Bj = Bjo 
A: Bj = Bjo (12-24) 


The test statistic for this hypothesis is 


ee nae Bj — Bo 
: Vane, se(B;) 


(12-25) 


where C;; is the diagonal element of (x’x) corresponding to B | Notice that the denominator 
of Equation 12-24 is the standard error of the regression coefficient B je The null hypothesis Ho: 
B; = Bjo is rejected if |fo] > tuy2,n—p- This is called a partial or marginal test because the 
regression coefficient B ; depends on all the other regressor variables x,(i # j) that are in the 
model. More will be said about this in the following example. 

An important special case of the previous hypothesis occurs for Bj) = 0. If Ho: B; = 0 is 
not rejected, this indicates that the regressor x; can be deleted from the model. Adding a vari- 
able to a regression model always causes the sum of squares for regression to increase and the 
error sum of squares to decrease (this is why R* always increases when a variable is added). 
We must decide whether the increase in the regression sum of squares is large enough to jus- 
tify using the additional variable in the model. Furthermore, adding an unimportant variable 
to the model can actually increase the error mean square, indicating that adding such a vari- 
able has actually made the model a poorer fit to the data (this is why Riai is a better measure 
of global model fit then the ordinary R’). 


EXAMPLE 12-4 Wire Bond Strength Coefficient Test 


Consider the wire bond pull strength data, and suppose that we Note that we have used the estimate of o” reported to four dec- 
want to test the hypothesis that the regression coefficient for x, imal places in Table 12-10. Since f 9959. = 2.074, we reject 
(die height) is zero. The hypotheses are H: B, = 0 and conclude that the variable x, (die height) con- 
tributes significantly to the model. We could also have used a 

Ho: By = 0 P-value to draw conclusions. The P-value for fy = 4.477 is 

P = 0.0002, so with a = 0.05 we would reject the null hypothesis. 

Ay: By #0 Practical Interpretation: Note that this test measures the 

marginal or partial contribution of x, given that x, is in the 

The main diagonal element of the (X’X)~! matrix correspon- model. That is, the f-test measures the contribution of adding 
ding to By is Czy = 0.0000015, so the ¢-statistic in Equation the variable x, = die height to a model that already contains x, 
12-25 is = wire length. Table 12-4 shows the value of the t-test com- 


puted by Minitab. The Minitab t-test statistic is reported to two 
7 decimal places. Note that the computer produces a t-test for 
Bo 0.01253 = each regression coefficient in the model. These f-tests indicate 


iy = Ve v/ . 
oCyy (5.2352)(0.0000015) that both regressors contribute to the model. 
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EXAMPLE 12-5 Wire Bond Strength One-Sided Coefficient Test 


There is an interest in the effect of die height on strength. This 
can be evaluated by the magnitude of the coefficient for die 
height. To conclude that the coefficient for die height exceeds 
0.01 the hypotheses become 
Ao: Bo = 0.01 Ay: By > 0.01 

For such a test, computer software can complete much of the 
hard work. We only need to assemble the pieces. From the 
Minitab output in Table 12-4, 8, = 0.012528 and the standard 


error of Bo = 0.002798. Therefore the t-statistic is 


0.012528 — 0.01 
0.002798 


to = 0.9035 

with 22 degrees of freedom (error degrees of freedom). From 
Table IV in Appendix A, {995,99 = 0.686 and fo 1,9) = 1.321. 
Therefore, the P-value can be bounded as 0.1 < P-value 
< 0.25. One cannot conclude that the coefficient exceeds 0.01 
at common levels of significance. 


There is another way to test the contribution of an individual regressor variable to the 
model. This approach determines the increase in the regression sum of squares obtained by 
adding a variable x; (say) to the model, given that other variables x;(i # j) are already included 


in the regression equation. 


The procedure used to do this is called the general regression significance test, or the 
extra sum of squares method. This procedure can also be used to investigate the contribution 
of a subset of the regressor variables to the model. Consider the regression model with k 


regressor variables 


y=Xptre (12-26) 


where y is(n X 1), Xis(n X p), Bis(p X 1), €is(m X 1), and p = k + 1. We would like to de- 


termine if the subset of regressor variables x,, X, . . 


.,X,(r < k) as a whole contributes signifi- 


cantly to the regression model. Let the vector of regression coefficients be partitioned as follows: 


B= re (12-27) 
Bo 
where B, is (r X 1) and B, is [(p — r) X 1]. We wish to test the hypotheses 
Hypotheses 
for General Hp: B; = 0 
Regression Test 
A,: B, #0 (12-28) 
where 0 denotes a vector of zeroes. The model may be written as 
y = XB + « = XP, + XP. + € (12-29) 


where X, represents the columns of X associated with B, and X, represents the columns of X 


associated with B,. 


For the full model (including both B, and B,), we know that ® = (X’X)~! X’y. In 
addition, the regression sum of squares for all variables including the intercept is 


SSR(B) = B’X'y 


and 


(p =k + 1 degrees of freedom) 
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SS,(B) is called the regression sum of squares due to B. To find the contribution of the terms 
in B, to the regression, fit the model assuming the null hypothesis Hp: B,; = 0 to be true. The 
reduced model is found from Equation 12-29 as 

y= XB, +e (12-30) 


The least squares estimate of B, is B) = (X5X>) 'X4y, and 


SSp(B2) = BSX5y — (p — r degrees of freedom) (12-31) 
The regression sum of squares due to B, given that B, is already in the model is 


SSp(Bi|B2) = SSr(B) — SSr(B2) (12-32) 


This sum of squares has r degrees of freedom. It is sometimes called the extra sum of squares 
due to B,. Note that SSp(B,|B2) is the increase in the regression sum of squares due to 
including the variables x,, x,, ..., x, in the model. Now SS;p(B,|B2) is independent of MS, and 
the null hypothesis B, = 0 may be tested by the statistic. 


F Statistic 
for General SSe(B1|B2)/r 
Regression Test Fo= wo (12-33) 
E 


If the computed value of the test statistic fo > fan—p, We reject Hp, concluding that at least one 
of the parameters in 8, is not zero and, consequently, at least one of the variables x,, x, ..., x, 
in X, contributes significantly to the regression model. Some authors call the test in Equation 
12-33 a partial F-test. 

The partial F-test is very useful. We can use it to measure the contribution of each indi- 
vidual regressor x; as if it were the last variable added to the model by computing 


SSr(B;|Bos By,..., Bj-1, Bytes By) i= 1, Qe csiaceng Ke 


This is the increase in the regression sum of squares due to adding x; to a model that already 
includes x), ... , Xj-1, Xj+1, +--+, X, The partial F-test is a more general procedure in that we 
can measure the effect of sets of variables. In Section 12-6.3 we show how the partial F-test 
plays a major role in model building—that is, in searching for the best set of regressor vari- 
ables to use in the model. 


EXAMPLE 12-6 Wire Bond Strength General Regression Test 


Consider the wire bond pull-strength data in Example 12-1. We To test this hypothesis, we need the extra sum of squares due 
will investigate the contribution of two new variables, x; and x4, to B3 and BR, or 

to the model using the partial F-test approach. The new variables 

are explained at the end of this example. That is, we wish to test SSx(B4, Bs|B2- Bi. Bo) = SSa(B4. Bs. Ba» Bi Bo) — SSe(Bo, Bi, Bo) 


= SSr(Ba, Bs, B2, Bi]Bo) — SSr(B2, BilBo) 
Ay: B3 = By = 0 A,:B3 #0 or By #0 
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In Example 12-3 we calculated 


(3) 


SSr (Bo, Bi|Bo) = B'X'y = 


= 5990.7712 (two 


degrees of freedom) 


Also, Table 12-4 shows the Minitab output for the model with 
only x, and x, as predictors. In the analysis of variance table, 
we can see that SSp = 5990.8 and this agrees with our calcu- 
lation. In practice, the computer output would be used to ob- 
tain this sum of squares. 

If we fit the model Y = Bo + Bix, + Box. + B3x3 4 
4x4, we can use the same matrix formula. Alternatively, we can 
look at SS, from computer output for this model. The analysis 
of variance table for this model is shown in Table 12-11 and we 
see that 


SSp(B4, B3, Bz, B1|Bo) = 6024.0 (four degrees of freedom) 


Therefore, 


SSp(Ba, B3|B2, Bi, Bo) = 6024.0 — 5990.8 = 33.2 (two 


degrees of freedom) 


This is the increase in the regression sum of squares due to 
adding x3 and x, to a model already containing x, and x>. To 
test Ho, calculate the test statistic 


_ SSr(Ba, B3|B2, Bi, Bo)/2 — 33.2/2 _ 
MS; 41 


0 4.05 


Note that MS; from the full model using x), x2, x3 and x4 
is used in the denominator of the test statistic. Because 
Jo.0s, 2,20 = 3.49, we reject Hy and conclude that at least one 
of the new variables contributes significantly to the model. 
Further analysis and tests will be needed to refine the model 
and determine if one or both of x3 and x4 are important. 

The mystery of the new variables can now be explained. 
These are quadratic powers of the original predictors wire 
length and wire height. That is, x, = xt and x, = x3. A test for 
quadratic terms is a common use of partial F-tests. With this 
information and the original data for x, and x, you can use 
computer software to reproduce these calculations. Multiple 
regression allows models to be extended in such a simple man- 
ner that the real meaning of x; and x, did not even enter into 
the test procedure. Polynomial models such as this are dis- 
cussed further in Section 12-6. 


If a partial F-test is applied to a single variable, it is equivalent to a t-test. To see this, con- 
sider the Minitab regression output for the wire bond pull strength in Table 12-4. Just below 
the analysis of variance summary in this table, the quantity labeled ” ‘SeqSS”’ shows the sum 


Table 12-11 Regression Analysis: y versus x1, x2, x3, x4 


The regression equation is y = 5.00 + 1.90 x1 + 0.0151 x2 + 0.0460 x3 — 0.000008 x4 


Predictor Coef SE Coef T P 
Constant 4.996 1.655 3.02 0.007 
xl 1.9049 0.3126 6.09 0.000 
x2 0.01513 0.01051 1.44 0.165 
x3 0.04595 0.01666 2.76 0.012 
x4 —0.00000766 0.00001641 —0.47 0.646 
S = 2.02474 R-Sq = 98.7% R-—Sq (adj) = 98.4% 


Analysis of Variance 


Source DF 
Regression 4 
Residual Error 20 
Total 24 
Source DF Seq SS 
xl 1 5885.9 
x2 1 104.9 
x3 1 32.3 
x4 1 0.9 


SS MS F P 
6024.0 1506.0 367.35 0.000 
82.0 4.1 
6105.9 
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of squares obtained by fitting x, alone (5885.9) and the sum of squares obtained by fitting x, 
after x, (104.9). In out notation, these are referred to as SSp(B;|Bo) and SSp(B2, B;|Bo), respec- 
tively. Therefore, to test Hy: By = 0, H,: B. # 0 the partial F-test is 


_ SSp(B2|B1,Bo)/1 _ 104.92 _ 
eo MS; 5.24 


20.2 


where MS, is the mean square for residual in the computer output in Table 12.4. This statistic 
should be compared to an F-distribution with 1 and 22 degrees of freedom in the numerator 
and denominator, respectively. From Table 12-4, the t-test for the same hypothesis is to = 4.48. 
Note that 1} = 4.48? = 20.07 = fo, except for round-off error. Furthermore, the square of a 
t-random variable with v degrees of freedom is an F-random variable with one and v degrees of 
freedom. Consequently, the t-test provides an equivalent method to test a single variable for 
contribution to a model. Because the f-test is typically provided by computer output, it is the 


preferred method to test a single variable. 


EXERCISES FOR SECTION 12-2 


12-21. Consider the computer output below. 


The regression equation is 
Y = 254 + 2.77 xl — 3.58 x2 


Predictor Coef SE Coef T P 

Constant 253.810 4.781 ? ? 

xl 2.7738 0.1846 15.02 ? 

x2 —3.5753 0.1526 ? ? 

S = 5.05756 R-Sq =? R-Sq (adj) = 98.4% 

Analysis of Variance 

Source DF SS MS P 

Regression 2 22784 11392 ? ? 

Residual Error ? ? ? 

Total 14 23091 

(a) Fill in the missing quantities. You may use bounds for the 
P-values. 

(b) What conclusions can you draw about the significance of 
regression? 


(c) What conclusions can you draw about the contributions of 
the individual regressors to the model? 


12-22. You have fit a regression model with two regressors 
to a data set that has 20 observations. The total sum of squares 
is 1000 and the model sum of squares is 750. 

(a) What is the value of R? for this model? 

(b) What is the adjusted R? for this model? 

(c) What is the value of the F-statistic for testing the signifi- 
cance of regression? What conclusions would you draw 
about this model if a = 0.05? What if a = 0.01? 

(d) Suppose that you add a third regressor to the model and as 
a result the model sum of squares is now 785. Does it seem 
to you that adding this factor has improved the model? 


12-23. Consider the regression model fit to the soil shear 
strength data in Exercise 12-1. 


(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct the ¢-test on each regression coefficient. What 
are your conclusions, using a = 0.05? Calculate P-values. 


12-24. Consider the absorption index data in Exercise 12-2. 

The total sum of squares for y is SS; = 742.00. 

(a) Test for significance of regression using a = 0.01. What 
is the P-value for this test? 

(b) Test the hypothesis Hy: B, = 0 versus H;: B, # O using 
a = 0.01. What is the P-value for this test? 

(c) What conclusion can you draw about the usefulness of x, 
as a regressor in this model? 


12-25. Aregression model Y = By + Bix, + Box + B3x5 4 

e has been fit to a sample of n = 25 observations. The calcu- 

lated t-ratios B;/se(B,). J = 1, 2,3 are as follows: for B), f = 

4.82, for Bs, f) = 8.21 and for B3, fp = 0.98. 

(a) Find P-values for each of the t-statistics. 

(b) Using a = 0.05, what conclusions can you draw about 
the regressor x;? Does it seem likely that this regressor 
contributes significantly to the model? 


12-26. Consider the electric power consumption data in 

Exercise 12-6. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Use the t-test to assess the contribution of each regressor 
to the model. Using « = 0.05, what conclusions can you 
draw? 


12-27. Consider the gasoline mileage data in Exercise 12-7. 

(a) Test for significance of regression using a = 0.05. What 
conclusions can you draw? 

(b) Find the f-test statistic for each regressor. Using a = 0.05, 
what conclusions can you draw? Does each regressor con- 
tribute to the model? 
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12-28. Consider the wire bond pull strength data in 

Exercise 12-8. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. What conclusions can you 
draw? 

(b) Calculate the t-test statistic for each regression coeffi- 
cient. Using a = 0.05, what conclusions can you draw? 
Do all variables contribute to the model? 


12-29. Reconsider the semiconductor data in Exercise 12-9. 

(a) Test for significance of regression using a = 0.05. What 
conclusions can you draw? 

(b) Calculate the t-test statistic and P-value for each regres- 
sion coefficient. Using a = 0.05, what conclusions can 
you draw? 

12-30. Consider the regression model fit to the arsenic data 

in Exercise 12-12. Use arsenic in nails as the response and age, 

drink use, and cook use as the regressors. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct a t-test on each regression coefficient. What 
conclusions can you draw about the variables in this 
model? Use a = 0.05. 


12-31. Consider the regression model fit to the X-ray in- 

spection data in Exercise 12-11. Use rads as the response. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct a t-test on each regression coefficient. What 
conclusions can you draw about the variables in this 
model? Use a = 0.05. 


12-32. Consider the regression model fit to the nisin extrac- 

tion data in Exercise 12-14. Use nisin extraction as the response. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct a t-test on each regression coefficient. What 
conclusions can you draw about the variables in this 
model? Use a = 0.05. 

(c) Comment on the effect of a small sample size to the tests 
in the previous parts. 


12-33. Consider the regression model fit to the grey range 

modulation data in Exercise 12-15. Use the useful range as the 

response. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct a t-test on each regression coefficient. What 
conclusions can you draw about the variables in this 
model? Use a = 0.05. 


12-34. Consider the regression model fit to the stack loss 

data in Exercise 12-16. Use stack loss as the response. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct a t-test on each regression coefficient. What 
conclusions can you draw about the variables in this 
model? Use a = 0.05. 


12-35. Consider the NFL data in Exercise 12-17. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Conduct the t-test for each regression coefficient. Using 
a = 0.05, what conclusions can you draw about the vari- 
ables in this model? 

(c) Find the amount by which the regressor x, (TD percent- 
age) increases the regression sum of squares, and conduct 
an F-test for Hp: B. =O versus H;: B. #0 using 
a = 0.05. What is the P-value for this test? What conclu- 
sions can you draw? 

12-36. Exercise 12-10 presents data on heat treating gears. 

(a) Test the regression model for significance of regression. 
Using a = 0.05, find the P-value for the test and draw 
conclusions. 

(b) Evaluate the contribution of each regressor to the model 
using the ¢-test with a = 0.05. 

(c) Fit a new model to the response PITCH using new 
regressors x, = SOAKTIME X SOAKPCT and x, = 
DIFFTIME X DIFFPCT. 

(d) Test the model in part (c) for significance of regression 
using a = 0.05. Also calculate the f-test for each regres- 
sor and draw conclusions. 

(e) Estimate o7 for the model from part (c) and compare this 
to the estimate of o? for the model in part (a). Which 
estimate is smaller? Does this offer any insight regarding 
which model might be preferable? 

12-37. Consider the bearing wear data in Exercise 12-19. 

(a) For the model with no interaction, test for significance of 

regression using a = 0.05. What is the P-value for this 

test? What are your conclusions? 

For the model with no interaction, compute the f-statistics 

for each regression coefficient. Using a = 0.05, what con- 

clusions can you draw? 

(c) For the model with no interaction, use the extra sum of 

squares method to investigate the usefulness of adding 

X2 = load to a model that already contains x, = oil vis- 

cosity. Use a = 0.05. 

Refit the model with an interaction term. Test for signifi- 

cance of regression using a = 0.05. 

(e) Use the extra sum of squares method to determine whether 
the interaction term contributes significantly to the model. 
Use a = 0.05. 

(f) Estimate o° for the interaction model. Compare this to the 
estimate of o? from the model in part (a). 

12-38. Data on National Hockey League team performance 

was presented in Exercise 12-18. 

(a) Test the model from this exercise for significance of 
regression using a = 0.05. What conclusions can you draw? 

(b) Use the t-test to evaluate the contribution of each 
regressor to the model. Does it seem that all regressors are 
necessary? Use a = 0.05. 

(c) Fit aregression model relating the number of games won to 
the number of goals for and the number of power play goals 


(b 


YS 


(d 


YS 


s 


12-3 CONFIDENCE INTERVALS IN MULTIPLE LINEAR REGRESSION 


for. Does this seem to be a logical choice of regressors, con- 
sidering your answer to part (b)? Test this new model for 
significance of regression and evaluate the contribution of 
each regressor to the model using the t-test. Use a = 0.05. 


12-39. Data from a hospital patient satisfaction survey were 

presented in Exercise 12-5. 

(a) Test the model from this exercise for significance of regression. 
What conclusions can you draw if « = 0.05? What if «a = 0.01? 

(b) Test the contribution of the individual regressors using the 
t-test. Does it seem that all regressors used in the model are 
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(a) Fit aregression model using only the patient age and severity 
regressors. Test the model from this exercise for significance 
of regression. What conclusions can you draw if a = 0.05? 
What if a = 0.01? 
(b) Test the contribution of the individual regressors using the 
t-test. Does it seem that all regressors used in the model are 
really necessary? 
(c) Find an estimate of the error variance o”. Compare this esti- 
mate of o” with the estimate obtained from the model con- 
taining the third regressor, anxiety. Which estimate is 


really necessary? 


smaller? Does this tell you anything about which model 
might be preferred? 


12-40. Data from a hospital patient satisfaction survey were 
presented in Exercise 12-5. 


12-3. CONFIDENCE INTERVALS IN MULTIPLE LINEAR REGRESSION 


12-3.1 Confidence Intervals on Individual Regression Coefficients 


In multiple regression models, it is often useful to construct confidence interval estimates for 
the regression coefficients {B;}. The development of a procedure for obtaining these confi- 
dence intervals requires that the errors {e;} are normally and independently distributed with 
mean zero and variance o’. This is the same assumption required in hypothesis testing. 
Therefore, the ObSeEvanOn {Y;} are normally and independently distributed with mean By + 
> j=1 Bx; and variance o”. Since the least squares estimator B i is a linear combination of the 
observations, it a that p is normally distributed with mean vector B and covariance 
matrix o?(X’X)~!. Then each of the statistics 


Bee) 
Vere, 


j=0,1,...,k (12-34) 


has af distribution with n — p degrees of freedom, where C;; is the jjth element of the (X’X) ~ : 
matrix, and 6? is the estimate of the error variance, obtained from Equation 12-16. This 


leads to the following 100(1 — «)% confidence interval for the regression coefficient 


Bed = 0; Lvtay he 


Confidence 
Interval on a A 1001 — a) % confidence interval on the regression coefficient Bag = Ollie. 
Regression k in the multiple linear regression model is given by 
Coefficient 
Bj — taan—pVOP°Cy S By SB; + tayan—pV OC; (12-35) 


Because V67?C ‘; is the standard error of the regression coefficient B ), we would also write the 


CI formula as B; = Valais se(B;) = B; = B; Tea se(B)). 
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EXAMPLE 12-7 Wire Bond Strength Confidence Interval 


We will construct a 95% confidence interval on the parameter B, 
in the wire bond pull strength problem. The point estimate of B, 
is 8, = 2.74427 and the diagonal element of (X’X)~! corre- 
sponding to B, is C,, = 0.001671. The estimate of o° is 
6? = 5.2352, and 10.025,22 = 2.074. Therefore, the 95% CI on B, 
is computed from Equation 12-35 as 


2.74427 — (2.074) V(5.2352)(.001671) = B, = 2.74427 
+ (2.074) V(5.2352)(.001671) 


which reduces to 


2.55029 = B, = 2.93825 


Also, computer software such as Minitab can be used to help 
calculate this confidence interval. From the regression output 
in Table 10-4, 8, = 2.74427 and the standard error of 
8, = 0.0935. This standard error is the multiplier of the 
t-table constant in the confidence interval. That is, 0.0935 = 
V(5.2352)(0.001671). Consequently, all the numbers are 
available from the computer output to construct the interval 
and this is the typical method used in practice. 


12-3.2 Confidence Interval on the Mean Response 


We may also obtain a confidence interval on the mean response at a particular point, say, 


Xo1> Xo25 +++» Xox- To estimate the mean response at this point, define the vector 
1 
X01 
Xo ~ | X02 
X0k 


The mean response at this point is E(Y|Xo) = by|x, = Xo, which is estimated by 


fiy|x, = X08 (12-36) 


This estimator is unbiased, since E(xpB) = xB = E(Y|xo) = b-y|x, and the variance of fiy| x, is 


V(By|x,) 


= 0°x)(X'X) xo (12-37) 


A 100(1 — a) % Cl on py)x, can be constructed from the statistic 


Py|x) — PY|x5 


(12-38) 


V 6x) (X'X) 1 xo 


Confidence 
Interval on the 
Mean Response 


For the multiple linear regression model, a 100(1 — «)% confidence interval on the 
mean response at the point X), Xo, - - 


a 4 /a2o ’ =il 
Py|x) — ba/2.n—p oO Xo (xX X) Xo 
A q /A2G! , = 
= LY|x, = LY|x, + ley/2.n—p o xo(X X) "xo 


+> Xox is 


(12-39) 
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Equation 12-39 is a CI about the regression plane (or hyperplane). It is the multiple regression 
generalization of Equation 11-32. 


EXAMPLE 12-8 Wire Bond Strength Confidence Interval on the Mean Response 

The engineer in Example 12-1 would like to construct a 95% Therefore, a 95% CI on the mean pull strength at this point is 
CI on the mean pull strength for a wire bond with wire length found from Equation 12-39 as 

x, = 8 and die height x. = 275. Therefore, 


27.66 — 2.074-V0.23244 < py; < 27.66 


x, =| 8 + 2.074 V0.23244 
275 : 
which reduces to 
The estimated mean response at this point is found from 
Equation 12-36 as 26.66 = pty |x, = 28.66 
2.26379 


Some computer software packages will provide estimates of 
the mean for a point of interest x) and the associated CI. Table 
12-4 shows the Minitab output for Example 12-8. Both the es- 
timate of the mean and the 95% CI are provided. 


fiyp, = x08 =[1 8 275] | 2.74427 | = 27.66 
0.01253 


The variance of fiy)x, is estimated by 


6°x(X'X)!xp = 5.2352 [1 8 275] 


214653 —.007491 —.000340 1 
x | —.007491 .001671 —.000019 8 
—.000340 —.000019 0000015} 275 


= 5.2352 (0.0444) = 0.23244 


12-4 PREDICTION OF NEW OBSERVATIONS 


A regression model can be used to predict new or future observations on the response 
variable Y corresponding to particular values of the independent variables, say, x,, Xo2,--- 5 Xox- 
If xp = [1, X01, Xo2, --- > Xox], a point estimate of the future observation Yo at the point x9), 
X02, +++» Xpx 1S 


30 = xoB (12-40) 


Prediction 
Interval A 100(1 — a)% prediction interval for this future observation is 


So — tayan—p WO*(1 + x4 (X’X)~!xo) 
= Yo = So + tayrp-p VO(1 + x4(X'X)~ 1X9) (12-41) 


This prediction interval is a generalization of the prediction interval given in Equation 11-33 
for a future observation in simple linear regression. If you compare the prediction interval 
Equation 12-41 with the expression for the confidence interval on the mean, Equation 12-39, 
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Figure 12-5 An example of extrapolation in 
multiple regression. 


you will observe that the prediction interval is always wider than the confidence interval. The 
confidence interval expresses the error in estimating the mean of a distribution, while the pre- 
diction interval expresses the error in predicting a future observation from the distribution at 
the point X9. This must include the error in estimating the mean at that point, as well as the in- 
herent variability in the random variable Y at the same value x = Xo. 

Also, one might want to predict the mean of several values of Y, say m, all at the same 
value x = Xo. Because the variance of a sample mean is o”/m, Equation 12-41 is modified as 
follows. Replace the constant 1 under the square root with 1/m to reflect the lower variability 
in the mean of m observations. This results in a narrower interval. 

In predicting new observations and in estimating the mean response at a given point 
Xo1, Xo2 «+ + » Xox, We must be careful about extrapolating beyond the region containing the 
original observations. It is very possible that a model that fits well in the region of the origi- 
nal data will no longer fit well outside of that region. In multiple regression it is often easy 
to inadvertently extrapolate, since the levels of the variables (xj), Xj2,...,Xi),i = 1,2, ...,, 
jointly define the region containing the data. As an example, consider Fig. 12-5, which illus- 
trates the region containing the observations for a two-variable regression model. Note that 
the point (Xo), Xo) lies within the ranges of both regressor variables x, and x, but it is outside the 
region that is actually spanned by the original observations. This is sometimes called a hidden 
extrapolation. Either predicting the value of a new observation or estimating the mean re- 
sponse at this point is an extrapolation of the original regression model. 


EXAMPLE 12-9 Wire Bond Strength Confidence Interval 


Suppose that the engineer in Example 12-1 wishes to con- and the 95% prediction interval is 
struct a 95% prediction interval on the wire bond pull strength 
when the wire length is x, = 8 and the die height is x, = 275. 22.81 = Y) = 32.51 


Note that xp = [1 8 275], and the point estimate of the 


pull strength is $9 = X9B = 27.66. Also, in Example 12-8 . pues a ; 
we calculated x(X’X)~ lx, = 0.04444. Therefore. from Notice that the prediction interval is wider than the confidence 
; interval on the mean response at the same point, calculated in 


Example 12-8. The Minitab output in Table 12-4 also displays 
27.66 — 2.074V5.2352(1 + 0.0444) = Yo = 27.66 this prediction interval. 


+ 2.074 V'5.2352(1 + 0.0444) 


Equation 12-41 we have 


EXERCISES FOR SECTIONS 12-3 AND 12-4 
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12-41. Consider the regression model fit to the shear 

strength of soil in Exercise 12-1. 

(a) Calculate 95% confidence intervals on each regression 
coefficient. 

(b) Calculate a 95% confidence interval on mean strength 
when x, = 18 feet and x, = 43%. 

(c) Calculate 95% prediction interval on strength for the same 
values of the regressors used in the previous part. 


12-42. Consider the soil absorption data in Exercise 12-2. 

(a) Find 95% confidence intervals on the regression coeffi- 
cients. 

(b) Find a 95% confidence interval on mean soil absorption 
index when x, = 200 and x, = 50. 

(c) Find a 95% prediction interval on the soil absorption in- 
dex when x, = 200 and x, = 50. 


12-43. Consider the semiconductor data in Exercise 12-9. 

(a) Find 99% confidence intervals on the regression coefficients. 

(b) Find a 99% prediction interval on HFE when x, = 14.5, 
xX, = 220, and x; = 5.0. 

(c) Find a 99% confidence interval on mean HFE when x; = 
14.5, x. = 220, and x; = 5.0. 


12-44. Consider the electric power consumption data in 

Exercise 12-6. 

(a) Find 95% confidence intervals on B,, B2, B3, and By. 

(b) Find a 95% confidence interval on the mean of Y when 
X, = 75, X) = 24, x3 = 90, and x4 = 98. 

(c) Find a 95% prediction interval on the power consumption 
when x; = 75, x) = 24, x3 = 90, and x, = 98. 

12-45. Consider the bearing wear data in Exercise 12-19. 

(a) Find 99% confidence intervals on B, and B5. 

(b) Recompute the confidence intervals in part (a) after the in- 
teraction term x,x, is added to the model. Compare the 
lengths of these confidence intervals with those computed 
in part (a). Do the lengths of these intervals provide any 
information about the contribution of the interaction term 
in the model? 


12-46. Consider the wire bond pull strength data in Exercise 

12-8. 

(a) Find 95% confidence interval on the regression coefficients. 

(b) Find a 95% confidence interval on mean pull strength 
when x, = 20, x; = 30, x, = 90, and x; = 2.0. 

(c) Find a 95% prediction interval on pull strength when x, = 
20, x3 = 30, x4 = 90, and x; = 2.0. 


12-47. Consider the regression model fit to the X-ray in- 

spection data in Exercise 12-11. Use rads as the response. 

(a) Calculate 95% confidence intervals on each regression 
coefficient. 

(b) Calculate a 99% confidence interval on mean rads at 15 
milliamps and 1 second on exposure time. 

(c) Calculate a 99% prediction interval on rads for the same 
values of the regressors used in the previous part. 


12-48. Consider the regression model fit to the arsenic data 
in Exercise 12-12. Use arsenic in nails as the response and age, 
drink use, and cook use as the regressors. 

(a) Calculate 99% confidence intervals on each regression 
coefficient. 

(b) Calculate a 99% confidence interval on mean arsenic con- 
centration in nails when age = 30, drink use = 4, and 
cook use = 4. 

(c) Calculate a prediction interval on arsenic concentration 
in nails for the same values of the regressors used in the 
previous part. 

12-49. Consider the regression model fit to the coal and 

limestone mixture data in Exercise 12-13. Use density as the 

response. 

(a) Calculate 90% confidence intervals on each regression 
coefficient. 

(b) Calculate a 90% confidence interval on mean density 
when the dielectric constant = 2.3 and the loss factor = 
0.025. 

(c) Calculate a prediction interval on density for the same 
values of the regressors used in the previous part. 

12-50. Consider the regression model fit to the nisin extrac- 

tion data in Exercise 12-14. 

(a) Calculate 95% confidence intervals on each regression 
coefficient. 

(b) Calculate a 95% confidence interval on mean nisin 
extraction when x; = 15.5 and x, = 16. 

(c) Calculate a prediction interval on nisin extraction for the 
same values of the regressors used in the previous part. 

(d) Comment on the effect of a small sample size to the 
widths of these intervals. 

12-51. Consider the regression model fit to the grey range 

modulation data in Exercise 12-15. Use the useful range as the 

response. 

(a) Calculate 99% confidence intervals on each regression 
coefficient. 

(b) Calculate a 99% confidence interval on mean useful range 
when brightness = 70 and contrast = 80. 

(c) Calculate a prediction interval on useful range for the 
same values of the regressors used in the previous part. 

(d) Calculate a 99% confidence interval and a 99% a prediction 
interval on useful range when brightness = 50 and 
contrast = 25. Compare the widths of these intervals to 
those calculated in parts (b) and (c). Explain any differ- 
ences in widths. 

12-52. Consider the stack loss data in Exercise 12-16. 

(a) Calculate 95% confidence intervals on each regression 
coefficient. 

(b) Calculate a 95% confidence interval on mean stack loss 
when x, = 80, x. = 25 and x3; = 90. 

(c) Calculate a prediction interval on stack loss for the same 
values of the regressors used in the previous part. 


i) 


= 
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(d) Calculate a 95% confidence interval and a 95% prediction 
interval on stack loss when x; = 80, x» = 19, and x3 = 93. 
Compare the widths of these intervals to those calculated 
in parts (b) and (c). Explain any differences in widths. 

12-53. Consider the NFL data in Exercise 12-17. 

(a) Find 95% confidence intervals on the regression coefficients. 

(b) What is the estimated standard error of fiy|x, when the 
percentage of completions is 60%, the percentage of TDs 
is 4%, and the percentage of interceptions is 3%. 

(c) Find a 95% confidence interval on the mean rating when 
the percentage of completions is 60%, the percentage of 
TDs is 4%, and the percentage of interceptions is 3%. 

12-54. Consider the heat treating data from Exercise 12-10. 

(a) Find 95% confidence intervals on the regression coeffi- 
cients. 

(b) Find a 95% confidence interval on mean PITCH when 
TEMP = 1650, SOAKTIME = 1.00, SOAKPCT = 
1.10, DIFFTIME = 1.00, and DIFFPCT = 0.80. 

(c) Fit a model to PITCH using regressors x; = SOAK- 
TIME X SOAKPCT and x, = DIFFTIME X DIFFPCT. 
Using the model with regressors x, and x, find a 95% 
confidence interval on mean PITCH when SOAK- 
TIME = 1.00, SOAKPCT = 1.10, DIFFTIME = 1.00, 
and DIFFPCT = 0.80. 

(d) Compare the length of this confidence interval with the 
length of the confidence interval on mean PITCH at 
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12-5.1 Residual Analysis 


the same point from part (b), where an additive model in 
SOAKTIME, SOAKPCT, DIFFTIME, and DIFFPCT was 
used. Which confidence interval is shorter? Does this tell 
you anything about which model is preferable? 


12-55. Consider the gasoline mileage data in Exercise 12-7. 

(a) Find 99% confidence intervals on the regression coeffi- 
cients. 

(b) Find a 99% confidence interval on the mean of Y for the 
regressor values in the first row of data. 

(c) Fit a new regression model to these data using cid, etw, 
and axle as the regressors. Find 99% confidence intervals 
on the regression coefficients in this new model. 

(d) Compare the lengths of the confidence intervals from part 
(c) with those found in part (a). Which intervals are 
longer? Does this offer any insight about which model is 
preferable? 


12-56. Consider the NHL data in Exercise 12-18. 

(a) Find a 95% confidence interval on the regression coeffi- 
cient for the variable GE 

(b) Fit a simple linear regression model relating the response 
variable W to the regressor GF. 

(c) Find a 95% confidence interval on the slope for the simple 
linear regression model from part (b). 

(d) Compare the lengths of the two confidence intervals com- 
puted in parts (a) and (c). Which interval is shorter? Does 
this tell you anything about which model is preferable? 


The residuals from the multiple regression model, defined by e; = y; — J,, play an important 
role in judging model adequacy just as they do in simple linear regression. As noted in 
Section 11-7.1, several residual plots are often useful; these are illustrated in Example 12-10. 
It is also helpful to plot the residuals against variables not presently in the model that are 
possible candidates for inclusion. Patterns in these plots may indicate that the model may be 
improved by adding the candidate variable. 


EXAMPLE 12-10 Wire Bond Strength Residuals 


The residuals for the model from Example 12-1 are shown in 
Table 12-3. A normal probability plot of these residuals is 
shown in Fig. 12-6. No severe deviations from normality are 


The standardized residuals 


obviously apparent, although the two largest residuals (e,; = 
5.84 and e,, = 4.33) do not fall extremely close to a straight 
line drawn through the remaining residuals. 


Standardized 
Residual 


(12-42) 
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Figure 12-6 Normal probability plot of residuals. Figure 12-7 Plot of residuals against J. 


are often more useful than the ordinary residuals when assessing residual magnitude. For the 
wire bond strength example, the standardized residuals corresponding to e;; and e,7 are 
dis = 5.84/V5.2352 = 2.55 and dj)7 = 4.33/V5.2352 = 1.89, and they do not seem unusually 
large. Inspection of the data does not reveal any error in collecting observations 15 and 17, nor 
does it produce any other reason to discard or modify these two points. 

The residuals are plotted against } in Fig. 12-7, and against x, and x, in Figs. 12-8 and 12-9, 
respectively.* The two largest residuals, e,; and e,7, are apparent. Figure 12-8 gives some indica- 
tion that the model underpredicts the pull strength for assemblies with short wire length (x, =< 6) 
and long wire length (x, = 15) and overpredicts the strength for assemblies with intermediate wire 
length (7 = x, <= 14). The same impression is obtained from Fig. 12-7. Either the relationship be- 
tween strength and wire length is not linear (requiring that a term involving x7, say, be added to the 
model), or other regressor variables not presently in the model affected the response. 

In the wire bond strength example we used the standardized residuals d; = e;/ Ve asa 
measure of residual magnitude. Some analysts prefer to plot standardized residuals instead of 
ordinary residuals, because the standardized residuals are scaled so that their standard 
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Figure 12-8 Plot of residuals against x). Figure 12-9 Plot of residuals against x). 


“There are other methods, described in Montgomery, Peck, and Vining (2006) and Myers (1990), that plot a modified 
version of the residual, called a partial residual, against each regressor. These partial residual plots are useful in 
displaying the relationship between the response y and each individual regressor. 
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Studentized 
Residual 


Diagonal Elements 
of Hat Matrix 


deviation is approximately unity. Consequently, large residuals (that may indicate possible 
outliers or unusual observations) will be more obvious from inspection of the residual plots. 

Many regression computer programs compute other types of scaled residuals. One of the 
most popular are the studentized residuals 


r= 9 eae (12-43) 


where h,; is the ith diagonal element of the matrix 
H = X(X’X) 'X’ 
The H matrix is sometimes called the “‘hat’’ matrix, since 
jy = XP = X(X’X) 'X’y = Hy 


Thus H transforms the observed values of y into a vector of fitted values y. 
Since each row of the matrix X corresponds to a vector, say x} = [1, xj, Xj2,---. Xix]; 
another way to write the diagonal elements of the hat matrix is 


hy = x/ (XX) 'x; (12-44) 


Note that apart from o”, h;; is the variance of the fitted value j;. The quantities /;; were used in 
the computation of the confidence interval on the mean response in Section 12-3.2. 

Under the usual assumptions that the model errors are independently distributed with 
mean zero and variance o”, we can show that the variance of the ith residual e; is 


V(e;) = o(1 = hi). i=1,2,...,n 


Furthermore, the ,; elements must fall in the interval 0 < h;; = 1. This implies that the stan- 
dardized residuals understate the true residual magnitude; thus, the studentized residuals 
would be a better statistic to examine in evaluating potential outliers. 

To illustrate, consider the two observations identified in the wire bond strength data 
(Example 12-10) as having residuals that might be unusually large, observations 15 and 17. 
The standardized residuals are 


£15 a ae a = = 
Ve V5.2352  V/MS,  V5.2352 


dys 1.89 


Now /y515 = 0.0737 and hz 7 = 0.2593, so the studentized residuals are 


C15 5.84 
V6(1 — Asis) | W5.2352(1 — 0.0737) 


2.65 


r15 
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and 


e17 4.33 
Vel — Ayy17) V5.2352(1 — 0.2593) 


iq 


Notice that the studentized residuals are larger than the corresponding standardized residuals. 
However, the studentized residuals are still not so large as to cause us serious concern about 
possible outliers. 


12-5.2 Influential Observations 


Cook’s 
Distance 


Figure 12-10 A 
point that is remote 
in x-space. 


When using multiple regression, we occasionally find that some subset of the observations 
is unusually influential. Sometimes these influential observations are relatively far away 
from the vicinity where the rest of the data were collected. A hypothetical situation for two 
variables is depicted in Fig. 12-10, where one observation in x-space is remote from the rest 
of the data. The disposition of points in the x-space is important in determining the proper- 
ties of the model. For example, point (x;;, xj.) in Fig. 12-10 may be very influential in de- 
termining R°, the estimates of the regression coefficients, and the magnitude of the error 
mean square. 

We would like to examine the influential points to determine whether they control many 
model properties. If these influential points are “bad” points, or erroneous in any way, they 
should be eliminated. On the other hand, there may be nothing wrong with these points, but at 
least we would like to determine whether or not they produce results consistent with the rest of 
the data. In any event, even if an influential point is a valid one, if it controls important model 
properties, we would like to know this, since it could have an impact on the use of the model. 

Montgomery, Peck, and Vining (2006) and Myers (1990) describe several methods for 
detecting influential observations. An excellent diagnostic is the distance measure developed 
by Dennis R. Cook. This is a measure of the squared distance between the usual least squares 
estimate of B based on all n observations and the estimate obtained when the ith point is 


removed, say, Bi. The Cook’s distance measure is 


D,;= =) = IP oso ald 


x2 


Region containing 
all observations 
except the ith 
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Clearly, if the ith point is influential, its removal will result in Bi changing considerably from 


the value B. Thus, a large value of D; implies that the ith point is influential. The statistic D; is 


actually computed using 


Cook’s 
Distance pe ae 
Formula iL== “ ae? ween. (12-45) 
P (1 — hi) 


From Equation 12-44 we see that D; consists of the squared studentized residual, which 
reflects how well the model fits the ith observation y; [recall that r; = e;/V 6°(1 — h;)] anda 
component that measures how far that point is from the rest of the data [h;;/(1 — h,;) is a mea- 
sure of the distance of the ith point from the centroid of the remaining n — 1 points]. A value 
of D; > 1 would indicate that the point is influential. Either component of D; (or both) may 


contribute to a large value. 


EXAMPLE 12-11 Wire Bond Strength Cook’s Distances 


Table 12-12 lists the values of the hat matrix diagonals h,, and 
Cook’s distance measure D; for the wire bond pull strength data 
in Example 12-1. To illustrate the calculations, consider the 
first observation: 


pet 
' P (L = hy) 
[e\/VMSz(1 — hy)? fn 
z (1 — An) 


[1.57/V5.2352(1 — 0.1573) 0.1573 


= 0.035 


3 


(1 — 0.1573) 


The Cook distance measure D; does not identify any poten- 
tially influential observations in the data, for no value of D; 


exceeds unity. 


Table 12-12 Influence Diagnostics for the Wire Bond Pull Strength Data 2 


Observations Cook’s Distance Measure Observations Cook’s Distance Measure 
i like D; i hi D; 
1 0.1573 0.035 14 0.1129 0.003 
2 0.1116 0.012 15 0.0737 0.187 
3 0.1419 0.060 16 0.0879 0.001 
4 0.1019 0.021 17 0.2593 0.565 
5 0.0418 0.024 18 0.2929 0.155 
6 0.0749 0.007 19 0.0962 0.018 
7 0.1181 0.036 20 0.1473 0.000 
8 0.1561 0.020 21 0.1296 0.052 
9 0.1280 0.160 20) 0.1358 0.028 
10 0.0413 0.001 23 0.1824 0.002 
11 0.0925 0.013 24 0.1091 0.040 
12 0.0526 0.001 25 0.0729 0.000 


o 
= 
So 
io) 
NO 
i> 


0.001 


EXERCISES FOR SECTION 12-5 
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12-57. Consider the gasoline mileage data in Exercise 12-7. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals and 
comment on the normality assumption. 

(c) Plot residuals versus ) and versus each regressor. Discuss 
these residual plots. 

(d) Calculate Cook’s distance for the observations in this data 
set. Are any observations influential? 


12-58. Consider the electric power consumption data in 

Exercise 12-6. 

(a) Calculate R? for this model. Interpret this quantity. 

(b) Plot the residuals versus ) and versus each regressor. 
Interpret this plot. 

(c) Construct a normal probability plot of the residuals and 
comment on the normality assumption. 


12-59. Consider the regression model for the NFL data in 

Exercise 12-17. 

(a) What proportion of total variability is explained by this model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus } and versus each regressor, and 
comment on model adequacy. 

(d) Are there any influential points in these data? 


12-60. Consider the regression model for the heat treating 

data in Exercise 12-10. 

(a) Calculate the percent of variability explained by this model. 

(b) Construct a normal probability plot for the residuals. 
Comment on the normality assumption. 

(c) Plot the residuals versus ¥ and interpret the display. 

(d) Calculate Cook’s distance for each observation and pro- 
vide an interpretation of this statistic. 


12-61. Consider the regression model fit to the X-ray in- 

spection data in Exercise 12-11. Use rads as the response. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus ) and versus each regressor, and 
comment on model adequacy. 

(d) Calculate Cook’s distance for the observations in this data 
set. Are there any influential points in these data? 


12-62. Consider the regression model fit to the arsenic data 

in Exercise 12-12. Use arsenic in nails as the response and age, 

drink use, and cook use as the regressors. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus ) and versus each regressor, and 
comment on model adequacy. 


(d) Calculate Cook’s distance for the observations in this data 
set. Are there any influential points in these data? 


12-63. Consider the regression model fit to the coal and lime- 

stone mixture data in Exercise 12-13. Use density as the response. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus ) and versus each regressor, and 
comment on model adequacy. 

(d) Calculate Cook’s distance for the observations in this data 
set. Are there any influential points in these data? 


12-64. Consider the regression model fit to the nisin extrac- 

tion data in Exercise 12-14. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus and versus each regressor, and 
comment on model adequacy. 

(d) Calculate Cook’s distance for the observations in this data 
set. Are there any influential points in these data? 


12-65. Consider the regression model fit to the grey range 

modulation data in Exercise 12-15. Use the useful range as the 

response. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus ) and versus each regressor, and 
comment on model adequacy. 

(d) Calculate Cook’s distance for the observations in this data 
set. Are there any influential points in these data? 


12-66. Consider the stack loss data in Exercise 12-16. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus ) and versus each regressor, and 
comment on model adequacy. 

(d) Calculate Cook’s distance for the observations in this data 
set. Are there any influential points in these data? 


12-67. Consider the bearing wear data in Exercise 12-19. 

(a) Find the value of R* when the model uses the regressors 
x, and x5. 

(b) What happens to the value of R? when an interaction term 
X 1X, is added to the model? Does this necessarily imply 
that adding the interaction term is a good idea? 


12-68. Fit a model to the response PITCH in the heat treating 
data of Exercise 12-10 using new regressors x, = SOAKTIME 
X SOAKPCT and x, = DIFFTIME xX DIFFPCT. 


490 CHAPTER 12 MULTIPLE LINEAR REGRESSION 


(a) Calculate the R? for this model and compare it to the 
value of R* from the original model in Exercise 12-10. 
Does this provide some information about which model 
is preferable? 

(b) Plot the residuals from this model versus } and on a 
normal probability scale. Comment on model adequacy. 

(c) Find the values of Cook’s distance measure. Are any ob- 
servations unusually influential? 


12-69. Consider the semiconductor HFE data in Exercise 12-9. 

(a) Plot the residuals from this model versus ». Comment on 
the information in this plot. 

(b) What is the value of R? for this model? 

(c) Refit the model using log HFE as the response variable. 

(d) Plot the residuals versus predicted log HFE for the model 
in part (c). Does this give any information about which 
model is preferable? 

(e) Plot the residuals from the model in part (d) versus the 
regressor x3. Comment on this plot. 

(f) Refit the model to log HFE using x,, x,, and 1/x3, as the re- 
gressors. Comment on the effect of this change in the model. 


12-70. Consider the regression model for the NHL data 

from Exercise 12-18. 

(a) Fit a model using GF as the only regressor. 

(b) How much variability is explained by this model? 

(c) Plot the residuals versus ) and comment on model adequacy. 

(d) Plot the residuals from part (a) versus PPGF, the points 
scored while in power play. Does this indicate that the 
model would be better if this variable were included? 


12-71. The diagonal elements of the hat matrix are often 
used to denote leverage— that is, a point that is unusual in its 
location in the x-space and that may be influential. Generally, 
the ith point is called a leverage point if its hat diagonal 
h, exceeds 2p/n, which is twice the average size of all the hat 
diagonals. Recall that p = k + 1. 

(a) Table 12-12 contains the hat diagonal for the wire bond 
pull strength data used in Example 12-1. Find the average 
size of these elements. 

(b) Based on the criterion above, are there any observations 
that are leverage points in the data set? 


12-6 ASPECTS OF MULTIPLE REGRESSION MODELING 


In this section we briefly discuss several other aspects of building multiple regression models. 
For more extensive presentations of these topics and additional examples refer to Montgomery, 
Peck, and Vining (2006) and Myers (1990). 


12-6.1 Polynomial Regression Models 


The linear model Y = XP + € is a general model that can be used to fit any relationship that 
is linear in the unknown parameters B. This includes the important class of polynomial 
regression models. For example, the second-degree polynomial in one variable 


Y=€6o + Puxt Byte 


(12-46) 


and the second-degree polynomial in two variables 


= 2 2 
Y = Bo + Bix, + Box. + By x7 + Bax + Byoxpx. + € 


are linear regression models. 


(12-47) 


Polynomial regression models are widely used when the response is curvilinear, because 
the general principles of multiple regression can be applied. The following example illustrates 
some of the types of analyses that can be performed. 


EXAMPLE 12-12 Airplane Sidewall Panels 
Sidewall panels for the interior of an airplane are formed in a 
1500-ton press. The unit manufacturing cost varies with the 
production lot size. The data shown below give the average 
cost per unit (in hundreds of dollars) for this product (y) and 
the production lot size (x). The scatter diagram, shown in Fig. 
12-11, indicates that a second-order polynomial may be 
appropriate. 


y 1.81 1.70 1.65 1.55 1.48 1.40 
x 20 25 30 35 40 50 
y 1.30 1.26 1.24 1.21 1.20 1.18 
x 60 65 70 75 80 90 
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for Example 12-11. Lot size, x 
We will fit the model Solving the normal equations X'XB = X'y gives the fitted 
model 
Y=6£o+ Bix t+ Byte 
The y vector, the model matrix X and the B vector are as follows: } = 2.19826629 — 0.02252236x + 0.00012507x? 
1.81 1 20 400 Conclusions: The test for significance of regression is shown 
1.70 1 25 625 in Table 12-13. Since fy = 1762.3 is significant at 1%, we 
1.65 1 30 900 conclude that at least one of the parameters B, and B,, is not 
1.55 1 35 1225 zero. Furthermore, the standard tests for model adequacy do 
1.48 1 40 1600 not reveal any unusual behavior, and we would conclude that 
_ | 1.40 _ 1 50 2500 ee a this is a reasonable model for the sidewall panel cost data. 
¥™ | 1.30 1 60 3600 
1.26 1 65 4225 Pu 
1.24 1 70 4900 
1.21 1 75 5625 
1.20 1 80 6400 
| 1.18 | | 1 90 8100 | 


In fitting polynomials, we generally like to use the lowest-degree model consistent with 
the data. In this example, it would seem logical to investigate the possibility of dropping the 
quadratic term from the model. That is, we would like to test 

Ao: Bi, = 0 
A 1 Buy #0 


Table 12-13 Test for Significance of Regression for the Second-Order Model in Example 12-12 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square ho P-value 
Regression 0.52516 2 0.26258 1762.28 2.12E-12 
Error 0.00134 9 0.00015 


Total 0.5265 11 
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Table 12-14 Analysis of Variance for Example 12-12, Showing the Test for Hp: B,; = 0 


Source of Degrees of Mean 
Variation Sum of Squares Freedom Square to P-value 
Regression SSr(B1,B11|Bo) = 0.52516 D 0.26258 1767.40 2.09E-12 
Linear SSr(B1|Bo) = 0.49416 1 0.49416 2236.12 7.13E-13 
Quadratic SSr(Bi1|Bo.B1) = 0.03100 1 0.03100 208.67 1.56E-7 
Error 0.00133 9 0.00015 
Total 0.5265 11 


The general regression significance test can be used to test this hypothesis. We need to deter- 
mine the “extra sum of squares” due to B;,, or 


SSr(B11|B1-Bo) = SSx(Bi-B11/Bo) — SSa(Bi| Bo) 


The sum of squares SSp(B},B1;|Bo) = 0.52516 from Table 12-13. To find SSp(B1| Bo), we fit a 
simple linear regression model to the original data, yielding 


¥ = 1.90036313 — 0.00910056x 


It can be easily verified that the regression sum of squares for this model is 
SSp(Bi|Bo) = 0.4942 


Therefore, the extra sum of the squares due to B,,, given that 8, and By are in the model, is 


SSr(B11|B1,B0) = SSr(B1-B11| Bo) — SSr(Bi| Bo) 
= 0.5252 — 0.4942 
= 0.031 


The analysis of variance, with the test of Ho: B;, = O incorporated into the procedure, is 
displayed in Table 12-14. Note that the quadratic term contributes significantly to the 
model. 


12-6.2 Categorical Regressors and Indicator Variables 


The regression models presented in previous sections have been based on quantitative vari- 
ables, that is, variables that are measured on a numerical scale. For example, variables such as 
temperature, pressure, distance, and voltage are quantitative variables. Occasionally, we need to 
incorporate categorical, or qualitative, variables in a regression model. For example, suppose 
that one of the variables in a regression model is the operator who is associated with each 
observation y,; Assume that only two operators are involved. We may wish to assign different 
levels to the two operators to account for the possibility that each operator may have a different 
effect on the response. 

The usual method of accounting for the different levels of a qualitative variable is to use 
indicator variables. For example, to introduce the effect of two different operators into a 
regression model, we could define an indicator variable as follows: 


. if the observation is from operator | 
1 if the observation is from operator 2 
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In general, a qualitative variable with r-levels can be modeled by r — 1 indicator variables, 
which are assigned the value of either zero or one. Thus, if there are three operators, the 
different levels will be accounted for by the two indicator variables defined as follows: 


if the observation is from operator 1 
if the observation is from operator 2 
if the observation is from operator 3 


Indicator variables are also referred to as dummy variables. The following example [from 
Montgomery, Peck, and Vining (2006)] illustrates some of the uses of indicator variables; for 
other applications, see Montgomery, Peck, and Vining (2006). 


EXAMPLE 12-13 Surface Finish 

A mechanical engineer is investigating the surface finish of 
metal parts produced on a lathe and its relationship to the speed 
(in revolutions per minute) of the lathe. The data are shown in 
Table 12-15. Note that the data have been collected using two 
different types of cutting tools. Since the type of cutting tool 
likely affects the surface finish, we will fit the model 


Y = Bo + Bix, + Box. + € 


where Y is the surface finish, x, is the lathe speed in revolu- 
tions per minute, and x, is an indicator variable denoting the 
type of cutting tool used; that is, 


_ {° for tool type 302 
i 1, for tool type 416 


The parameters in this model may be easily interpreted. 
If x, = 0, the model becomes 


Y= Bo + Bix, + € 


which is a straight-line model with slope B, and intercept Bp. 
However, if x, = 1, the model becomes 


Y = Bo + Bix + Ba(1) + € = (Bo + Bo) + Bix + € 


which is a straight-line model with slope 8, and intercept 
Bo + Bo. Thus, the model Y = By + Byx + Box, + € im- 
plies that surface finish is linearly related to lathe speed and 
that the slope B, does not depend on the type of cutting tool 
used. However, the type of cutting tool does affect the inter- 
cept, and B, indicates the change in the intercept associated 
with a change in tool type from 302 to 416. 

The model matrix X and y vector for this problem are as 
follows: 


ri 225 0 45.44 
1 200 0 42.03 
1 250 0 50.10 
1 245 0 48.75 
1 235 0 47.92 
1 237 0 47.79 
1 265 0 52.26 
1 259 0 50.52 
1 221 0 45.58 
1 218 0 44.78 

X=|1 224 1] y=| 33.50 
t 2 4 31.23 
1 248 1 37.52 
1 260 1 37.13 
1 243 1 34.70 
1 238 1 33.92 
1 224 1 32.13 
1 251 1 35.47 
ti Be 33.49 
1 216 1 32.29 


The fitted model is 


¥ = 14.27620 + 0.14115x,; — 13.28020x2 


Conclusions: The analysis of variance for this model 
is shown in Table 12-16. Note that the hypothesis Ho: B= 
8. = 0 (significance of regression) would be rejected at any 
reasonable level of significance because the P-value is very 
small. This table also contains the sums of squares 


SSr = SSr(B1,B2| Bo) 
= SSr(Bi|Bo) + SSe(B2|B1,Bo) 
so a test of the hypothesis Hy: B. = 0 can be made. Since this 


hypothesis is also rejected, we conclude that tool type has an 
effect on surface finish. 
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Table 12-15 Surface Finish Data for Example 12-13 


Observation Surface Finish Type of Cutting Observation Surface Finish Type of Cutting 

Number, i yi RPM Tool Number, i Yi RPM Tool 
1 45.44 225 302 11 33.50 224 416 

2 42.03 200 302 12 31.23 212 416 

3 50.10 250 302 13 37.52 248 416 

4 48.75 245 302 14 37.13 260 416 

5 47.92 235 302 15 34.70 243 416 

6 47.79 237 302 16 33.92 238 416 

7 52.26 265 302 17 32.13 224 416 

8 50.52 259 302 18 35.47 251 416 

9 45.58 221 302 19 33.49 232 416 

10 44.78 218 302 20 32.29 216 416 


It is also possible to use indicator variables to investigate whether tool type affects both 
the slope and intercept. Let the model be 


Y = Bo + Bix, + Box. + B3xix + € 
where x, is the indicator variable. Now if tool type 302 is used, x. = 0, and the model is 
Y= Bo + Byxy, + € 
If tool type 416 is used, x. = 1, and the model becomes 


Y= Bo + Bix, + Bo + B3x, + € 
= (Bo + Bo) + (B; + Bs)x, + € 


Note that B, is the change in the intercept and that B, is the change in slope produced by a 
change in tool type. 

Another method of analyzing these data is to fit separate regression models to the data 
for each tool type. However, the indicator variable approach has several advantages. First, 
only one regression model must be fit. Second, by pooling the data on both tool types, 
more degrees of freedom for error are obtained. Third, tests of both hypotheses on the 
parameters B, and B; are just special cases of the extra sum of squares method. 


12-6.3 Selection of Variables and Model Building 


An important problem in many applications of regression analysis involves selecting the set of 
regressor variables to be used in the model. Sometimes previous experience or underlying 
theoretical considerations can help the analyst specify the set of regressor variables to use in a 
particular situation. Usually, however, the problem consists of selecting an appropriate set of 


Table 12-16 Analysis of Variance for Example 12-13 


Source of Degrees of Mean 
Variation Sum of Squares Freedom Square ho P-value 
Regression 1012.0595 Dy 506.0297 1103.69 1.02E-18 
SSp(Bi| Bo) 130.6091 1 130.6091 284.87 4.70E-12 
SSp(B2|B1.B0) 881.4504 1 881.4504 1922.52 6.24E-19 
Error 7.7943 17 0.4585 


Total 1019.8538 19 
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regressors from a set that quite likely includes all the important variables, but we are sure that 
not all these candidate regressors are necessary to adequately model the response Y. 

In such a situation, we are interested in variable selection; that is, screening the candidate 
variables to obtain a regression model that contains the “best” subset of regressor variables. 
We would like the final model to contain enough regressor variables so that in the intended use 
of the model (prediction, for example) it will perform satisfactorily. On the other hand, to keep 
model maintenance costs to a minimum and to make the model easy to use, we would like the 
model to use as few regressor variables as possible. The compromise between these conflict- 
ing objectives is often called finding the “best” regression equation. However, in most prob- 
lems, no single regression model is “best” in terms of the various evaluation criteria that have 
been proposed. A great deal of judgment and experience with the system being modeled is 
usually necessary to select an appropriate set of regressor variables for a regression equation. 

No single algorithm will always produce a good solution to the variable selection problem. 
Most of the currently available procedures are search techniques, and to perform satisfactorily, 
they require interaction with judgment by the analyst. We now briefly discuss some of the more 
popular variable selection techniques. We assume that there are K candidate regressors, x,, 
X>,..., Xx, and a single response variable y. All models will include an intercept term B,, so the 
model with all variables included would have K + 1 terms. Furthermore, the functional form of 
each candidate variable (for example, x; = 1/x, x, = In x, etc.) is assumed to be correct. 


All Possible Regressions 

This approach requires that the analyst fit all the regression equations involving one candi- 
date variable, all regression equations involving two candidate variables, and so on. Then 
these equations are evaluated according to some suitable criteria to select the “best” regres- 
sion model. If there are K candidate regressors, there are 2* total equations to be examined. 
For example, if K = 4, there are 24 = 16 possible regression equations; while if K = 10, 
there are 2'° = 1024 possible regression equations. Hence, the number of equations to be 
examined increases rapidly as the number of candidate variables increases. However, there 
are some very efficient computing algorithms for all possible regressions available and they 
are widely implemented in statistical software, so it is a very practical procedure unless the 
number of candidate regressors is fairly large. Look for a menu choice such as “Best 
Subsets” regression. 

Several criteria may be used for evaluating and comparing the different regression mod- 
els obtained. A commonly used criterion is based on the value of R? or the value of the 
adjusted R?, Reaie Basically, the analyst continues to increase the number of variables in the 
model until the increase in R? or the adjusted R;,, is small. Often, we will find that the R;,, will 
stabilize and actually begin to decrease as the number of variables in the model increases. 
Usually, the model that maximizes Roi is considered to be a good candidate for the best re- 
gression equation. Because we can write R>4, = 1 — {MS,/[SS;/(n — 1)]} and SS,/(n — 1) 
is a constant, the model that maximizes the Roi value also minimizes the mean square error, 
so this is a very attractive criterion. 

Another criterion used to evaluate regression models is the C, statistic, which is a meas- 
ure of the total mean square error for the regression model. We define the total standardized 


mean square error for the regression model as 


1 2 . 
T= > Ah — BP 
i=1 


= 41 Sem - ep + S ver} 


o i=1 i=1 
1 


= — [(bias)* + variance] 
o 
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We use the mean square error from the full K + 1 term model as an estimate of o”; that is, 
6° = MS,(K + 1). Then an estimator of I’, is [see Montgomery, Peck, and Vining (2006) or 


Myers (1990) for the details]: 


C, Statistic 


(12-48) 


If the p-term model has negligible bias, it can be shown that 


E(C,|zero bias) = p 


Therefore, the values of C, for each regression model under consideration should be evalu- 
ated relative to p. The regression equations that have negligible bias will have values of 
C, that are close to p, while those with significant bias will have values of C,, that are sig- 
nificantly greater than p. We then choose as the “best” regression equation either a model 
with minimum C, or a model with a slightly larger C,, that does not contain as much bias 


(ie., Cy = p). 


The PRESS statistic can also be used to evaluate competing regression models. PRESS 
is an acronym for Prediction Error Sum of Squares, and it is defined as the sum of the 
squares of the differences between each observation y, and the corresponding predicted value 
based on a model fit to the remaining n — | points, say Hay So PRESS provides a measure of 
how well the model is likely to perform when predicting new data, or data that was not used 
to fit the regression model. The computing formula for PRESS is 


Prediction 
Error Sum of n n e 2 
Squares PRESS = S}(y; — Hp)? = (; Sr ) 
(PRESS) i=l i=l ii 


where e; = y; — J, is the usual residual. Thus PRESS is easy to calculate from the standard 
least squares regression results. Models that have small values of PRESS are preferred. 


EXAMPLE 12-14 Wine Quality 
Table 12-17 presents data on taste-testing 38 brands of pinot 
noir wine (the data were first reported in an article by Kwan, 
Kowalski, and Skogenboe in an article in the Journal of 
Agricultural and Food Chemistry, Vol. 27, 1979, and it also ap- 
pears as one of the default data sets in Minitab). The response 
variable is y = quality, and we wish to find the “best” regres- 
sion equation that relates quality to the other five parameters. 
Figure 12-12 is the matrix of scatter plots for the wine 
quality data, as constructed by Minitab. We notice that there are 
some indications of possible linear relationships between qual- 
ity and the regressors, but there is no obvious visual impression 
of which regressors would be appropriate. Table 12-18 lists the 
all possible regressions output from Minitab. In this analysis, 


we asked Minitab to present the best three equations for each 
subset size. Note that Minitab reports the values of R’, Reap Cy 
and S = VMS, for each model. From Table 12-18 we see that 
the three-variable equation with x, = aroma, x, = flavor, and 
Xs = oakiness produces the minimum C,, equation, whereas the 
four-variable model, which adds x, = clarity to the previous 
three regressors, results in maximum Rea (or minimum MS;). 
The three-variable model is 


¥ = 6.47 + 0.580x + 1.20x4 — 0.602x;5 
and the four-variable model is 


¥ = 4.99 + 1.79x, + 0.530x + 1.26x4 — 0.659x5 
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Table 12-17 Wine Quality Data 


aa 7) x3 x4 x5 a? 
Clarity Aroma Body Flavor Oakiness Quality 

1 1.0 33) 2.8 3.1 4.1 9.8 

2 1.0 44 49 3.5 3.9 12.6 

3 1.0 3.9 5.3 48 47 11.9 

4 1.0 3.9 2.6 3.1 3.6 11.1 

5 1.0 5.6 5.1 5.5 5.1 133} 

6 1.0 4.6 47 5.0 4.1 12.8 

7 1.0 4.8 4.8 48 33) 12.8 

8 1.0 5.3 4.5 43 5.2 12.0 

9 1.0 43 43 3.9 2.9 13.6 
10 1.0 43 3.9 47 3.9 13.9 
11 1.0 5.1 43 4.5 3.6 14.4 
12 0.5 3.3 5.4 43 3.6 12.3 
13 0.8 5.9 5.7 7.0 4.1 16.1 
14 0.7 revi 6.6 6.7 3.7 16.1 
15 1.0 7A 44 5.8 41 15.5 
16 0.9 5.5 5.6 5.6 4.4 15.5 
17 1.0 6.3 5.4 48 4.6 13.8 
18 1.0 5.0 5:5 5.5 4.1 13.8 
19 1.0 4.6 4.1 43 3.1 11.3 
20 0.9 3.4 5.0 3.4 3.4 79 
21 0.9 6.4 5.4 6.6 4.8 15.1 
22 1.0 5.5 5.3 5.3 3.8 13.5 
23 0.7 47 4.1 5.0 3.7 10.8 
24 0.7 4.1 4.0 4.1 4.0 9.5 
25 1.0 6.0 5.4 5.7 47 a 
26 1.0 43 4.6 47 4.9 11.6 
27 1.0 3.9 4.0 5.1 5.1 11.7 
28 1.0 5.1 4.9 5.0 5.1 11.9 
29 1.0 3.9 44 5.0 4.4 10.8 
30 1.0 4.5 3.7 2.9 3.9 8.5 
31 1.0 5) 43 5.0 6.0 10.7 
32 0.8 4.2 3.8 3.0 47 9.1 
33 1.0 3.3 3.5 43 4.5 12.1 
34 1.0 6.8 5.0 6.0 5.2 14.9 
35 0.8 5.0 ei 5.5 4.8 13.5 
36 0.8 3:5 47 4.2 3.3 12.2 
37 0.8 4.3 5S) 3.5 5.8 10.3 
38 0.8 5.2 4.8 5.7 3.5 13.2 


These models should now be evaluated further using residuals plots and the other tech- 
niques discussed earlier in the chapter, to see if either model is satisfactory with respect to the 
underlying assumptions and to determine if one of them is preferable. It turns out that the 
residual plots do not reveal any major problems with either model. The value of PRESS for 
the three-variable model is 56.0524 and for the four-variable model it is 60.3327. Since 
PRESS is smaller in the model with three regressors, and since it is the model with the small- 
est number of predictors, it would likely be the preferred choice. 
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Figure 12-12 A matrix of scatter plots from Minitab for the wine quality data. 


Table 12-18 Minitab All Possible Regressions Output for the Wine Quality Data 


Best Subsets Regression: Quality versus Clarity, Aroma, ... 


Response is Quality 


7 


R-Sq 
62.4 
50.0 
30.1 
66.1 
65.9 
63.3 
70.4 
68.0 
66.5 
71.5 
70.5 
69.3 
72.1 


APE BWW WNN NY Ke 


R-Sq (adj) C-p S 
61.4 9.0 1.2712 
48.6 23.2 1.4658 
28.2 46.0 1.7335 
64.2 6.8 1.2242 
63.9 7A 1.2288 
61.2 10.0 1.2733 
67.8 3.9 1.1613 
65.2 6.6 1.2068 
63.5 8.4 1.2357 
68.0 4.7 1.1568 
66.9 5.8 1.1769 
65.6 7.1 1.1996 
67.7 6.0 1.1625 
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Stepwise Regression 
Stepwise regression is probably the most widely used variable selection technique. The pro- 
cedure iteratively constructs a sequence of regression models by adding or removing variables 
at each step. The criterion for adding or removing a variable at any step is usually expressed 
in terms of a partial F-test. Let f,, be the value of the F-random variable for adding a variable 
to the model, and let f,,, be the value of the F-random variable for removing a variable from 
the model. We must have f,, = fi, and usually f,, = fou 

Stepwise regression begins by forming a one-variable model using the regressor variable 
that has the highest correlation with the response variable Y. This will also be the regressor 
producing the largest F-statistic. For example, suppose that at this step, x, is selected. At the 
second step, the remaining K — 1 candidate variables are examined, and the variable for 
which the partial F-statistic 


SSr(Bj|B1-Bo) 

| near ae (12-49) 
MS (x; x1) 

is a maximum is added to the equation, provided that f; > fi,. In equation 12-49, MSp (x; x1) 
denotes the mean square for error for the model containing both x, and x;. Suppose that this 
procedure indicates that x, should be added to the model. Now the stepwise regression algo- 
rithm determines whether the variable x, added at the first step should be removed. This is 
done by calculating the F-statistic 


SSr(B1|B2,Bo) 
= (12-50) 

MS p(X, X2) 

If the calculated value f; < f,,,, the variable x, is removed; otherwise it is retained, and we 
would attempt to add a regressor to the model containing both x, and x. 

In general, at each step the set of remaining candidate regressors is examined, and the 
regressor with the largest partial F-statistic is entered, provided that the observed value of 
fexceeds f,,. Then the partial F-statistic for each regressor in the model is calculated, and the 
regressor with the smallest observed value of F is deleted if the observed f < f,,,. The 
procedure continues until no other regressors can be added to or removed from the model. 

Stepwise regression is almost always performed using a computer program. The analyst 
exercises control over the procedure by the choice of f,, and f,,,. Some stepwise regression com- 
puter programs require that numerical values be specified for f,, and f,,,. Since the number of 
degrees of freedom on MS; depends on the number of variables in the model, which changes 
from step to step, a fixed value of f,, and f,,, causes the type I and type II error rates to vary. Some 
computer programs allow the analyst to specify the type I error levels for f,, and f,,,,. However, 
the “advertised” significance level is not the true level, because the variable selected is the one 
that maximizes (or minimizes) the partial F-statistic at that stage. Sometimes it is useful to ex- 
periment with different values of f,, and f,,,, (or different advertised type I error rates) in several 
different runs to see if this substantially affects the choice of the final model. 


EXAMPLE 12-15 Wine Quality Stepwise Regression 


Table 12-19 gives the Minitab stepwise regression output for the Oakiness (step 2), and Aroma (step 3) and that no variables were 


wine quality data. Minitab uses fixed values of a for entering removed. No other variable could be entered, so the algorithm 
and removing variables. The default level is a = 0.15 for both terminated. This is the three-variable model found by all possible 
decisions. The output in Table 12-19 uses the default value. regressions that results in a minimum value of C,,. 


Notice that the variables were entered in the order Flavor (step 1), 


500 


CHAPTER 12 MULTIPLE LINEAR REGRESSION 


Table 12-19 Minitab Stepwise Regression Output for the 
Wine Quality Data 


Stepwise Regression: Quality versus Clarity, Aroma, ... 
Alpha-to-Enter: 0.15 Alpha-to-Remove: 0.15 


Response is Quality on 5 predictors, with N = 38 


Step 1 2 3 
Constant 4.941 6.912 6.467 
Flavor 1.57 1.64 1.20 
T-Value 7.73 8.25 4.36 
P-Value 0.000 0.000 0.000 
Oakiness —0.54 —0.60 
T-Value —1.95 —2.28 
P-Value 0.059 0.029 
Aroma 0.58 
T-Value 2.21 
P-Value 0.034 
S 1.27 1.22 1.16 
R-Sq 62.42 66.11 70.38 
R-Sq(adj) 61.37 64.17 67.76 
C-p 9.0 6.8 3.9 


Forward Selection 

The forward selection procedure is a variation of stepwise regression and is based on the 
principle that regressors should be added to the model one at a time until there are no remain- 
ing candidate regressors that produce a significant increase in the regression sum of squares. 
That is, variables are added one at a time as long as their partial F-value exceeds f,,. Forward 
selection is a simplification of stepwise regression that omits the partial F-test for deleting 
variables from the model that have been added at previous steps. This is a potential weakness 
of forward selection; that is, the procedure does not explore the effect that adding a regressor 
at the current step has on regressor variables added at earlier steps. Notice that if we were to 
apply forward selection to the wine quality data, we would obtain exactly the same results as 
we did with stepwise regression in Example 12-15, since stepwise regression terminated 
without deleting a variable. 


Backward Elimination 
The backward elimination algorithm begins with all K candidate regressors in the model. Then 
the regressor with the smallest partial F-statistic is deleted if this F-statistic is insignificant, that 
is, if f < fou Next, the model with K — | regressors is fit, and the next regressor for potential 
elimination is found. The algorithm terminates when no further regressor can be deleted. 

Table 12-20 shows the Minitab output for backward elimination applied to the wine quality 
data. The a value for removing a variable is « = 0.10. Notice that this procedure removes Body at 
step | and then Clarity at step 2, terminating with the three-variable model found previously. 


Some Comments on Final Model Selection 
We have illustrated several different approaches to the selection of variables in multiple linear 
regression. The final model obtained from any model-building procedure should be subjected 
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Table 12-20 Minitab Backward Elimination Output for the 
Wine Quality Data 


Stepwise Regression: Quality versus Clarity, Aroma, ... 


Backward elimination. Alpha-to-Remove: 0.1 


Response is Quality on 5 predictors, with N = 38 


Step 1 2 S! 
Constant 3.997 4.986 6.467 
Clarity 23 1.8 
T-Value 1.35 1.12 
P-Value 0.187 0.269 
Aroma 0.48 0.53 0.58 
T-Value 1.77 2.00 2.21 
P-Value 0.086 0.054 0.034 
Body 0.27 
T-Value 0.82 
P-Value 0.418 
Flavor 1.17 1.26 1.20 
T-Value 3.84 4.52 4.36 
P-Value 0.001 0.000 0.000 
Oakiness —0.68 —0.66 —0.60 
T-Value —2.52 —2.46 —2.28 
P-Value 0.017 0.019 0.029 
S 1.16 1.16 1.16 
R-Sq 72.06 71.47 70.38 
R-Sq(adj) 67.69 68.01 67.76 
C-p 6.0 47 3.9 


to the usual adequacy checks, such as residual analysis, lack-of-fit testing, and examination of 
the effects of influential points. The analyst may also consider augmenting the original set of 
candidate variables with cross-products, polynomial terms, or other transformations of the 
original variables that might improve the model. A major criticism of variable selection meth- 
ods such as stepwise regression is that the analyst may conclude there is one “best” regression 
equation. Generally, this is not the case, because several equally good regression models can 
often be used. One way to avoid this problem is to use several different model-building tech- 
niques and see if different models result. For example, we have found the same model for the 
wine quality data using stepwise regression, forward selection, and backward elimination. The 
same model was also one of the two best found from all possible regressions. The results from 
variable selection methods frequently do not agree, so this is a good indication that the three- 
variable model is the best regression equation. 

If the number of candidate regressors is not too large, the all-possible regressions method 
is recommended. We usually recommend using the minimum MS; and C, evaluation criteria 
in conjunction with this procedure. The all-possible regressions approach can find the “best” 
regression equation with respect to these criteria, while stepwise-type methods offer no such 
assurance. Furthermore, the all-possible regressions procedure is not distorted by dependen- 
cies among the regressors, as stepwise-type methods are. 
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12-6.4 Miulticollinearity 


Variance 
Inflation 
Factor (VIF) 


In multiple regression problems, we expect to find dependencies between the response variable 
Y and the regressors x; In most regression problems, however, we find that there are also 
dependencies among the regressor variables x;. In situations where these dependencies are 
strong, we say that multicollinearity exists. Multicollinearity can have serious effects on the 
estimates of the regression coefficients and on the general applicability of the estimated model. 

The effects of multicollinearity may be easily demonstrated. The diagonal elements of the 
matrix C = (X’X)~!can be written as 


1 
Ci (I — RB) j=1,2,...,k 
where Rj is the coefficient of multiple determination resulting from regressing x; on the other 
k — 1 regressor variables. We can think of R? as a measure of the correlation benvesd x; and 
the other regressors. Clearly, the stronger the Tineae dependency of x; on the remaining regres- 
sor variables, and hence the stronger the multicollinearity, the jareet the value of Ri will 
be. Recall that V(B;) = = 0°C;. Therefore, we say that the variance of B; is “inflated” 
by the quantity (1 — R 7) ' Consequently, we define the variance inflation factor for Bj as 


1 
(1 — Rj) 


VIF(B;) = j=1,2,...,k (12-51) 


These factors are an important measure of the extent to which multicollinearity is present. If the 
columns of the model matrix X are orthogonal, then the regressors are completely uncorrelated, 
and the variance inflation factors will all be unity. So any VIF that exceeds one indicates some 
level of multicollinearity in the data. 

Although the estimates of the regression coefficients are very imprecise when multi- 
collinearity is present, the fitted model equation may still be useful. For example, suppose we 
wish to predict new observations on the response. If these predictions are interpolations in the 
original region of the x-space where the multicollinearity is in effect, satisfactory predictions 
es often be obtained, because while individual 8; may be poorly estimated, the function 

Dj=1 Bij j May be estimated quite well. On the other hand, if the prediction of new observa- 
tions requires extrapolation beyond the original region of the x-space where the data were col- 
lected, generally we would expect to obtain poor results. Extrapolation usually requires good 
estimates of the individual model parameters. 

Multicollinearity arises for several reasons. It will occur when the analyst collects data 
such that a linear constraint holds approximately among the columns of the X matrix. For ex- 
ample, if four regressor variables are the components of a mixture, such a constraint will 
always exist because the sum of the components is always constant. Usually, these constraints 
do not hold exactly, and the analyst might not know that they exist. 

The presence of multicollinearity can be detected in several ways. Two of the more easily 
understood of these will be discussed briefly. 


1. The variance inflation factors, defined in Equation 12-51, are very useful measures 
of multicollinearity. The larger the variance inflation factor, the more severe the 
multicollinearity. Some authors have suggested that if any variance inflation factor 
exceeds 10, multicollinearity is a problem. Other authors consider this value too 
liberal and suggest that the variance inflation factors should not exceed 4 or 5. 
Minitab will calculate the variance inflation factors. Table 12-4 presents the Minitab 


SS 
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multiple regression output for the wire bond pull strength data. Since both VIF, and 
VIF, are small, there is no problem with multicollinearity. 


2. If the F-test for significance of regression is significant, but tests on the individual 
regression coefficients are not significant, multicollinearity may be present. 


Several remedial measures have been proposed for solving the problem of multi- 
collinearity. Augmenting the data with new observations specifically designed to break up the 
approximate linear dependencies that currently exist is often suggested. However, this is 
sometimes impossible because of economic reasons or because of the physical constraints that 
relate the x; Another possibility is to delete certain variables from the model, but this approach 
has the disadvantage of discarding the information contained in the deleted variables. 

Since multicollinearity primarily affects the stability of the regression coefficients, it would 
seem that estimating these parameters by some method that is less sensitive to multicollinearity 
than ordinary least squares would be helpful. Several methods have been suggested. One alterna- 
tive to ordinary least squares, ridge regression, can be useful in combating multicollinearity. For 
more details on ridge regression, there are more extensive presentations in Montgomery, Peck, 
and Vining (2006) and Myers (1990). 


EXERCISES FOR SECTION 12-6 


12-72. An article entitled “A Method for Improving the 
Accuracy of Polynomial Regression Analysis” in the Journal 
of Quality Technology (1971, pp. 149-155) reported the fol- 
lowing data on y = ultimate shear strength of a rubber com- 
pound (psi) and x = cure temperature (°F). 


y 7710 800 840 810 
x 280 284 292 295 
y 735 640 | 590 560 
x 298 305 | 308 315 


(a) Fit a second-order polynomial to these data. 

(b) Test for significance of regression using a = 0.05. 

(c) Test the hypothesis that B,, = Ousinga = 0.05. 

(d) Compute the residuals from part (a) and use them to eval- 
uate model adequacy. 


12-73. Consider the following data, which result from an 
experiment to determine the effect of x = test time in hours at a 
particular temperature on y = change in oil viscosity: 

(a) Fit a second-order polynomial to the data. 


y —1.42 —1.39 —1.55 —1.89 —2.43 
x 2S .50 Be) 1.00 1.25 
y —3.15 —4.05 —5.15 —6.43 —7.89 
x 1.50 1.75 2.00 2.25 2.50 


(b) Test for significance of regression using a = 0.05. 

(c) Test the hypothesis that B,;, = 0 using a = 0.05. 

(d) Compute the residuals from part (a) and use them to eval- 
uate model adequacy. 


12-74. The following data were collected during an experiment 
to determine the change in thrust efficiency ( y, in percent) as the 
divergence angle of a rocket nozzle (x) changes: 


y | 2460 | 24.71 23.90 39.50 39.60 57.12 
x 4.0 4.0 4.0 5.0 5.0 6.0 
y | 67.11 67.24 67.15 77.87 80.11 84.67 
x 6.5, 6.5 6.75 7.0 7A 73 


(a) Fit a second-order model to the data. 

(b) Test for significance of regression and lack of fit using 
a = 0.05. 

(c) Test the hypothesis that B,,; = 0, using a = 0.05. 

(d) Plot the residuals and comment on model adequacy. 

(e) Fit a cubic model, and test for the significance of the cubic 
term using a = 0.05. 


12-75. Anarticle in the Journal of Pharmaceuticals Sciences 
(Vol. 80, 1991, pp. 971-977) presents data on the observed mole 
fraction solubility of a solute at a constant temperature and the 
dispersion, dipolar, and hydrogen bonding Hansen partial solu- 
bility parameters. The data are as shown in the following table, 
where y is the negative logarithm of the mole fraction solubility, 
x, is the dispersion partial solubility, x, is the dipolar partial sol- 
ubility, and x; is the hydrogen bonding partial solubility. 
(a) Fit the model Y = By + Byx; + Box + B3x3 4 
Biox1X2 + Bi3x1x3 + Bo3x2"3 + BuixT + Br2x3 + Byax3 t+ €. 
(b) Test for significance of regression using a = 0.05. 
(c) Plot the residuals and comment on model adequacy. 
(d) Use the extra sum of squares method to test the contribu- 
tion of the second-order terms using a = 0.05. 
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Observation 
Number y xy Xp X3 
1 0.22200 Ia 0.0 0.0 
2 0.39500 8.7 0.0 0.3 
3 0.42200 8.8 0.7 1.0 
4 0.43700 8.1 4.0 0.2 
5 0.42800 9.0 0.5 1.0 
6 0.46700 8.7 1.5 2.8 
7 0.44400 9.3 Dail 1.0 
8 0.37800 7.6 5.1 3.4 
9 0.49400 10.0 0.0 0.3 
10 0.45600 8.4 3.7 4.1 
11 0.45200 9.3 3.6 2.0 
12 0.11200 17 2.8 7A 
13 0.43200 9.8 4.2 2.0 
14 0.10100 73 2:5 6.8 
15 0.23200 8.5 2.0 6.6 
16 0.30600 9.5 2.5 5.0 
17 0.09230 74 2.8 78 
18 0.11600 7.8 2.8 iyi 
19 0.07640 eh 3.0 8.0 
20 0.43900 10.3 1.7 4.2 
21 0.09440 7.8 3.3 8.5 
22 0.11700 71 3.9 6.6 
23 0.07260 eh 4.3 25 
24 0.04120 74 6.0 10.9 
25 0.25100 13 2.0 52 
26 0.00002 7.6 7.8 20.7 


12-76. Consider the arsenic concentration data in Exercise 

12-10. 

(a) Discuss how you would model the information about the 
person’s sex. 

(b) Fit a regression model to the arsenic in nails using 
age, drink use, cook use, and the person’s sex as the 
regressors. 

(c) Is there evidence that the person’s sex affects arsenic in the 
nails? Why? 

12-77. Consider the gasoline mileage data in Exercise 12-7. 

(a) Discuss how you would model the information about the 
type of transmission in the car. 

(b) Fita regression model to the gasoline mileage using cid, etw, 
and the type of transmission in the car as the regressors. 

(c) Is there evidence that the type of transmission (L4, L5, 
or M6) affects gasoline mileage performance? 


12-78. Consider the surface finish data in Example 12-15. 
Test the hypothesis that two different regression models (with 
different slopes and intercepts) are required to adequately model 
the data. Use indicator variables in answering this question. 


12-79. Consider the X-ray inspection data in Exercise 12-11. 

Use rads as the response. Build regression models for the data 

using the following techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? Why? 

12-80. Consider the electric power data in Exercise 12-6. Build 

regression models for the data using the following techniques: 

(a) All possible regressions. Find the minimum C, and mini- 
mum MS, equations. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? 


12-81. Consider the regression model fit to the coal and 

limestone mixture data in Exercise 12-13. Use density as the 

response. Build regression models for the data using the fol- 

lowing techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? Why? 

12-82. Consider the wire bond pull strength data in Ex- 

ercise 12-8. Build regression models for the data using the 

following methods: 

(a) All possible regressions. Find the minimum C,, and mini- 
mum MS, equations. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? 


12-83. Consider the grey range modulation data in Exercise 

12-15. Use the useful range as the response. Build regression 

models for the data using the following techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? Why? 

12-84. Consider the nisin extraction data in Exercise 12-14. 

Build regression models for the data using the following 

techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 
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(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? Why? 

12.85. Consider the stack loss data in Exercise 12-16. 

Build regression models for the data using the following 

techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? Why? 

(f) Remove any influential data points and repeat the model 
building in the previous parts? Does your conclusion in 
part (e) change? 


12-86. Consider the NHL data in Exercise 12-18. Build 

regression models for these data with regressors GF through 

FG using the following methods: 

(a) All possible regressions. Find the minimum C, and mini- 
mum MS, equations. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Which model would you prefer? 


12-87. Use the football data in Exercise 12-17 to build 

regression models using the following techniques: 

(a) All possible regressions. Find the equations that minimize 
MS, and that minimize C,,. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the various models obtained. Which model 
seems “best,” and why? 


12-88. Consider the arsenic data in Exercise 12-12. Use 

arsenic in nails as the response and age, drink use, and cook 

use as the regressors. Build regression models for the data 

using the following techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? Why? 

(f) Now construct an indicator variable and add the person’s 
sex to the list of regressors. Repeat the model building in the 
previous parts. Does your conclusion in part (e) change? 


12-89. Consider the gas mileage data in Exercise 12-7. 
Build regression models for the data from the numerical re- 
gressors using the following techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 
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(e) Comment on the models obtained. Which model would 
you prefer? Why? 

(f) Now construct indicator variable for trns and drv and add 
these to the list of regressors. Repeat the model building in the 
previous parts. Does your conclusion in part (e) change? 


12-90. When fitting polynomial regression models, we 
often subtract x from each x value to produce a “centered” 
regressor x’ = x — xX. This reduces the effects of dependencies 
among the model terms and often leads to more accurate esti- 
mates of the regression coefficients. Using the data from 
Exercise 12-72, fit the model Y = By + Bix’ + Bi, (x')? + €. 
(a) Use the results to estimate the coefficients in the uncen- 
tered model Y = Bo + Bix + Bix? + e. Predict y when 
x = 285°F. Suppose that we use a standardized variable 
x’ = (x — x)/s,, where s, is the standard deviation of x, in 
constructing a polynomial regression model. Fit the model 
Y= B+ Biz’ + Bue’)? + 

(b) What value of y do you predict when x = 285°F? 

(c) Estimate the regression coefficients in the unstandardized 
model Y = By + Bix 4 By yx? + €, 

(d) What can you say about the relationship between SS, and 
R’ for the standardized and unstandardized models? 

(e) Suppose that y’ = (y — y)/s, is used in the model along 
with x’. Fit the model and comment on the relationship 
between SS; and R? in the standardized model and the 
unstandardized model. 


12-91. Consider the data in Exercise 12-75. Use all the terms 

in the full quadratic model as the candidate regressors. 

(a) Use forward selection to identify a model. 

(b) Use backward elimination to identify a model. 

(c) Compare the two models obtained in parts (a) and (b). 
Which model would you prefer and why? 


12-92. We have used a sample of 30 observations to fit a 

regression model. The full model has nine regressors, the vari- 

ance estimate is 6? = MS; = 100, and R* = 0.92. 

(a) Calculate the F-statistic for testing significance of regres- 
sion. Using a = 0.05, what would you conclude? 

(b) Suppose that we fit another model using only four of the 

original regressors and that the error sum of squares for 

this new model is 2200. Find the estimate of o* for this 

new reduced model. Would you conclude that the reduced 

model is superior to the old one? Why? 

Find the value of C, for the reduced model in part (b). 

Would you conclude that the reduced model is better than 

the old model? 


12-93. A sample of 25 observations is used to fit a regres- 

sion model in seven variables. The estimate of o* for this full 

model is MS; = 10. 

(a) A forward selection algorithm has put three of the original 
seven regressors in the model. The error sum of squares 
for the three-variable model is SS; = 300. Based on C,, 

would you conclude that the three-variable model has any 

remaining bias? 
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(b) After looking at the forward selection model in part (a), 
suppose you could add one more regressor to the model. 
This regressor will reduce the error sum of squares to 275. 
Will the addition of this variable improve the model? Why? 


Supplemental Exercises 


12-94. Consider the computer output below. 


The regression equation is 


Y =5174+ 11.5x1 — 8.14 x2 + 10.9 x3 

Predictor Coef SE Coef T P 
Constant 517.46 11.76 ? ? 
xl 11.4720 ? 36.50 ? 
x2 —8.1378 0.1969 ? ? 
x3 10.8565 0.6652 a ? 
S = 10.2560 R-Sq =? R-Sq (adj) = ? 

Analysis of Variance 

Source DF SS MS 

Regression 2 347300 115767 ? 2 
Residual Error 16 4 105 

Total 19 348983 


(a) Fill in the missing values. Use bounds for the P-values. 

(b) Is the overall model significant at « = 0.05? Is it signifi- 
cant at a = 0.01? 

(c) Discuss the contribution of the individual regressors to the 


model. 
12-95. Consider the following inverse of the model matrix: 
0.893758 —0.028245 —0.0175641 
(X'X)' =] -0.028245  0.0013329 0.0001547 
—0.017564  0.0001547 0.0009 108 


(a) How many variables are in the regression model? 

(b) If the estimate of o7 is 50, what is the estimate of the vari- 
ance of each regression coefficient? 

(c) What is the standard error of the intercept? 


12-96. The data shown in Table 12-22 represent the thrust 
of a jet-turbine engine (y) and six candidate regressors: x, = 
primary speed of rotation, x. = secondary speed of rotation, 
x3 = fuel flow rate, x, = pressure, x; = exhaust temperature, 
and x, = ambient temperature at time of test. 

(a) Fit a multiple linear regression model using x; = fuel flow 
rate, x, = pressure, and x, = exhaust temperature as the 
regressors. 

(b) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What are your conclusions? 

(c) Find the ¢test statistic for each regressor. Using a = 0.01, 
explain carefully the conclusion you can draw from these 
statistics. 

(d) Find R’ and the adjusted statistic for this model. 

(e) Construct a normal probability plot of the residuals and 
interpret this graph. 


(f) Plot the residuals versus y. Are there any indications of 
inequality of variance or nonlinearity? 

(g) Plot the residuals versus x3. Is there any indication of 
nonlinearity? 

(h) Predict the thrust for an engine for which x; = 28900, 
x4 = 170, and x; = 1589. 


12-97. Consider the engine thrust data in Exercise 12-96. 
Refit the model using y* = In y as the response variable and 
x} = In x, as the regressor (along with x, and xs). 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test and state your conclusions. 

(b) Use the t-statistic to test Hy: B; = 0 versus H;: B; # 0 for 
each variable in the model. If « = 0.01, what conclusions 
can you draw? 

(c) Plot the residuals versus $* and versus x3. Comment on 
these plots. How do they compare with their counterparts 
obtained in Exercise 12-96 parts (f) and (g)? 


12-98. Transient points of an electronic inverter are 
influenced by many factors. Table 12-21 gives data on the tran- 
sient point (y, in volts) of PMOS-NMOS inverters and five can- 
didate regressors: x; = width of the NMOS device, x, = length 


Table 12-21 Transient Point of an Electronic Inverter 


Observation 
Number 34 Xp x3 sy Xs y 
1 3 3 3 3 0 0.787 
2 8 30 8 8 0 0.293 
3 3 6 6 6 0 1.710 
4 4 4 4 12 0 0.203 
5 8 7 6 5 0 0.806 
6 10 20 5 5 0 4.713 
7 8 6 3 3 Ds) 0.607 
8 6 24 4 4 25 9.107 
9 4 10 12 4 25 9.210 
10 16 12 8 4 25 1.365 
11 3 10 8 8 2S) 4.554 
12 8 3 3 3 25 0.293 
13 3 6 3 3 50 DIB) 
14 3 8 8 3 50 9.167 
15 4 8 4 8 50 0.694 
16 5 2 2 2 50 0.379 
17 2 2 2 3 50 0.485 
18 10 15 3 3 50 3.345 
19 15 6 2 3 50 0.208 
20 15 6 2, 3 75 0.201 
21 10 4 3 3 WS 0.329 
22 3 8 2 2 75 4.966 
U3} 6 6 6 4 ie) 1.362 
24 2 3 8 6 75 1.515 
Ds 3 3 8 8 ve) 0.751 


Table 12-22 Thrust of a Jet-Turbine Engine 
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Observation 

Number y 55 Xo x3 X4 Xs X6 
1 4540 2140 20640 30250 205 1732 99 
2 4315 2016 20280 30010 195 1697 100 
3 4095 1905 19860 29780 184 1662 97 
4 3650 1675 18980 29330 164 1598 97 
5 3200 1474 18100 28960 144 1541 97 
6 4833 2239 20740 30083 216 1709 87 
Wl 4617 2120 20305 29831 206 1669 87 
8 4340 1990 19961 29604 196 1640 87 
9 3820 1702 18916 29088 171 1572 85 
10 3368 1487 18012 28675 149 1522 85 
11 4445 2107 20520 30120 195 1740 101 
12 4188 1973 20130 29920 190 1711 100 
13 3981 1864 19780 29720 180 1682 100 
14 3622 1674 19020 29370 161 1630 100 
15 3125 1440 18030 28940 139 1572 101 
16 4560 2165 20680 30160 208 1704 98 
17 4340 2048 20340 29960 199 1679 96 
18 4115 1916 19860 29710 187 1642 94 
19 3630 1658 18950 29250 164 1576 94 
20 3210 1489 18700 28890 145 1528 94 
21 4330 2062 20500 30190 193 1748 101 
22 4119 1929 20050 29960 183 1713 100 
28 3891 1815 19680 29770 173 1684 100 
24 3467 1595 18890 29360 153 1624 99 
25 3045 1400 17870 28960 134 1569 100 
26 4411 2047 20540 30160 193 1746 99 
27, 4203 1935 20160 29940 184 1714 99 
28 3968 1807 19750 29760 173 1679 99 
23) 3531 1591 18890 29350 153 1621 99 
30 3074 1388 17870 28910 133 1561 99 
31 4350 2071 20460 30180 198 1729 102 
32 4128 1944 20010 29940 186 1692 101 
33 3940 1831 19640 29750 178 1667 101 
34 3480 1612 18710 29360 156 1609 101 
35 3064 1410 17780 28900 136 1552 101 
36 4402 2066 20520 30170 197 1758 100 
37 4180 1954 20150 29950 188 1729 99 
38 3973 1835 19750 29740 178 1690 99 
39 3530 1616 18850 29320 156 1616 99 
40 3080 1407 17910 28910 137 1569 100 
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of the NMOS device, x; = width of the PMOS device, x, = 

length of the PMOS device, and x, = temperature (°C). 

(a) Fit a multiple linear regression model that uses all regres- 
sors to these data. Test for significance of regression using 
a = 0.01. Find the P-value for this test and use it to draw 
your conclusions. 

(b) Test the contribution of each variable to the model using 
the ¢-test with a = 0.05. What are your conclusions? 

(c) Delete x; from the model. Test the new model for signifi- 
cance of regression. Also test the relative contribution of 
each regressor to the new model with the t-test. Using 
a = 0.05, what are your conclusions? 

(d) Notice that the MS; for the model in part (c) is smaller 
than the MS; for the full model in part (a). Explain why 
this has occurred. 

(e) Calculate the studentized residuals. Do any of these seem 

unusually large? 

Suppose that you learn that the second observation was 

recorded incorrectly. Delete this observation and refit the 

model using x,, x5, x3, and x, as the regressors. Notice that 
the R? for this model is considerably higher than the R? for 

either of the models fitted previously. Explain why the R? 

for this model has increased. 

(g) Test the model from part (f) for significance of regression 
using a = 0.05. Also investigate the contribution of each 
regressor to the model using the ¢-test with a = 0.05. 
What conclusions can you draw? 

(h) Plot the residuals from the model in part (f ) versus y and 
versus each of the regressors x,, X», x3, and x4. Comment 
on the plots. 

12-99. Consider the inverter data in Exercise 12-98. Delete 

observation 2 from the original data. Define new variables as 

follows: y* = Iny, x4 = 1/Vx,, x8 = Vox, x¥ = 1/V x3, 
and xf = Vx4. 

(a) Fit a regression model using these transformed regressors 
(do not use xs). 

(b) Test the model for significance of regression using a = 0.05. 
Use the f-test to investigate the contribution of each vari- 
able to the model (a = 0.05). What are your conclusions? 

(c) Plot the residuals versus 3" and versus each of the trans- 
formed regressors. Comment on the plots. 


(f 


Y 


12-100. Following are data on y = green liquor (g/l) and 
x = paper machine speed (feet per minute) from a Kraft paper 
machine. (The data were read from a graph in an article in the 
Tappi Journal, March 1986.) 


y 16.0 15.8 15.6 15.5 14.8 
x 1700 1720 1730 1740 1750 
y 14.0 13.5 13.0 12.0 11.0 
x 1760 1770 1780 1790 1795 
(a) Fitthe model Y = By + Byx + Box? + Eusing least squares. 


(b) Test for significance of regression using a = 0.05. What 
are your conclusions? 

(c) Test the contribution of the quadratic term to the model, 
over the contribution of the linear term, using an F-statistic. 
If a = 0.05, what conclusion can you draw? 

(d) Plot the residuals from the model in part (a) versus 7. 
Does the plot reveal any inadequacies? 

(e) Construct a normal probability plot of the residuals. 
Comment on the normality assumption. 

12-101. Consider the jet engine thrust data in Exercise 

12-96 and 12-97. Define the response and regressors as in 

Exercise 12-97. 

(a) Use all possible regressions to select the best regression 
equation, where the model with the minimum value of 
MS; is to be selected as “best.” 

(b) Repeat part (a) using the Cp criterion to identify the best 
equation. 

(c) Use stepwise regression to select a subset regression model. 

(d) Compare the models obtained in parts (a), (b), and (c) above. 

(e) Consider the three-variable regression model. Calculate 
the variance inflation factors for this model. Would you 
conclude that multicollinearity is a problem in this model? 

12-102. Consider the electronic inverter data in Exercise 

12-98 and 12-99. Define the response and regressors variables 

as in Exercise 12-99, and delete the second observation in the 

sample. 

(a) Use all possible regressions to find the equation that min- 
imizes C,,. 

(b) Use all possible regressions to find the equation that 
minimizes MS;. 

(c) Use stepwise regression to select a subset regression model. 

(d) Compare the models you have obtained. 

12-103. A multiple regression model was used to relate y = 

viscosity of a chemical product to x, = temperature and x, = 

reaction time. The data set consisted of n = 15 observations. 

(a) The estimated regression coefficients were By = 300.00, 
8, = 0.85, and B®, = 10.40. Calculate an estimate of 
mean viscosity when x, = 100°F and x = 2 hours. 

(b) The sums of squares were SS; = 1230.50 and SS; = 
120.30. Test for significance of regression using a = 
0.05. What conclusion can you draw? 

(c) What proportion of total variability in viscosity is 
accounted for by the variables in this model? 

(d) Suppose that another regressor, x; = stirring rate, is added 
to the model. The new value of the error sum of squares is 
SS; = 117.20. Has adding the new variable resulted in a 
smaller value of MS,? Discuss the significance of this result. 

(e) Calculate an F-statistic to assess the contribution of x; to the 
model. Using a = 0.05, what conclusions do you reach? 

12-104. Tables 12-23 and 12-24 present statistics for the Major 

League Baseball 2005 season (source: The Sports Network). 

(a) Consider the batting data. Use model-building methods to 
predict Wins from the other variables. Check that the 
assumptions for your model are valid. 
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Table 12-23 Major League Baseball 2005 Season 
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American League Batting 


Team W AVG R H 2B 3B HR_- RBI BB SO SB  GIDP LOB’ OBP 

Chicago 99 0.262 741 1450 253 23 200 713 435 1002 = 137 122 1032 0.322 

Boston 95 0.281 910 1579 339 21 199 863 653 1044 45 135 1249 0.357 

LA Angels 95 0.27 761 1520 278 30 147 726 447 ~~ 848 161 125 1086 0.325 

New York 95 0.276 886 1552 259 16 229 847 637 989 84 125 1264 0.355 

Cleveland 93 0.271 790 1522 337 30 207 760 503 1093 62 128 1148 0.334 

Oakland 88 0.262 772 1476 310 20 155 739 537 819 31 148 1170 0.33 

Minnesota 83 0.259 688 1441 269 32 134 644 485 978 102 155 1109 = 0.323 

Toronto 80 0.265 775 1480 307 39 136 735 486 955 72 126 1118 0.331 

Texas 79 0.267 865 1528 311 29 260 834 495 1112 67 123 1104 = 0.329 

Baltimore 74 0.269 729 1492 296 27 189 700 447 902 83 145 1103. 0.327 

Detroit 71 0.272 723 1521 283 45 168 678 384 1038 #8 66 137 1077. = 0.321 

Seattle 69 0.256 699 1408 289 34 130 657 466 986 102 115 1076 «0.317 

Tampa Bay 67 0.274 750 1519 289 40 157 717 412 990 151 133 1065 0.329 

Kansas City 56 0.263 7Ol 1445 289 34 126 653 424 1008 53 139 1062 0.32 
National League Batting 

Team W AVG R H 2B 3B HR_- RBI BB SO SB  GIDP LOB’ OBP 

St. Louis 100 0.27 805 1494 287 26 170 757 534 947 83 127 1152 0.339 

Atlanta 90 0.265 769 1453 308 37 184 733 534 1084 92 146 1114 = 0.333 

Houston 89 360.256 «66693. 1400) «=—281 032s sd161)S 6540481) 1037~—Sss« 115 116 1136 =0.322 

Philadelphia 88 = 0.27 807 1494 282 35 167 760 639 1083 116 107 1251 0.348 

Florida 83 0.272 «6717+ =—1499 306 S32) s«128—S- 678 = 512s 9:18 96 144 1181 0.339 

New York 83. 0.258 «89722 «1421 «36279 «8632S 175s «683. 486— 1075S s«153 103 1122 0.322 

San Diego 82. 0.257 684 1416 269 39 130 655 600 £977 99 122 1220 0.333 

Milwaukee 81 0.259 726 1413 327 19 175 689 531 1162 79 137 1120 0.331 

Washington 81 0.252 639 1367 311 32 117 «615 491 1090 45 130 1137 0.322 

Chicago 79 0.27 703 1506 323 23 194 674 419 920 65 131 1133 0.324 

Arizona 77 ~=©0.256 696 1419 291 27 191 670 606 1094 67 132 1247 = 0.332 

San Francisco 75 0.261 649 1427 299 26 128 617 431 901 71 147 1093 0.319 

Cincinnati 73 0.261 820 1453 335 15 222 784 611 1303 72 116 1176 =0.339 

Los Angeles 71 60.253) 685 =1374 «©2840 «21S 149) 653) 5411094 58 139 1135 0.326 

Colorado 67 0.267 740 1477 280 34 150 704 509 1103 65 125 1197 = 0.333 

Pittsburgh 67 0.259 680 1445 292 38 139 656 471 1092 73 130 1193 0.322 

Batting LOB Left on base 

WwW Wins OBP On-base percentage 

AVG Batting average 

R Runs Pitching 

H Hits ERA Earned run average 

2B Doubles SV Saves 

3B Triples H Hits 

HR Home runs R Runs 

RBI Runs batted in ER Earned runs 

BB Walks HR Home runs 

SO Strikeouts BB Walks 

SB Stolen bases SO Strikeouts 

GIDP Grounded into double play AVG Opponent batting average 
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Table 12-24 Major League Baseball 2005 Season 


American League Pitching 


Team WwW ERA SV H R ER HR BB SO AVG 
Chicago 99 3.61 54 1392 645 502) 167 459 1040 0.249 
Boston 95 4.74 38 1550 805 752 164 440 959 0.276 
LA Angels oS 3.68 54 1419 643 598 158 443 1126 0.254 
New York 95 4.52 46 1495 789 718 164 463 985 0.269 
Cleveland 28) 3.61 51 1363 642 582 157 413 1050 0.247 
Oakland 88 3.69 38 1315 658 594 154 504 1075 0.241 
Minnesota 83 3.71 44 1458 662 604 169 348 965 0.261 
Toronto 80 4.06 35 1475 705 653 185 444 958 0.264 
Texas 79 4.96 46 1589 858 794 159 522 3) 0.279 
Baltimore 74 4.56 38 1458 800 724 180 580 1052 0263 
Detroit 71 4.51 37 1504 787 WS 193 461 907 0.272 
Seattle 69 4.49 39 1483 751 712 179 496 892 0.268 
Tampa Bay 67 awe) 43 1570 936 851 194 615 949 0.28 
Kansas City 56 5.49 25 1640 935 862 178 580 924 0.291 
National League Pitching 

Team WwW ERA SV H R ER HR BB SO AVG 
St. Louis 100 3.49 48 1399 634 560 153 443 974 0.257 
Atlanta 90 3.98 38 1487 674 639 145 520 929 0.268 
Houston 89 3.51 45 1336 609 563 155, 440 1164 0.246 
Philadelphia 88 4.21 40 1379 726 672 189 487 1159 0.253 
Florida 83 4.16 42 1459 732 666 116 563 1125 0.266 
New York 83 3.76 38 1390 648 599 135 491 1012 0.255 
San Diego 82 4.13 45 1452 726 668 146 503 1133 0.259 
Milwaukee 81 3.97 46 1382 697 635 169 569 1173 0.251 
Washington 81 3.87 51 1456 673 627 140 539 997 0.262 
Chicago 79 4.19 39 1357 714 671 186 576 1256 0.25 
Arizona Wl 4.84 45 1580 856 783 193 537 1038 0.278 
San Francisco ie) 4.33 46 1456 745 695 151 592 972 0.263 
Cincinnati We 5.15 31 1657 889 820 219 492 CBS) 0.29 
Los Angeles 71 4.38 40 1434 755 695 182 471 1004 0.263 
Colorado 67 Soll} 37 1600 862 808 175 604 981 0.287 
Pittsburgh 67 4.42 35 1456 769 706 162 612 958 0.267 

Batting LOB Left on base 

WwW Wins OBP On-base percentage 

AVG Batting average 

R Runs Pitching 

H Hits ERA Earned run average 

2B Doubles SV Saves 

3B Triples H Hits 

HR Home runs R Runs 

RBI Runs batted in ER Earned runs 

BB Walks HR Home runs 

SO Strikeouts BB Walks 

SB Stolen bases SO Strikeouts 

GID Grounded into double play AVG Opponent batting average 
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(b) Repeat part (a) for the pitching data. 

(c) Use both the batting and pitching data to build a model to 
predict Wins. What variables are most important? Check 
that the assumptions for your model are valid. 


12-105. Anarticle in the Journal of the American Ceramics 
Society (1992, Vol. 75, pp. 112-116) describes a process for 
immobilizing chemical or nuclear wastes in soil by dissolving 
the contaminated soil into a glass block. The authors mix CaO 
and Na,O with soil and model viscosity and electrical conduc- 
tivity. The electrical conductivity model involves six regres- 
sors, and the sample consists of n = 14 observations. 

(a) For the six-regressor model, suppose that SS; = 0.50 and 
R? = 0.94. Find SS; and SSp, and use this information to 
test for significance of regression with a = 0.05. What are 
your conclusions? 

(b) Suppose that one of the original regressors is deleted from 
the model, resulting in R? = 0.92. What can you conclude 
about the contribution of the variable that was removed? 
Answer this question by calculating an F-statistic. 

(c) Does deletion of the regressor variable in part (b) result in 
a smaller value of MS, for the five-variable model, in 
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comparison to the original six-variable model? Comment 

on the significance of your answer. 
12-106. Exercise 12-5 introduced the hospital patient satis- 
faction survey data. One of the variables in that data set is a 
categorical variable indicating whether the patient is a medical 
patient or a surgical patient. Fit a model including this indica- 
tor variable to the data, using all three of the other regressors. 
Is there any evidence that the service the patient is on (medical 
versus surgical) has an impact on the reported satisfaction? 


12-107. Consider the inverse model matrix shown below. 


0.125 0 0 0 
0 0.125 0 0 
(X'x)"1 = 
0 0 0.125 0 
0 0 0 0.125 


(a) How many regressors are in this model? 
(b) What was the sample size? 


(c) Notice the special diagonal structure of the matrix. What 
does that tell you about the columns in the original X matrix? 


MIND-EXPANDING EXERCISES 


12-108. Consider a multiple regression model with k 
regressors. Show that the test statistic for significance 
of regression can be written as 

R?/k 
(1 — R?)/(n — k — 1) 
Suppose that n = 20, k = 4, and R? = 0.90. Ifa = 0.05, 
what conclusion would you draw about the relationship 
between y and the four regressors? 


Fo= 


12-109. A regression model is used to relate a response 
y to k = 4 regressors with n = 20. What is the smallest 
value of R? that will result in a significant regression if 
a = 0.05? Use the results of the previous exercise. Are 
you surprised by how small the value of R? is? 


12-110. Show that we can express the residuals from 
a multiple regression model as e = (I — H)y, where 
H= X(X'X) ''X’. 

12-111. Show that the variance of the ith residual e; in 
a multiple regression model is o7(1 — /,;) and that the 
covariance between e; and e; is =o fi, where the h’s are 
the elements of H = X(X X)'X’. 

12-112. Consider the multiple linear regression model 
y = XB + e. If B denotes the least squares estimator of 
B, show that B = B + Re, where R = (X'X) !X’. 
12-113. Constrained Least Squares. Suppose we 
wish to find the least squares estimator of 6 in the model 


y = XP + € subject to a set of equality constraints, say, 
TB =c. 


(a) Show that the estimator is 


B. = B + (X'x)! 
x T'[T(X'Xy'T'] (ec — TB) 


where B = (X’X)'X’y. 

(b) Discuss situations where this model might be 
appropriate. 

12-114. Piecewise Linear Regression. Suppose that 

y is piecewise linearly related to x. That is, different lin- 

ear relationships are appropriate over the intervals 

0 <x Sx" andx* <x < ~, 

(a) Show how indicator variables can be used to fit such 
a piecewise linear regression model, assuming that 
the point x* is known. 

(b) Suppose that at the point x* a discontinuity occurs 

in the regression function. Show how indicator vari- 

ables can be used to incorporate the discontinuity 
into the model. 

Suppose that the point x* is not known with cer- 

tainty and must be estimated. Suggest an approach 

that could be used to fit the piecewise linear 
regression model. 


(c 


Nee 
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IMPORTANT TERMS AND CONCEPTS 


All possible regressions 

Analysis of variance test 
in multiple regression 

Categorical variables 

Confidence interval on 
the mean response 

Cp statistic 

Extra sum of squares 
method 

Hidden extrapolation 


Indicator variables 
Inference (test and 
intervals) on individ- 
ual model parameters 
Influential observations 
Model parameters and 
their interpretation 
in multiple 
regression 
Multicollinearity 


Multiple Regression 

Outliers 

Polynomial regression 
model 

Prediction interval on a 
future observation 

PRESS statistic 

Residual analysis and 
model adequacy 
checking 


Significance of 
regression 

Stepwise regression and 
related methods 

Variance Inflation 


Factor (VIF) 


Design and Analysis 
of Single-Factor Experiments: 
The Analysis of Variance 


Experiments are a natural part of the engineering and scientific decision-making process. 
Suppose, for example, that a civil engineer is investigating the effects of different curing 
methods on the mean compressive strength of concrete. The experiment would consist of 
making up several test specimens of concrete using each of the proposed curing methods 
and then testing the compressive strength of each specimen. The data from this experi- 
ment could be used to determine which curing method should be used to provide maxi- 
mum mean compressive strength. 

If there are only two curing methods of interest, this experiment could be designed 
and analyzed using the statistical hypothesis methods for two samples introduced in 
Chapter 10. That is, the experimenter has a single factor of interest—curing methods— 
and there are only two levels of the factor. If the experimenter is interested in determining 
which curing method produces the maximum compressive strength, the number of speci- 
mens to test can be determined from the operating characteristic curves in Appendix 
Chart VIL, and the t-test can be used to decide if the two means differ. 

Many single-factor experiments require that more than two levels of the factor be 
considered. For example, the civil engineer may want to investigate five different curing 
methods. In this chapter we show how the analysis of variance (frequently abbreviated 
ANOVA) can be used for comparing means when there are more than two levels of a single 
factor. We will also discuss randomization of the experimental runs and the important role 
this concept plays in the overall experimentation strategy. In the next chapter, we will 
show how to design and analyze experiments with several factors. 
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514 CHAPTER 13 DESIGN AND ANALYSIS OF SINGLE-FACTOR EXPERIMENTS: THE ANALYSIS OF VARIANCE 


CHAPTER OUTLINE 
13-1 DESIGNING ENGINEERING 13-3 THE RANDOM-EFFECTS MODEL 
EXPERIMENTS 


13-3.1 Fixed Versus Random Factors 


13-2. COMPLETELY RANDOMIZED 13-3.2 ANOVA and Variance 


SINGLE-FACTOR EXPERIMENT 


Components 
13-2.1 Example: Tensile Strength 13-4. RANDOMIZED COMPLETE BLOCK 
13-2.2 Analysis of Variance DESIGN 
13-2.3 Multiple Comparisons Following 13-4.1 Design and Statistical Analysis 
the ANOVA 13-4.2 Multiple Comparisons 
13-2.4 Residual Analysis and Model 13-4.3 Residual Analysis and Model 
Checking 


Checking 
13-2.5 Determining Sample Size 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 


1. Design and conduct engineering experiments involving a single factor with an arbitrary number 
of levels 


. Understand how the analysis of variance is used to analyze the data from these experiments 
. Assess model adequacy with residual plots 

. Use multiple comparison procedures to identify specific differences between means 

. Make decisions about sample size in single-factor experiments 

. Understand the difference between fixed and random factors 

. Estimate variance components in an experiment involving random factors 


. Understand the blocking principle and how it is used to isolate the effect of nuisance factors 


CO mANAKA WN BW WN 


. Design and conduct experiments involving the randomized complete block design 


13-1 DESIGNING ENGINEERING EXPERIMENTS 


Statistically based experimental design techniques are particularly useful in the engineering 
world for solving many important problems: discovery of new basic phenomena that can lead to 
new products, and commercialization of new technology including new product development, 
new process development, and improvement of existing products and processes. For example, 
consider the development of a new process. Most processes can be described in terms of several 
controllable variables, such as temperature, pressure, and feed rate. By using designed experi- 
ments, engineers can determine which subset of the process variables has the greatest influence 
on process performance. The results of such an experiment can lead to 


¢ Improved process yield 


e Reduced variability in the process and closer conformance to nominal or target 
requirements 


e¢ Reduced design and development time 


e Reduced cost of operation 
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Experimental design methods are also useful in engineering design activities, where new 
products are developed and existing ones are improved. Some typical applications of statisti- 
cally designed experiments in engineering design include 


e Evaluation and comparison of basic design configurations 
¢ Evaluation of different materials 


e Selection of design parameters so that the product will work well under a wide variety 
of field conditions (or so that the design will be robust) 


e Determination of key product design parameters that affect product performance 


The use of experimental design in the engineering design process can result in products that 
are easier to manufacture, products that have better field performance and reliability than their 
competitors, and products that can be designed, developed, and produced in less time. 

Designed experiments are usually employed sequentially. That is, the first experiment 
with a complex system (perhaps a manufacturing process) that has many controllable variables 
is often a screening experiment designed to determine which variables are most important. 
Subsequent experiments are used to refine this information and determine which adjustments 
to these critical variables are required to improve the process. Finally, the objective of the ex- 
perimenter is optimization, that is, to determine which levels of the critical variables result in 
the best process performance. 

Every experiment involves a sequence of activities: 


1. Conjecture—the original hypothesis that motivates the experiment. 
2. Experiment—the test performed to investigate the conjecture. 

3. Analysis—the statistical analysis of the data from the experiment. 
4 


. Conclusion—what has been learned about the original conjecture from the experi- 
ment. Often the experiment will lead to a revised conjecture, and a new experiment, 
and so forth. 


The statistical methods introduced in this chapter and Chapter 14 are essential to good 
experimentation. All experiments are designed experiments; unfortunately, some of them are 
poorly designed, and as a result, valuable resources are used ineffectively. Statistically designed 
experiments permit efficiency and economy in the experimental process, and the use of statistical 
methods in examining the data results in scientific objectivity when drawing conclusions. 


13-2 COMPLETELY RANDOMIZED SINGLE-FACTOR 
EXPERIMENT 


13-2.1 Example: Tensile Strength 


A manufacturer of paper used for making grocery bags is interested in improving the tensile 
strength of the product. Product engineering thinks that tensile strength is a function of the hard- 
wood concentration in the pulp and that the range of hardwood concentrations of practical inter- 
est is between 5 and 20%. A team of engineers responsible for the study decides to investigate four 
levels of hardwood concentration: 5%, 10%, 15%, and 20%. They decide to make up six test spec- 
imens at each concentration level, using a pilot plant. All 24 specimens are tested on a laboratory 
tensile tester, in random order. The data from this experiment are shown in Table 13-1. 

This is an example of a completely randomized single-factor experiment with four levels 
of the factor. The levels of the factor are sometimes called treatments, and each treatment has 
six observations or replicates. The role of randomization in this experiment is extremely 
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Hardwood concentration (%) 


Table 13-1 Tensile Strength of Paper (psi) 


Hardwood Observations 
Concentration (%) 1 D; 3 4 5 6 Totals Averages 
5 7 8 15 11 9 10 60 10.00 
10 12 17 13 18 19 15 94 15.67 
15 14 18 19 17 16 18 102 17.00 
20 19 25 22 23 18 20 127 21.17 


383 15.96 


important. By randomizing the order of the 24 runs, the effect of any nuisance variable that 
may influence the observed tensile strength is approximately balanced out. For example, 
suppose that there is a warm-up effect on the tensile testing machine; that is, the longer the 
machine is on, the greater the observed tensile strength. If all 24 runs are made in order of 
increasing hardwood concentration (that is, all six 5% concentration specimens are tested 
first, followed by all six 10% concentration specimens, etc.), any observed differences in 
tensile strength could also be due to the warm-up effect. The role of randomization to iden- 
tify causality was discussed in Section 10-1. 

It is important to graphically analyze the data from a designed experiment. Figure 13-1(a) 
presents box plots of tensile strength at the four hardwood concentration levels. This figure 
indicates that changing the hardwood concentration has an effect on tensile strength; specifi- 
cally, higher hardwood concentrations produce higher observed tensile strength. Furthermore, 
the distribution of tensile strength at a particular hardwood level is reasonably symmetric, 
and the variability in tensile strength does not change dramatically as the hardwood concen- 
tration changes. 

Graphical interpretation of the data is always useful. Box plots show the variability of the 
observations within a treatment (factor level) and the variability between treatments. We now 
discuss how the data from a single-factor randomized experiment can be analyzed statistically. 


10 15 20 Het H+) ul +73 M+ 


(a) (b) 


(a) Box plots of hardwood concentration data. (b) Display of the model in Equation 13-1 for the completely 
randomized single-factor experiment. 
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13-2.2 Analysis of Variance 


Suppose we have a different levels of a single factor that we wish to compare. Sometimes, 
each factor level is called a treatment, a very general term that can be traced to the early 
applications of experimental design methodology in the agricultural sciences. The response 
for each of the a treatments is a random variable. The observed data would appear as shown 
in Table 13-2. An entry in Table 13-2, say y,, represents the jth observation taken under treat- 
ment i. We initially consider the case in which there are an equal number of observations, n, 
on each treatment. 
We may describe the observations in Table 13-2 by the linear statistical model 


- bd i= 1,2,...,4 (13-1) 
gO RE SH) 120m 


where Y;; is a random variable denoting the (ij)th observation, jis a parameter common to all 
treatments called the overall mean, 7; is a parameter associated with the ith treatment called 
the ith treatment effect, and €, is a random error component. Notice that the model could 
have been written as 


_ i=1,2,...,a 
tyavtey| 
where pw, = p + 7; 1s the mean of the ith treatment. In this form of the model, we see that each 
treatment defines a population that has mean ,, consisting of the overall mean w plus an effect 
7; that is due to that particular treatment. We will assume that the errors €,; are normally and 
independently distributed with mean zero and variance o*. Therefore, each treatment can be 
thought of as a normal population with mean 1; and variance o*. See Fig. 13-1(b). 

Equation 13-1 is the underlying model for a single-factor experiment. Furthermore, since 
we require that the observations are taken in random order and that the environment (often 
called the experimental units) in which the treatments are used is as uniform as possible, this 
experimental design is called a completely randomized design (CRD). 

The a factor levels in the experiment could have been chosen in two different ways. First, 
the experimenter could have specifically chosen the a treatments. In this situation, we wish to 
test hypotheses about the treatment means, and conclusions cannot be extended to similar treat- 
ments that were not considered. In addition, we may wish to estimate the treatment effects. This 
is called the fixed-effects model. Alternatively, the a treatments could be a random sample 
from a larger population of treatments. In this situation, we would like to be able to extend the 
conclusions (which are based on the sample of treatments) to all treatments in the population, 
whether or not they were explicitly considered in the experiment. Here the treatment effects 7; 
are random variables, and knowledge about the particular ones investigated is relatively 


Table 13-2 Typical Data for a Single-Factor Experiment 


Treatment Observations Totals Averages 
1 Ju y12 tee Yin Jie Ye 
2 y21 y22 tee Y2n y2- Yo. 
a Val Yar a Yan Ya. Yai 
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ANOVA Sum 
of Squares 
Identity: 
Single Factor 
Experiment 


unimportant. Instead, we test hypotheses about the variability of the 7; and try to estimate this 
variability. This is called the random effects, or components of variance, model. 

In this section we develop the analysis of variance for the fixed-effects model. The 
analysis of variance is not new to us; it was used previously in the presentation of regression 
analysis. However, in this section we show how it can be used to test for equality of treatment 
effects. In the fixed-effects model, the treatment effects 1; are usually defined as deviations 
from the overall mean 1, so that 


> 7 =0 (13-2) 
i=1 


Let y,. represent the total of the observations under the ith treatment and y;. represent the average 
of the observations under the ith treatment. Similarly, let y.. represent the grand total of all obser- 
vations and y.. represent the grand mean of all observations. Expressed mathematically, 


n 


yp. = D i y;. = y;-/n i= 1,2,...,a 


j=l 


a n 


y. = > > Vij y.. = y../N (13-3) 


i=l j=l 


where N = an is the total number of observations. Thus, the “dot” subscript notation implies 
summation over the subscript that it replaces. 

We are interested in testing the equality of the a treatment means 1), 2, ..., Wy Using 
Equation 13-2, we find that this is equivalent to testing the hypotheses 


Apt) =m =" =%]=0 
H,:7; #0 for at least one i (13-4) 


Thus, if the null hypothesis is true, each observation consists of the overall mean wp plus a 
realization of the random error component e¢,. This is equivalent to saying that all N 
observations are taken from a normal distribution with mean p and variance o”. Therefore, 
if the null hypothesis is true, changing the levels of the factor has no effect on the mean 
response. 

The ANOVA partitions the total variability in the sample data into two component parts. 
Then, the test of the hypothesis in Equation 13-4 is based on a comparison of two independ- 
ent estimates of the population variance. The total variability in the data is described by the 
total sum of squares 


a n 


SSsr= Yd (7 — y-¥ 


i=l j=l 


The partition of the total sum of squares is given in the following definition. 


The sum of squares identity is 


a n 


Ge = Tr =e > CES 2 2 CaS er ees) 


i=l j=l i=l i=l j=l 


or symbolically 


SS ae SS Treatments ap SSz (13-6) 


Expected 
Values of Sums 
of Squares: 
Single Factor 
Experiment 


ANOVA 
F-Test 
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The identity in Equation 13-5 shows that the total variability in the data, measured by the 
total corrected sum of squares SS;, can be partitioned into a sum of squares of differences 
between treatment means and the grand mean denoted SSyyeatments and a sum of squares of dif- 
ferences of observations within a treatment from the treatment mean denoted SS;. Differences 
between observed treatment means and the grand mean measure the differences between treat- 
ments, while differences of observations within a treatment from the treatment mean can be 
due only to random error. 

We can gain considerable insight into how the analysis of variance works by examining 
the expected values of SSyyeatments and SS. This will lead us to an appropriate statistic for test- 
ing the hypothesis of no differences among treatment means (or all 7; = 0). 


The expected value of the treatment sum of squares is 


a 
= 2 2 
E(SS treatments) pa (a oat 1)o +n > Tj 
i=l 
and the expected value of the error sum of squares is 


E(SS,) = a(n — 1)o 


There is also a partition of the number of degrees of freedom that corresponds to the sum 
of squares identity in Equation 13-5. That is, there are an = N observations; thus, SS; has 
an — | degrees of freedom. There are a levels of the factor, so SStyeatments has a — 1 degrees of 
freedom. Finally, within any treatment there are n replicates providing n — 1 degrees of free- 
dom with which to estimate the experimental error. Since there are a treatments, we have 
a(n — 1) degrees of freedom for error. Therefore, the degrees of freedom partition is 


an-—1=a-—1+a(n-1) 
The ratio 


MS Treatments = SS Treatments/(@ ~ 1) 


is called the mean square for treatments. Now if the null hypothesis Hp: t, = T = 
- = 7, = Ois true, MS treatments iS an unbiased estimator of o* because }*_,7; = 0. However, 
if H, is true, MSqyeatments €Stimates o7 plus a positive term that incorporates variation due to the 
systematic difference in treatment means. 
Note that the error mean square 


MS, = SS_/[a(n — 1)] 


is an unbiased estimator of o7 regardless of whether or not H, is true. We can also show that 
MS treatments 2d MS; are independent. Consequently, we can show that if the null hypothesis H 
is true, the ratio 


= SS treanents/\@ = 1) = MS Treatments 
° SSp/[a(n — 1)] MS; 


(13-7) 
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has an F-distribution with a — 1 and a(n — 1) degrees of freedom. Furthermore, from the 
expected mean squares, we know that MS; is an unbiased estimator of o*. Also, under the null 
hypothesis, MSyy,eatments is an unbiased estimator of o*. However, if the null hypothesis is 
false, the expected value of MS tyeatments i8 greater than o*. Therefore, under the alternative 
hypothesis, the expected value of the numerator of the test statistic (Equation 13-7) 
is greater than the expected value of the denominator. Consequently, we should reject Hy 
if the statistic is large. This implies an upper-tail, one-tail critical region. Therefore, 
we would reject Ho if fo > fe,a-1,ain—1) Where fy is the computed value of Fy from 
Equation 13-7. 

Efficient computational formulas for the sums of squares may be obtained by 
expanding and simplifying the definitions of SSt,eatments and SS7. This yields the following 
results. 


The sums of squares computing formulas for the ANOVA with equal sample sizes in 
each treatment are 
aon y 2 
SSi= > > -> (13-8) 
= N 
and 
a Se 2 
Yi- y.. 
SS Treatments = a n N (13-9) 
The error sum of squares is obtained by subtraction as 
SSe = SSp — SS treatments (13-10) 


The computations for this test procedure are usually summarized in tabular form as shown in 
Table 13-3. This is called an analysis of variance (or ANOVA) table. 


EXAMPLE 13-1 Tensile Strength ANOVA 
Consider the paper tensile strength experiment described in 
Section 13-2.1. This experiment is a CRD. We can use the 
analysis of variance to test the hypothesis that different hard- 
wood concentrations do not affect the mean tensile strength of 
the paper. 

The hypotheses are 


Ao: tT) = 1) = 73 = T4 = O 


1: 7; # 0 for at least one i 


We will use a = 0.01. The sums of squares for the analysis of 
variance are computed from Equations 13-8, 13-9, and 13-10 
as follows: 


4 6 yr 
SSp = » Hy 


i=l j=l 


Table 13-3. The Analysis of Variance for a Single-Factor Experiment, Fixed-Effects Model 


Source of Degrees of 
Variation Sum of Squares Freedom Mean Square Fo 

MS Treatments 
Treatments SStreatments @= Il MSreatments a 
Error SSi a(n — 1) MS; 
Total SS an — 1 
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Table 13-4. ANOVA for the Tensile Strength Data 


Source of Degrees of 
Variation Sum of Squares Freedom Mean Square to P-value 
Hardwood 
concentration 382.79 3 127.60 19.60 3.59 E-6 
Error 130.17 20 6.51 
Total 512.96 23 
4 ye. yr. the paper. We can also find a P-value for this test statistic as 
SS Treatments = > n N follows: 
_ (60) + (94)? + (102)? + (127)? (383)? P = P(Fy9 > 19.60) ~ 3.59 X 1078 
6 24 


SSe = SS a SS Treatments 
= 512.96 — 382.79 = 130.17 


The ANOVA is summarized in Table 13-4. Since fp.91,3,20 = 
4.94, we reject Hy and conclude that hardwood concentra- 
tion in the pulp significantly affects the mean strength of 


Minitab Output 


Since P = 3.59 X 10 °is considerably smaller than « = 0.01, 
we have strong evidence to conclude that H, is not true. 

Practical Interpretation: There is strong evidence to 
conclude that hardwood concentration has an effect on ten- 
sile strength. However, the ANOVA does not tell as which 
levels of hardwood concentration result in different tensile 
strength means. We will see how to answer this question 
below. 


Many software packages have the capability to analyze data from designed experiments using 
the analysis of variance. Table 13-5 presents the output from the Minitab one-way analysis of 
variance routine for the paper tensile strength experiment in Example 13-1. The results agree 
closely with the manual calculations reported previously in Table 13-4. 

The Minitab output also presents 95% confidence intervals on each individual treatment 
mean. The mean of the ith treatment is defined as 


Bi = B+ IT; 


A point estimator of 1; is fi; = Y;. Now, if we assume that the errors are normally distributed, 


each treatment average is normally distributed with mean 1; and variance o7/n. Thus, if 0” were 
known, we could use the normal distribution to construct a CI. Using MS; as an estimator of o 
(the square root of MS, is the “Pooled StDev” referred to in the Minitab output), we would base 


the CI on the f distribution, since 


T= Yi. = Bi 
7 VMS;/n 


has a ¢ distribution with a(n — 1) degrees of freedom. This leads to the following definition 


of the confidence interval. 


Confidence 
Interval ona 
Treatment 
Mean 


A 100(1 — qa) percent confidence interval on the mean of the ith treatment ,, is 


S MS¢ 2 
Jie ta/2,a(n-1) “n a Ca/2,a(n-1) “nN 


MS 
= (13-11) 
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Table 13-5 Minitab Analysis of Variance Output for Example 13-1 


One-Way ANOVA: Strength versus CONC 
Analysis of Variance for Strength 


Source DF SS MS F P 
Conc 3 382.79 127.60 19.61 0.000 
Error 20 130.17 6.51 
Total 23 512.96 Individual 95% Cls For Mean 
Based on Pooled StDev 
Level N Mean StDev - 4 t t -4 
2) 6 10.000 2.828 (—*—) 

10 6 15.667 2.805 (—*—) 

15 6 17.000 1.789 (—*—) 

20 6 21.167 2.639 (—*—) 

+ -+ + +- 

Pooled StDev = 2.551 10.0 15.0 20.0 25.0 


Fisher’s pairwise comparisons 


Family error rate = 0.192 
Individual error rate = 0.0500 


Critical value = 2.086 


Intervals for (column level mean) — (row level mean) 


5 10 15 
10 —8.739 
—2.594 
15 — 10.072 —4.406 
—3.928 1.739 
20 — 14.239 —8.572 —7.239 
—8.094 —2.428 —1.094 


Equation 13-11 is used to calculate the 95% CIs shown graphically in the Minitab output of 
Table 13-5. For example, at 20% hardwood the point estimate of the mean is yy. = 21.167, 
MS; = 6.51, and to 025,29 = 2.086, so the 95% CI is 

[Ya. © t0.025,20V MS;/n| 

[21.167 + (2.086) V6.51/6] 
or 

19.00 psi = py = 23.34 psi 


It can also be interesting to find confidence intervals on the difference in two treatment means, 
say, L; — pj; The point estimator of w; — y; is Y; — Y}., and the variance of this estimator is 


Confidence 
Interval on a 
Difference in 

Treatment 
Means 


Computing 
Formulas for 
ANOVA: Single 
Factor with 
Unequal 
Sample Sizes 
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Now if we use MS; to estimate 0”, 


= Fe = (uy =) 
V2MSr, n 


Y,. 


T= 


has a ¢ distribution with a(n — 1) degrees of freedom. Therefore, a CI on p; — p, may be 
based on the ¢ distribution. 


A 100(1 — a) percent confidence interval on the difference in two treatment means 
i — Bj 1S 
= = 2MS; = 7 2MS; 
Vir ~ Vir ty/2,a(n—-1) n S pj — Bj = yj. ~ Jj- ar Ly /2,a(n-1) n 
(13-12) 


A 95% Cl on the difference in means 4, — 2 1s computed from Equation 13-12 as follows: 


[V3 — Yo. = to.025,20 W2MS;/n] 
[17.00 — 15.67 + (2.086)V2(6.51)/6| 


or 


—1.74 S p3 — po = 4.40 


Since the CI includes zero, we would conclude that there is no difference in mean tensile 
strength at these two particular hardwood levels. 

The bottom portion of the computer output in Table 13-5 provides additional information con- 
cerning which specific means are different. We will discuss this in more detail in Section 13-2.3. 


An Unbalanced Experiment 

In some single-factor experiments, the number of observations taken under each treatment 
may be different. We then say that the design is unbalanced. In this situation, slight 
modifications must be made in the sums of squares formulas. Let n; observations be taken 
under treatment i (i = 1, 2,..., a), and let the total number of observations N = >¥_, n;. 
The computational formulas for SS; and SSy,eatments 2 aS Shown in the following definition. 


The sums of squares computing formulas for the ANOVA with unequal sample sizes 
n; 1n each treatment are 


a n; DB 
i y.. 
SSs= > DF - ms (13-13) 
= jl 
a 2 2 
Jie y.. 
zn ae 13-14 
SS Treatments = Nn; N ( 3 ) 
and 
SS a SS a SSrestrente (13-15) 
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Choosing a balanced design has two important advantages. First, the ANOVA is relatively 
insensitive to small departures from the assumption of equality of variances if the sample sizes 
are equal. This is not the case for unequal sample sizes. Second, the power of the test is max- 
imized if the samples are of equal size. 


13-2.3 Multiple Comparisons Following the ANOVA 


Least 
Significant 
Difference for 
Multiple 
Comparisons 


EXAMPLE 13-2 


When the null hypothesis Hp: Tt; = tT) = ++: = T, = O is rejected in the ANOVA, we know 
that some of the treatment or factor level means are different. However, the ANOVA 
doesn’t identify which means are different. Methods for investigating this issue are called 
multiple comparisons methods. Many of these procedures are available. Here we 
describe a very simple one, Fisher’s least significant difference (LSD) method and a 
graphical method. Montgomery (2009) presents these and other methods and provides a 
comparative discussion. 

The Fisher LSD method compares all pairs of means with the null hypotheses Ho: bw; = w; 
(for all i # j) using the f-statistic 


ae Yj. 


ty = ——== 
. 2MS; 
n 


Assuming a two-sided alternative hypothesis, the pair of means 1; and ; would be declared 
significantly different if 


where LSD, the least significant difference, is 


[2MS, : 
LSD = Ly/2,a(n-l1) iii ( 16) 


If the sample sizes are different in each treatment, the LSD is defined as 


1 1 
LSD —_ a/2,N—a MS¢ (3 + x) 


We will apply the Fisher LSD method to the hardwood con- ages that differs by more than 3.07 implies that the correspon- 
centration experiment. There are a = 4 means, n = 6, MS; = ding pair of treatment means are different. 
6.51, and t0.9959 = 2.086. The treatment means are The comparisons among the observed treatment averages 
are as follows: 

y,. = 10.00 psi 

Jo. = 15.67 psi 4vs.1 = 21.17 — 10.00 = 11.17 > 3.07 

¥3. = 17.00 psi 4vs.2 = 21.17 — 15.67 = 5.50 > 3.07 

¥4. = 21.17 psi 4vs.3 = 21.17 — 17.00 = 4.17 > 3.07 

3 vs. 1 = 17.00 — 10.00 = 7.00 > 3.07 

The value of LSD is LSD = to 95 .9V2MS;/n = 2.086 3 vs. 2 = 17.00 — 15.67 = 1.33 < 3.07 
V2(6.51)/6 = 3.07. Therefore, any pair of treatment aver- 2vs.1 = 15.67 — 10.00 = 5.67 > 3.07 
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5% 10% 15% 20% 
° ee ° 
0 5 10 15 20 25 psi 


Figure 13-2 Results of Fisher’s LSD method in Example 13-2. 


Conclusions: From this analysis, we see that there are sig- It is often helpful to draw a graph of the treatment means, such 
nificant differences between all pairs of means except 2 and 3. as in Fig. 13-2, with the means that are not different under- 
This implies that 10% and 15% hardwood concentration pro- lined. This graph clearly reveals the results of the experiment 
duce approximately the same tensile strength and that all other and shows that 20% hardwood produces the maximum tensile 
concentration levels tested produce different tensile strengths. strength. 


The Minitab output in Table 13-5 shows the Fisher LSD method under the heading 
“Fisher’s pairwise comparisons.” The critical value reported is actually the value of fo,925.99 = 
2.086. Minitab implements Fisher’s LSD method by computing confidence intervals on all 
pairs of treatment means using Equation 13-12. The lower and upper 95% confidence limits 
are shown at the bottom of the table. Notice that the only pair of means for which the 
confidence interval includes zero is for p49 and p45. This implies that jr4) and p45 are not 
significantly different, the same result found in Example 13-2. 

Table 13-5 also provides a “family error rate,” equal to 0.192 in this example. When all 
possible pairs of means are tested, the probability of at least one type I error can be much 
greater than for a single test. We can interpret the family error rate as follows. The probability 
is 1 — 0.192 = 0.808 that there are no type I errors in the six comparisons. The family error 
rate in Table 13-5 is based on the distribution of the range of the sample means. See 
Montgomery (2009) for details. Alternatively, Minitab permits you to specify a family error 
rate and will then calculate an individual error rate for each comparison. 


Graphical Comparison of Means 

It is easy to compare treatment means graphically, following the analysis of variance. Suppose 
that the factor has a levels and that y,., ¥>., ..., Y,. are the observed averages for these factor 
levels. Each treatment average has standard deviation o/ Vn, where o is the standard deviation 
of an individual observation. If all treatment means are equal, the observed means }j,. would 
behave as if they were a set of observations drawn at random from a normal distribution with 
mean yw and standard deviation o/ Vn. 

Visualize this normal distribution capable of being slid along an axis below which the 
treatment means y,., 2., ..., Y,. are plotted. If all treatment means are equal, there should be 
some position for this distribution that makes it obvious that the y,. values were drawn from the 
same distribution. If this is not the case, the y;. values that do not appear to have been drawn 
from this distribution are associated with treatments that produce different mean responses. 

The only flaw in this logic is that o is unknown. However, we can use V MS, from the 
analysis of variance to estimate o. This implies that a ¢ distribution should be used instead of 
the normal in making the plot, but since the t looks so much like the normal, sketching a nor- 
mal curve that is approximately 6 MS,/n units wide will usually work very well. 

Figure 13-3 shows this arrangement for the hardwood concentration experiment in 
Example 13-1. The standard deviation of this normal distribution is 


V MS;p/n = V6.51/6 = 1.04 


If we visualize sliding this distribution along the horizontal axis, we note that there is no location 
for the distribution that would suggest that all four observations (the plotted means) are typical, 
randomly selected values from that distribution. This, of course, should be expected, because the 
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o/Vn = 1.04 


1 2 3 4 
e ee 


0 5 10 15 20 25 30 


Figure 13-3 Tensile strength averages from the hardwood concentration 
experiment in relation to a normal distribution with standard deviation 


V/MS,/n = V6.51/6 = 1.04. 


analysis of variance has indicated that the means differ, and the display in Fig. 13-3 is just a graph- 
ical representation of the analysis of variance results. The figure does indicate that treatment 4 
(20% hardwood) produces paper with higher mean tensile strength than do the other treatments, 
and treatment 1 (5% hardwood) results in lower mean tensile strength than do the other treat- 
ments. The means of treatments 2 and 3 (10% and 15% hardwood, respectively) do not differ. 

This simple procedure is a rough but very effective multiple comparison technique. It 
works well in many situations. 


13-2.4 Residual Analysis and Model Checking 


The analysis of variance assumes that the observations are normally and independently dis- 
tributed with the same variance for each treatment or factor level. These assumptions should 
be checked by examining the residuals. A residual is the difference between an observation yj 
and its estimated (or fitted) value from the statistical model being studied, denoted as y,;. For 
the completely randomized design y,; = y;. and each residual is e; = yj — y;., that is, the dif- 
ference between an observation and the corresponding observed treatment mean. The residuals 
for the paper tensile strength experiment are shown in Table 13-6. Using y,. to calculate each 
residual essentially removes the effect of hardwood concentration from the data; consequently, 
the residuals contain information about unexplained variability. 

The normality assumption can be checked by constructing a normal probability plot of 
the residuals. To check the assumption of equal variances at each factor level, plot the resid- 
uals against the factor levels and compare the spread in the residuals. It is also useful to plot 
the residuals against y;. (sometimes called the fitted value); the variability in the residuals 
should not depend in any way on the value of y;. Most statistical software packages will 
construct these plots on request. When a pattern appears in these plots, it usually suggests the 
need for a transformation, that is, analyzing the data in a different metric. For example, if the 
variability in the residuals increases with y;., a transformation such as log y or Vy should 
be considered. In some problems, the dependency of residual scatter on the observed mean y,. 
is very important information. It may be desirable to select the factor level that results in max- 
imum response; however, this level may also cause more variation in response from run to run. 


Table 13-6 Residuals for the Tensile Strength Experiment 


Hardwood 
Concentration (%) Residuals 
5 —3.00 —2.00 5.00 1.00 —1.00 0.00 
10 —3.67 1.33 —2.67 2.33 3.33 —0.67 
15 —3.00 1.00 2.00 0.00 —1.00 1.00 


20 —2.17 3.83 0.83 1.83 —3.17 —1.17 
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The independence assumption can be checked by plotting the residuals against the time 
or run order in which the experiment was performed. A pattern in this plot, such as sequences 
of positive and negative residuals, may indicate that the observations are not independent. 
This suggests that time or run order is important or that variables that change over time are 
important and have not been included in the experimental design. 

A normal probability plot of the residuals from the paper tensile strength experiment is 
shown in Fig. 13-4. Figures 13-5 and 13-6 present the residuals plotted against the factor 
levels and the fitted value y,. respectively. These plots do not reveal any model inadequacy or 


unusual problem with the assumptions. 


13-2.5 Determining Sample Size 


In any experimental design problem, the choice of the sample size or number of replicates to 
use is important. Operating characteristic curves can be used to provide guidance in making 
this selection. Recall that an operating characteristic curve is a plot of the probability of a type II 
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Figure 13-4 Normal probability plot of residuals from 
the hardwood concentration experiment. 
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Figure 13-5 Plot of residuals versus factor levels 
(hardwood concentration). 
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Figure 13-6 Plot of residuals versus y,, 
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error (B) for various sample sizes against values of the parameters under test. The operating 
characteristic curves can be used to determine how many replicates are required to achieve 
adequate sensitivity. 

The power of the ANOVA test is 


1 — B = P{Reject Hy | Hp is false} 
= P{F > fo,a—1,a(n—1) | Ao is false} (13-17) 


To evaluate this probability statement, we need to know the distribution of the test statistic Fo if 
the null hypothesis is false. Because ANOVA compares several means, the null hypothesis can be 
false in different ways. For example, possibly +; > 0, 7, = 0, 73 < 0, and so forth. It can be shown 
that the power for ANOVA in Equation 13-17 depends on the 7,’s only through the function 


a 
no 
ao 7 

Therefore, alternative hypotheses for the 7;’s can be used to calculate ©? and this in turn can be 
used to calculate the power. Specifically, it can be shown that if Ho is false, the statistic 
Fo = MStreatments/ MS has a noncentral F distribution, with a — 1 and n(a — 1) degrees of 
freedom and a noncentrality parameter that depends on ©”. Instead of tables for the noncentral 
F distribution, operating characteristic curves are used to evaluate B defined in Equation 13-17. 
These curves plot B against ®. 

Curves are available for a = 0.05 and a = 0.01 and for several values of the number of 
degrees of freedom for numerator (denoted v,) and denominator (denoted vz). Figure 13-7 
gives representative O.C. curves, one for a = 4 (vy, = 3) and one for a = 5 (1, = 4) treat- 
ments. Notice that for each value of a there are curves for a = 0.05 and a = 0.01. 

In using the operating curves, we must define the difference in means that we wish to detect 
in terms of >/_,77. Also, the error variance o7 is usually unknown. In such cases, we 
must choose ratios of >/_, 17/07 that we wish to detect. Alternatively, if an estimate of e 
is available, one may replace o” with this estimate. For example, if we were interested in 
the sensitivity of an experiment that has already been performed, we might use MS; as the es- 
timate of 0”. 


EXAMPLE 13-3 

Suppose that five means are being compared in a completely 
randomized experiment with a = 0.01. The experimenter 
would like to know how many replicates to run if it is im- 
portant to reject Hy with probability at least 0.90 if 
Sy 7/0” = 5.0. The parameter ®? is, in this case, 


a 
n> 
i=1 


ao” 5 


cp2 


and for the operating characteristic curve with vy, = a — 1 = 
5 — 1 =4, and v, = a(n — 1) = 5(n — 1) error degrees of 
freedom refer to the lower curve in Figure 13-7. As a first 
guess, try n = 4 replicates. This yields ®* = 4, @ = 2, and 
v, = 5(3) = 15 error degrees of freedom. Consequently, from 


Figure 13-7, we find that B = 0.38. Therefore, the power of the 
test is approximately 1 — B = 1 — 0.38 = 0.62, which is 
less than the required 0.90, and so we conclude that n = 4 
replicates is not sufficient. Proceeding in a similar manner, we 
can construct the following table: 


n @& ® a(n — 1) B Power = (1 — B) 
4 4 2.00 15 0.38 0.62 
5 5 2.24 20 0.18 0.82 
6 6 2.45 25 0.06 0.94 


Conclusions: At least n = 6 replicates must be run in 
order to obtain a test with the required power. 
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Figure 13-7 Two Operating Characteristic curves for the fixed-effects model analysis of variance. 
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EXERCISES FOR SECTION 13-2 
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13-1. Consider the computer output below. 


One-way ANOVA: y versus Factor 


Source DF SS MS F P 
Factor 2? 117.4 39.1 ? ? 
Error 16 396.8 ? 

Total 19 514.2 


(a) How many levels of the factor were used in this ex- 
periment? 

(b) How many replicates did the experimenter use? 

(c) Fill in the missing information in the ANOVA table. Use 
bounds for the P-value. 

(d) What conclusions can you draw about differences in the 
factor level means? 


13-2. An article in Nature describes an experiment to in- 
vestigate the effect on consuming chocolate on cardiovascu- 
lar health (“Plasma Antioxidants from Chocolate,” Vol. 424, 
2003, pp. 1013). The experiment consisted of using three dif- 
ferent types of chocolates: 100 g of dark chocolate, 100 g of 
dark chocolate with 200 ml of full-fat milk, and 200 g of 
milk chocolate. Twelve subjects were used, seven women and 
five men, with an average age range of 32.2 + 1 years, an 


Cotton Observations 
Percentage 1 2 3 4 5 
15 Z 7 15 11 9 
20 12 17 12 18 18 
25 14 18 18 19 19 
30 19 25 22 19 23 
35 7 10 11 15 ll 


(a) Does cotton percentage affect breaking strength? Draw 
comparative box plots and perform an analysis of vari- 
ance. Use a = 0.05. 

(b) Plot average tensile strength against cotton percentage 
and interpret the results. 

(c) Analyze the residuals and comment on model adequacy. 


13-4. In “Orthogonal Design for Process Optimization and Its 
Application to Plasma Etching” (Solid State Technology, May 
1987), G. Z. Yin and D. W. Jillie describe an experiment to de- 
termine the effect of CF, flow rate on the uniformity of the etch 
on a silicon wafer used in integrated circuit manufacturing. 
Three flow rates are used in the experiment, and the resulting 
uniformity (in percent) for six replicates is shown below. 


average weight of 65.8 + 3.1 kg, and body-mass index of C,F, Flow Observations 
21.9 + 0.4kg m ~~. On different days, a subject consumed (SCCM) 1 2 B 4 5 6 
one of the chocolate-factor levels, and one hour later the 
total antioxidant capacity of their blood plasma was meas- = | 4.6 2.6 3.0 = 3.8 
ured in an assay. Data similar to those summarized in the 160 4.9 4.6 5.0 4.2 3.6 4.2 
article are shown below. 200 4.6 3.4 2.9 3.5 4.1 5.1 
Factor Subjects (Observations) 
1 2 3 4 5 6 7 8 9 10 11 12 
DC 118.8 1226 1156 1136 1195 115.9 115.8 115.1 1169 1154 115.6 107.9 
DC+MK 105.4 101.1 102.7 97.1 101.9 98.9 100.0 99.8 1026 1009 104.5 O35) 
MC 102.1 105.8 99.6 102.7 98.8 100.9 102.8 98.7 94.7 97.8 99.7 98.6 


(a) Construct comparative box plots and study the data. What 
visual impression do you have from examining these 
plots? 

(b) Analyze the experimental data using an ANOVA. If 
a = 0.05, what conclusions would you draw? What 
would you conclude if « = 0.01? 

(c) Is there evidence that the dark chocolate increases the 
mean antioxidant capacity of the subjects’ blood plasma? 

(d) Analyze the residuals from this experiment. 


13-3. In Design and Analysis of Experiments, 7th edition 
(John Wiley & Sons, 2009) D. C. Montgomery described an 
experiment in which the tensile strength of a synthetic fiber 
was of interest to the manufacturer. It is suspected that 
strength is related to the percentage of cotton in the fiber. Five 
levels of cotton percentage were used, and five replicates were 
run in random order, resulting in the data below. 


(a) Does CF, flow rate affect etch uniformity? Construct box 
plots to compare the factor levels and perform the analysis 
of variance. Use a = 0.05. 

(b) Do the residuals indicate any problems with the underly- 
ing assumptions? 

13-5. The compressive strength of concrete is being stud- 

ied, and four different mixing techniques are being investi- 

gated. The following data have been collected. 


Mixing 
Technique Compressive Strength (psi) 
1 3129 3000 2865 2890 
2 3200 3300 2975 3150 
3 2800 2900 2985 3050 
4 2600 2700 2600 2765 


13-2 COMPLETELY RANDOMIZED SINGLE-FACTOR EXPERIMENT 


(a) Test the hypothesis that mixing techniques affect the 
strength of the concrete. Use a = 0.05. 

(b) Find the P-value for the F-statistic computed in part (a). 

(c) Analyze the residuals from this experiment. 

13-6. The response time in milliseconds was determined for 

three different types of circuits in an electronic calculator. The 

results are recorded here. 


Circuit 
Type Response 
1 19 2) 20 18 D> 
2 20 21 33 27 40 
3 16 15 18 26 17 


(a) Using a = 0.01, test the hypothesis that the three circuit 
types have the same response time. 

(b) Analyze the residuals from this experiment. 

(c) Find a 95% confidence interval on the response time for 
circuit three. 


13-7. An electronics engineer is interested in the effect on 
tube conductivity of five different types of coating for cathode 
ray tubes in a telecommunications system display device. The 
following conductivity data are obtained. 


Coating 
Type Conductivity 
1 143 141 150 146 
2 152 149 137 143 
3 134 133 132 127 
4 129 127 132 129 
5 147 148 144 142 


(a) Is there any difference in conductivity due to coating 
type? Use a = 0.01. 

(b) Analyze the residuals from this experiment. 

(c) Construct a 95% interval estimate of the coating type 1 
mean. Construct a 99% interval estimate of the mean dif- 
ference between coating types 1 and 4. 


13-8. An article in Environment International (Vol. 18, 
No. 4, 1992) described an experiment in which the amount of 
radon released in showers was investigated. Radon-enriched 
water was used in the experiment, and six different orifice 
diameters were tested in shower heads. The data from the 
experiment are shown in the following table. 


Orifice 
Diameter Radon Released (%) 
0.37 80 83 83 85 
0.51 75 75 79 79 
0.71 74 1B 76 Wil 
1.02 67 72 74 74 
1.40 62 62 67 69 
1.99 60 61 64 66 
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(a) Does the size of the orifice affect the mean percentage of 
radon released? Use a = 0.05. 

(b) Find the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment. 

(d) Find a 95% confidence interval on the mean percent of 
radon released when the orifice diameter is 1.40. 


13-9. An article in the ACI Materials Journal (Vol. 84, 
1987, pp. 213-216) described several experiments investi- 
gating the rodding of concrete to remove entrapped air. A 
3-inch X 6-inch cylinder was used, and the number of times 
this rod was used is the design variable. The resulting com- 
pressive strength of the concrete specimen is the response. 
The data are shown in the following table. 


Rodding 
Level Compressive Strength 
10 1530 1530 1440 
15 1610 1650 1500 
20 1560 1730 1530 
25 1500 1490 1510 


(a) Is there any difference in compressive strength due to the 
rodding level? 

(b) Find the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment. What con- 
clusions can you draw about the underlying model 
assumptions? 

13-10. An article in the Materials Research Bulletin (Vol. 26, 
No. 11, 1991) investigated four different methods of prepar- 
ing the superconducting compound PbMo,S,. The authors 
contend that the presence of oxygen during the preparation 
process affects the material’s superconducting transition 
temperature T.. Preparation methods 1 and 2 use techniques 
that are designed to eliminate the presence of oxygen, while 
methods 3 and 4 allow oxygen to be present. Five observations 
on T, (in °K) were made for each method, and the results are 
as follows: 


Preparation 
Method Transition Temperature T.(°K) 
1 14.8 14.8 14.7 14.8 14.9 
2 14.6 15.0 14.9 14.8 14.7 
3 12.7 11.6 12.4 27 12.1 
4 14.2 14.4 14.4 12.2 11.7 


(a) Is there evidence to support the claim that the presence of 
oxygen during preparation affects the mean transition 
temperature? Use a = 0.05. 

(b) What is the P-value for the F-test in part (a)? 

(c) Analyze the residuals from this experiment. 

(d) Find a 95% confidence interval on mean 7. when method 
1 is used to prepare the material. 


SS 


i) 
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13-11. A paper in the Journal of the Association of Asphalt 
Paving Technologists (Vol. 59, 1990) describes an experiment 
to determine the effect of air voids on percentage retained 
strength of asphalt. For purposes of the experiment, air voids 
are controlled at three levels; low (2-4%), medium (4—6%), 
and high (6-8%). The data are shown in the following table. 


Air Voids Retained Strength (%) 

Low 106 90 103 90 79 88 92 95 
Medium 80 69 94 91 70 83 87 83 
High 78 80 62 69 76 85 69 85 


(a) Do the different levels of air voids significantly affect 
mean retained strength? Use a = 0.01. 
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(a) Is there a difference in the cross-linker level? Draw com- 
parative boxplots and perform an analysis of variance. Use 
a = 0.05. 

(b) Find the P-value of the test. Estimate the variability due to 
random error. 

(c) Plot average domain spacing against cross-linker level and 
interpret the results. 

(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-13. In the book Analysis of Longitudinal Data, 2nd ed., 
(2002, Oxford University Press), by Diggle, Heagerty, Liang, 
and Zeger, the authors analyzed the effects of three diets on the 
protein content of cow’s milk. The data shown here were col- 
lected after one week and include 25 cows on the barley diet, 
and 27 cows each on the other two diets: 


Diet Protein Content of Cow’s Milk 

Barley 3.63 3.24 3.98 3.66 434 436 4.17 44 3.4 3.75 42 402 402 3.9 
Barley + lupins 3.38 3.8 417 459 407 432 3.56 3.67 4.15 3.51 42 412 3.52 4.08 
Lupins 3.69 4.2 3.31 3.13 3.73 432 3.04 384 398 418 42 4.1 3.25 3.34 
Diet (continued) 

Barley 3.81 3.62 3.66 444 423 3.82 3.53 447 3.93 3.27 3.3 

Barley +lupins 402 3.18 4.11 3.27 3.27 3.97 3.31 412 3.92 3.78 4 437 3.79 

Lupins 35) 413 3.21 3.9 3.5 41 2.69 4.3 4.06 3.88 4 3.67 4.27 


(b) Find the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment. 

(d) Find a 95% confidence interval on mean retained strength 
where there is a high level of air voids. 

(e) Find a 95% confidence interval on the difference 
in mean retained strength at the low and high levels of 
air voids. 


13-12. An article in Quality Engineering (“Estimating 
Sources of Variation: A Case Study from Polyurethane 
Product Research” (1999-2000, Vol. 12, pp. 89-96)] studied 
the effects of additives on final polymer properties. In this case, 
polyurethane additives were referred to as cross-linkers. The 
average domain spacing was the measurement of the polymer 
property. The data are as follows: 


Cross-Linker 


Level Domain Spacing (nm) 
=i 8.2 8 2 79) Rll 8 
—0.75 8.3 8.4 8.3 8.2 8.3 8.1 
=0)5) 89 87 89 84 83 85 
0 8.5 87 87 87 88 88 
0.5 88 91 #90 87 89 85 
1 8.6 85 86 87 88 88 


(a) Does diet affect the protein content of cow’s milk? Draw 
comparative boxplots and perform an analysis of variance. 
Use a = 0.05. 

(b) Find the P-value of the test. Estimate the variability due to 
random error. 

(c) Plot average protein content against diets and interpret the 
results. 

(d) Analyze the residuals and comment on model adequacy. 

13-14. An article in Journal of Food Science (2001, Vol. 66, 

No. 3, pp. 472-477) studied potato spoilage based on different 

conditions of acidified oxine (AO), which is a mixture of chlo- 

rite and chlorine dioxide. The data are shown below: 


AO Solution 
(ppm) % Spoilage 
50 100 50 60 
100 60 30 30 
200 60 50 29 
400 25 30 15 


(a) Do the AO solutions differ in the spoilage percentage? Use 
a = 0.05. 

(b) Find the P-value of the test. Estimate the variability due to 
random error. 


= 


pe) 


i) 
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(c) Plot average spoilage against AO solution and interpret the 
results. Which AO solution would you recommend for use 
in practice? 

(d) Analyze the residuals from this experiment. 

13-15. An experiment was run to determine whether four 


specific firing temperatures affect the density of a certain type 
of brick. The experiment led to the following data. 


Temperature 

(°F) Density 

100 Dig PDN) Wile Pil — iL PALS) ILL} 
125 21.7 214 215 215 — — — 
150 21.9 21.8 218 216 215 — — 
175 21.9 21.7) 21.8 21.7) 216 218 — 


(a) Does the firing temperature affect the density of the 
bricks? Use a = 0.05. 

(b) Find the P-value for the F-statistic computed in part (a). 

(c) Analyze the residuals from the experiment. 


13-16. (a) Use Fisher’s LSD method with a = 0.05 to analyze 

the means of the three types of chocolate in Exercise 13-4. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 


13-17. (a) Use Fisher’s LSD method with a = 0.05 to ana- 

lyze the means of the five different levels of cotton content in 

Exercise 13-3. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 


13-18. (a) Use Fisher’s LSD method with a = 0.01 to ana- 

lyze the mean response times for the three circuits described in 

Exercise 13-6. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 


13-19. (a) Use Fisher’s LSD method with a = 0.05 to 

analyze the mean compressive strength of the four mixing 

techniques in Exercise 13-5. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 
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13-20. (a) Use Fisher’s LSD method with a = 0.05 to 
analyze the mean amounts of radon released in the experiment 
described in Exercise 13-8. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 

13-21. (a) Use Fisher’s LSD method with a = 0.01 to analyze 

the five means for the coating types described in Exercise 13-7. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 

13-22. (a) Apply Fisher’s LSD method with a = 0.05 to the 

superconducting material experiment described in Exercise 

13-10. Which preparation methods differ? 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 

13-23. (a) Apply Fisher’s LSD method to the air void exper- 

iment described in Exercise 13-11. Using a = 0.05, which 

treatment means are different? 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 

13-24. (a) Apply Fisher’s LSD method to the domain spac- 

ing data in Exercise 13-12. Which cross-linker levels differ? 

Use a = 0.05. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 

13-25. (a) Apply Fisher’s LSD method to the data on protein 

content of milk in Exercise 13-13. Which diets differ? Use 

a = 0.01. 

(b) Use the graphical method to compare means described in 
this section and compare your conclusions to those from 
Fisher’s LSD method. 

13-26. Suppose that four normal populations have common 

variance o? = 25 and means 1; = 50, 2 = 60, 3 = 50, and 

144 = 60. How many observations should be taken on each 

population so that the probability of rejecting the hypothesis of 

equality of means is at least 0.90? Use a = 0.05. 

13-27. Suppose that five normal populations have common 

variance o* = 100 and means 1, = 175, p = 190, ps = 160, 

44 = 200, and 5 = 215. How many observations per popula- 

tion must be taken so that the probability of rejecting the 

hypothesis of equality of means is at least 0.95? Use a = 0.01. 


In many situations, the factor of interest has a large number of possible levels. The analyst is 
interested in drawing conclusions about the entire population of factor levels. If the experimenter 
randomly selects a of these levels from the population of factor levels, we say that the factor is a 


@ 
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random factor. Because the levels of the factor actually used in the experiment were chosen ran- 
domly, the conclusions reached will be valid for the entire population of factor levels. We will 
assume that the population of factor levels is either of infinite size or is large enough to be con- 
sidered infinite. Notice that this is a very different situation than we encountered in the fixed- 
effects case, where the conclusions apply only for the factor levels used in the experiment. 


13-3.2 ANOVA and Variance Components 


Expected Values of 
Mean Squares: 
Random Effects 


The linear statistical model is 


_ i=1,2,...,a 
Yr mtatey tip (13-18) 
where the treatment effects 1; and the errors €,; are independent random variables. Note that 
the model is identical in structure to the fixed-effects case, but the parameters have a different 
interpretation. If the variance of the treatment effects 7; is 02, by independence the variance of 
the response is 


VY,) = 02+ 0° (13-19) 


The variances o? and o” are called variance components, and the model, Equation 13-19, is 
called the components of variance model or the random-effects model. To test hypotheses 
in this model, we assume that the errors €,, are normally and independently distributed with 
mean 0 and variance o? and that the treatment effects t; are normally and independently dis- 
tributed with mean zero and variance o2.* 

For the random-effects model, testing the hypothesis that the individual treatment effects 
are zero is meaningless. It is more appropriate to test hypotheses about o2. Specifically, 


Ho: 02 = 0 
H,:02>0 


If o = 0, all treatments are identical; but if o > 0, there is variability between treatments. 
The ANOVA decomposition of total variability is still valid; that is, 


SS = SSTyeatments + SSz (13-20) 


However, the expected values of the mean squares for treatments and error are somewhat 
different than in the fixed-effects case. 


In the random-effects model for a single-factor, completely randomized experiment, 
the expected mean square for treatments is 


SS 
E(MS treatments) = 15 Gane 


a — Il 
= 0° + no (13-21) 
and the expected mean square for error is 
SS5 
E(MS;) = E | ———~ 
(us) = £| 
=e (13-22) 


*The assumption that the {1;} are independent random variables implies that the usual assumption of >7_, 7; = 0 
from the fixed-effects model does not apply to the random-effects model. 


Variance 
Components 
Estimates 
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From examining the expected mean squares, it is clear that both MS; and MS tyeatments 
estimate o? when Hy: o2 = 0 is true. Furthermore, MS, and MS-yeaments af€ independent. 


Consequently, the ratio 


pr, — MS treaments (13-23) 
9 MS; 


is an F random variable with a — 1 and a(n — 1) degrees of freedom when Hp is true. The null 
hypothesis would be rejected at the a-level of significance if the computed value of the test 
statistic fo > foa—1,a(n—1) 

The computational procedure and construction of the ANOVA table for the random- 
effects model are identical to the fixed-effects case. The conclusions, however, are quite dif- 
ferent because they apply to the entire population of treatments. 

Usually, we also want to estimate the variance components (a? and o2) in the model. The 
procedure that we will use to estimate o” and o? is called the analysis of variance method 
because it uses the information in the analysis of variance table. It does not require the nor- 
mality assumption on the observations. The procedure consists of equating the expected 
mean squares to their observed values in the ANOVA table and solving for the variance 
components. When equating observed and expected mean squares in the one-way classifica- 
tion random-effects model, we obtain 


MS treatments = o + no- and MS; = o 


Therefore, the estimators of the variance components are 


Go oS. (13-24) 
and 


sD MS treatments — MS 
Oo, = n 


(13-25) 


Sometimes the analysis of variance method produces a negative estimate of a variance 
component. Since variance components are by definition nonnegative, a negative estimate of 
a variance component is disturbing. One course of action is to accept the estimate and use it as 
evidence that the true value of the variance component is zero, assuming that sampling 
variation led to the negative estimate. While this approach has intuitive appeal, it will disturb 
the statistical properties of other estimates. Another alternative is to reestimate the negative 
variance component with a method that always yields nonnegative estimates. Still another pos- 
sibility is to consider the negative estimate as evidence that the assumed linear model is 
incorrect, requiring that a study of the model and its assumptions be made to find a more 
appropriate model. 


EXAMPLE 13-4 Textile Manufacturing 

In Design and Analysis of Experiments, 7th edition (John 
Wiley, 2009), D. C. Montgomery describes a single-factor 
experiment involving the random-effects model in which a 
textile manufacturing company weaves a fabric on a large 


number of looms. The company is interested in loom-to-loom 
variability in tensile strength. To investigate this variability, a 
manufacturing engineer selects four looms at random and 
makes four strength determinations on fabric samples chosen 
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Table 13-7 Strength Data for Example 13-4 
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Observations 
Loom 1 2 3 4. Total Average Table 13-8 Analysis of Variance for the Strength Data 
1 98 97 99 96 390 97.5 Sourceof Sumof  Degreesof Mean 
2 91 90 93 92 366 91.5 Variation Squares Freedom Square fo P-value 
3 0 GO G7 OS Fe3 95.8 Looms 89.19 3} 29713 15.68 1.88 E-4 
4 95 96 99 98 388 97.0 Error 22.75 12 1.90 
1527 95.45 Total 111.94 15 


at random from each loom. The data are shown in Table 13-7 
and the ANOVA is summarized in Table 13-8. 

From the analysis of variance, we conclude that the 
looms in the plant differ significantly in their ability to pro- 
duce fabric of uniform strength. The variance components are 
estimated by 6? = 1.90 and 
29.73 — 1.90 


er 


any 


G 


Therefore, the variance of strength in the manufacturing 
process is estimated by 


& = 6.96 + 1.90 = 8.86 


x 
a 
| 
on 

i) 


¥ 


Conclusions: Most of the variability in strength in the 
output product is attributable to differences between looms. 


This example illustrates an important application of the analysis of variance—the isola- 
tion of different sources of variability in a manufacturing process. Problems of excessive 
variability in critical functional parameters or properties frequently arise in quality- 
improvement programs. For example, in the previous fabric strength example, the process 
mean is estimated by y = 95.45 psi, and the process standard deviation is estimated by 


Oy = 


V Wy, ij) = V8.86 = 2.98 psi. If strength is approximately normally distributed, the 


distribution of strength in the outgoing product would look like the normal distribution 
shown in Fig. 13-8(a). If the lower specification limit (LSL) on strength is at 90 psi, a sub- 
stantial proportion of the process output is fallout—that is, scrap or defective material that 
must be sold as second quality, and so on. This fallout is directly related to the excess vari- 
ability resulting from differences between looms. Variability in loom performance could be 
caused by faulty setup, poor maintenance, inadequate supervision, poorly trained operators, 
and so forth. The engineer or manager responsible for quality improvement must identify 
and remove these sources of variability from the process. If this can be done, strength vari- 


ability will be greatly reduced, perhaps as low as Gy = 


V& = V1.90 = 1.38 psi, as 


shown in Fig. 13-8(b). In this improved process, reducing the variability in strength has 
greatly reduced the fallout, resulting in lower cost, higher quality, a more satisfied customer, and 
enhanced competitive position for the company. 


Figure 13-8 The ise 

distribution of fabric an 

strength. (a) Current 80 a5 50 a5 ior ue 
process, (b) improved LSL 


process. (a) 


110 psi 80 85 90 95 100 105 
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13-28. An article in the Journal of the Electrochemical 
Society (Vol. 139, No. 2, 1992, pp. 524-532) describes an ex- 
periment to investigate the low-pressure vapor deposition of 
polysilicon. The experiment was carried out in a large-capacity 
reactor at Sematech in Austin, Texas. The reactor has several 
wafer positions, and four of these positions are selected at ran- 
dom. The response variable is film thickness uniformity. Three 
replicates of the experiment were run, and the data are as 
follows: 


Wafer 


Position Uniformity 


1 2.76 5.67 4.49 
2 1.43 1.70 2.19 
3 2.34 1.97 1.47 
4 0.94 1.36 1.65 


(a) Is there a difference in the wafer positions? Use a = 0.05. 

(b) Estimate the variability due to wafer positions. 

(c) Estimate the random error component. 

(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-29. A textile mill has a large number of looms. Each 
loom is supposed to provide the same output of cloth per 
minute. To investigate this assumption, five looms are chosen 
at random, and their output is measured at different times. The 
following data are obtained: 


Output (1b/min) 

1 4.0 4.1 4.2 4.0 41 
2 3.9 3.8 3.9 4.0 4.0 
3 4.1 4.2 41 4.0 3.9 
4 3.6 3.8 4.0 3.9 3.7 
5 3.8 3.6 3.9 3.8 4.0 


(a) Are the looms similar in output? Use a = 0.05. 

(b) Estimate the variability between looms. 

(c) Estimate the experimental error variance. 

(d) Analyze the residuals from this experiment and check for 
model adequacy. 


13-30. In the book Bayesian Inference in Statistical 
Analysis (1973, John Wiley and Sons) by Box and Tiao, the 
total product yield was determined for five samples randomly 
selected from each of six randomly chosen batches of raw 
material. 


Yield (in grams) 

1 1545 1440 1440 1520 1580 
2 1540 1555 1490 1560 1495 
3 1595 1550 1605 1510 1560 
4 1445 1440 1595 1465 1545 
S 

6 


1595 1630 1515 1635 1625 
1520 1455 1450 1480 1445 


(a) Do the different batches of raw material significantly 
affect mean yield? Use a = 0.01. 

(b) Estimate the variability between batches. 

(c) Estimate the variability between samples within batches. 

(d) Analyze the residuals from this experiment and check for 
model adequacy. 


13-31. An article in the Journal of Quality Technology 
(Vol. 13, No. 2, 1981, pp. 111-114) described an experiment 
that investigated the effects of four bleaching chemicals on 
pulp brightness. These four chemicals were selected at ran- 
dom from a large population of potential bleaching agents. 
The data are as follows: 


Chemical Pulp Brightness 
1 77.199 74.466 92.746 76.208 82.876 
2 80.522 79.306 81.914 80.346 73.385 
3 79.417 78.017 91.596 80.802 80.626 
4 78.001 78.358 77.544 77.364 77.386 


(a) Is there a difference in the chemical types? Use a = 0.05. 

(b) Estimate the variability due to chemical types. 

(c) Estimate the variability due to random error. 

(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-32. Consider the vapor-deposition experiment described 

in Exercise 13-28. 

(a) Estimate the total variability in the uniformity response. 
(b) How much of the total variability in the uniformity response 
is due to the difference between positions in the reactor? 

(c) To what level could the variability in the uniformity re- 
sponse be reduced if the position-to-position variability 
in the reactor could be eliminated? Do you believe this is 
a significant reduction? 

13-33. Reconsider Exercise 13-13 in which the effect of dif- 

ferent diets on the protein content of cow’s milk was investigated. 

Suppose that the three diets reported were selected at random 

from a large number of diets. To simplify, delete the last two ob- 

servations in the diets with n = 27 (to make equal sample sizes). 

(a) How does this change the interpretation of the experiment? 

(b) What is an appropriate statistical model for this experiment? 

(c) Estimate the parameters of this model. 
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13-4 RANDOMIZED COMPLETE BLOCK DESIGN 


13-4.1 Design and Statistical Analysis 


In many experimental design problems, it is necessary to design the experiment so that the 
variability arising from a nuisance factor can be controlled. For example, consider the sit- 
uation of Example 10-9, where two different methods were used to predict the shear 
strength of steel plate girders. Because each girder has different strength (potentially), and 
this variability in strength was not of direct interest, we designed the experiment by using 
the two test methods on each girder and then comparing the average difference in strength 
readings on each girder to zero using the paired f-test. The paired t-test is a procedure for 
comparing two treatment means when all experimental runs cannot be made under 
homogeneous conditions. Alternatively, we can view the paired f-test as a method for re- 
ducing the background noise in the experiment by blocking out a nuisance factor effect. 
The block is the nuisance factor, and in this case, the nuisance factor is the actual experi- 
mental unit—the steel girder specimens used in the experiment. 

The randomized block design is an extension of the paired f-test to situations where 
the factor of interest has more than two levels; that is, more than two treatments must be 
compared. For example, suppose that three methods could be used to evaluate the strength 
readings on steel plate girders. We may think of these as three treatments, say f,, t,, and f,. 
If we use four girders as the experimental units, a randomized complete block design (RCBD) 
would appear as shown in Fig. 13-9. The design is called a randomized complete block 
design because each block is large enough to hold all the treatments and because the actual 
assignment of each of the three treatments within each block is done randomly. Once the 
experiment has been conducted, the data are recorded in a table, such as is shown in 
Table 13-9. The observations in this table, say, y,, represent the response obtained when 
method 7 is used on girder j. 

The general procedure for a randomized complete block design consists of selecting b 
blocks and running a complete replicate of the experiment in each block. The data that re- 
sult from running a RCBD for investigating a single factor with a levels and b blocks are 
shown in Table 13-10. There will be a observations (one per factor level) in each block, and 
the order in which these observations are run is randomly assigned within the block. 

We will now describe the statistical analysis for the RCBD. Suppose that a single factor 
with a levels is of interest and that the experiment is run in b blocks. The observations may be 
represented by the linear statistical model 


7 i=1,2,...,a 
Y,=wta+ pte GAD con ab (13-26) 


where y is an overall mean, 7; is the effect of the ith treatment, 8; is the effect of the jth block, 
and €;;is the random error term, which is assumed to be normally and independently distributed 


Block 1 Block 2 Block 3 Block 4 Table 13-9 A Randomized Complete Block Design 
i z ty t ; 
; Treatments Block (Girder) 
to to to to (Method) ‘ ; : Fi 
t3 t3 t3 t3 1 Mili Yi2 J13 V4 
2 y y y y 
Figure 13-9 A randomized complete 3 a a ae 1 


block design. 


ANOVA Sums of 
Squares Identity: 
Randomized Block 
Experiment 
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Table 13-10 A Randomized Complete Block Design with a Treatments and b Blocks 


Blocks 
Treatments 1 2, ae b Totals Averages 
1 Yu V2 ov Vip M- ae 
2 Yai Y22 tee Yap y2- Y2- 
a Yai Ya2 ras ab Va. Ya 
Totals yey yen Bee y.p y.. 
Averages yey y.o dite Yep Yiz 


with mean zero and variance o”. Furthermore, the treatment and block effects are defined as de- 
viations from the overall mean, so >‘, 7; = 0 and ee 8; = 0. This was the same type of 
definition used for completely randomized experiments in Section 13-2. We also assume that 
treatments and blocks do not interact. That is, the effect of treatment i is the same regardless of 
which block (or blocks) it is tested in. We are interested in testing the equality of the treatment 
effects. That is, 


Ho: Ty] 72 las Tq 0 


H,: tT; # 0 at least one i 


The analysis of variance can be extended to the RCBD. The procedure uses a sum of 
squares identity that partitions the total sum of squares into three components. 


The sum of squares identity for the randomized complete block design is 


b 
2 Si 7 Se ah eS (y=) 


= i=l j=l 


b 
i = wee (13-27) 


a 
> 


or symbolically 


SS 7 SS Treatments ar SSplocks ar SS 


Furthermore, the degrees of freedom corresponding to these sums of squares are 
ab—-—1=(a-—1)+(b-1)+(a- 1)\(b-1) 


For the randomized block design, the relevant mean squares are 


ea: 
an 
MSplocks = b oT 
SS 

MS; E 
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The expected values of these mean squares can be shown to be as follows: 


Expected Mean 
Squares: a 
Randomized Block b » 7 
Experiment E(MS treatments) = o oP aL 


Therefore, if the null hypothesis A) is true so that all treatment effects Tt; = 0, MStyeatments 18 an 
unbiased estimator of o, while if Ho is false, MStyeatmems OVerestimates o7. The mean square 
for error is always an unbiased estimate of o7. To test the null hypothesis that the treatment 
effects are all zero, we use the ratio 


MS Treatments 
0 = MS, (13-28) 
which has an F-distribution with a — 1 and (a — 1)(b — 1) degrees of freedom if the null 
hypothesis is true. We would reject the null hypothesis at the a-level of significance if the 
computed value of the test statistic in Equation 13-28 is fo > foa—1ya—1(b-1)« 

In practice, we compute SS7, SSqreatments ANd SSpiocxs and then obtain the error sum of 
squares SS, by subtraction. The appropriate computing formulas are as follows. 


Computing 
Formulas for The computing formulas for the sums of squares in the analysis of variance for a ran- 
ANOVA: domized complete block design are 
Randomized 
Block @ tt 5 ye 
Experiment SS; = > i on (13-29) 
i=l j=l 
eee ye 
SStreatments = B a y}. a hh (13-30) 
ieee ye 
SSpincks = = 2 13-31 
Block: a > wy ‘i ab ( ) 
and 
SS i SS a SS'Treatments = SSBocks (13-32) 


The computations are usually arranged in an ANOVA table, such as is shown in Table 13-11. 
Generally, a computer software package will be used to perform the analysis of variance for 
the randomized complete block design. 
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Table 13-11 ANOVA for a Randomized Complete Block Design 


Source of Degrees of 
Variation Sum of Squares Freedom Mean Square Fy 
Treatments SSiyentnents a-1 SStreatments MSrreatments 
a-1l MS; 
SS 
Blocks eee b-1 a , 
: SS 
Error SSz (by subtraction) (a — 1)(b — 1) Geen) 
Total SSp ab —1 
EXAMPLE 13-5 Fabric Strength 
An experiment was performed to determine the effect of four 5 yg y2 
different chemicals on the strength of a fabric. These chemi- SSplocks = a = prs 
cals are used as part of the permanent press finishing process. Jl 
Five fabric samples were selected, and a RCBD was run by (9.2)? + (10.1)? + (3.5)? + (8.8)? + (7.6)? 
testing each chemical type once in random order on each fab- = 4 
ric sample. The data are shown in Table 13-12. We will test for d 
differences in means using an ANOVA with a = 0.01. oe (39.2) = 6.69 
The sums of squares for the analysis of variance are 20 
computed as follows: SSz = SSr — SSpiocks — SStreatments 
a y2 = 25.69 — 6.69 — 18.04 = 0.96 
tld The ANOVA is summarized in Table 13-13. Since fy = 75.13 > 
; i , (39.2) foo13.12 = 5.95 (the P-value is 4.79 X 107%), we conclude that 
= (1.3) + (1.6) +++ + (3.4) 20 25.69 there is a significant difference in the chemical types so far as 
A 232 2 their effect on strength is concerned. 
SS. = yi Ye 
Treatments a b ab 
(5.7)? + (8.8) + (6.9 + (17.8) 
5 
39,2) 
7 as 8.04 


Table 13-12 Fabric Strength Data—Randomized Complete Block Design 


Treatment Treatment 
Fabric Sample Totals Averages 

Chemical Type 1 2 3 4 3 Yj. yj. 

1 13} 1.6 0.5 1.2 1.1 Sil 1.14 

2 22 2.4 0.4 2.0 1.8 8.8 1.76 

3 1.8 1.7 0.6 1.5 11,3) 6.9 1.38 

4 3.9 4.4 2.0 4.1 3.4 17.8 3.56 
Block totals y.; 9.2 10.1 3\5) 8.8 7.6 39.2(y..) 
Block averages y.; 2.30 DESS 0.88 2.20 1.90 1.96(¥..) 
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Table 13-13 Analysis of Variance for the Randomized Complete Block Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square to P-value 
Chemical types 

(treatments) 18.04 3} 6.01 75.13 4.79 E-8 
Fabric samples 

(blocks) 6.69 4 1.67 

Error 0.96 12 0.08 

Total 25.69 19 


When Is Blocking Necessary? 

Suppose an experiment is conducted as a randomized block design, and blocking was not 
really necessary. There are ab observations and (a — 1)(b — 1) degrees of freedom for 
error. If the experiment had been run as a completely randomized single-factor design 
with b replicates, we would have had a(b — 1) degrees of freedom for error. Therefore, 
blocking has cost a(b — 1) — (a — 1)(b — 1) = b — 1 degrees of freedom for error. 
Thus, since the loss in error degrees of freedom is usually small, if there is a reasonable 
chance that block effects may be important, the experimenter should use the randomized 
block design. 

For example, consider the experiment described in Example 13-5 as a single-factor ex- 
periment with no blocking. We would then have 16 degrees of freedom for error. In the ran- 
domized block design, there are 12 degrees of freedom for error. Therefore, blocking has cost 
only 4 degrees of freedom, which is a very small loss considering the possible gain in infor- 
mation that would be achieved if block effects are really important. The block effect in 
Example 13-5 is large, and if we had not blocked, SSp),¢,, would have been included in the er- 
ror sum of squares for the completely randomized analysis. This would have resulted in a 
much larger MS;, making it more difficult to detect treatment differences. As a general rule, 
when in doubt as to the importance of block effects, the experimenter should block and gam- 
ble that the block effect does exist. If the experimenter is wrong, the slight loss in the degrees 
of freedom for error will have a negligible effect, unless the number of degrees of freedom is 
very small. 


Computer Solution 

Table 13-14 presents the computer output from Minitab for the randomized complete block 
design in Example 13-5. We used the analysis of variance menu for balanced designs to solve 
this problem. The results agree closely with the hand calculations from Table 13-13. 

Notice that Minitab computes an F-statistic for the blocks (the fabric samples). The 
validity of this ratio as a test statistic for the null hypothesis of no block effects is doubtful 
because the blocks represent a restriction on randomization; that is, we have only randomized 
within the blocks. If the blocks are not chosen at random, or if they are not run in random 
order, the F-ratio for blocks may not provide reliable information about block effects. For 
more discussion see Montgomery (2009, Chapter 4). 


13-4.2 Multiple Comparisons 


When the ANOVA indicates that a difference exists between the treatment means, we may 
need to perform some follow-up tests to isolate the specific differences. Any multiple com- 
parison method, such as Fisher’s LSD method, could be used for this purpose. 
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Table 13-14 Minitab Analysis of Variance for the Randomized Complete 
Block Design in Example 13-5 


Analysis of Variance (Balanced Designs) 


Factor Type Levels Values 
Chemical fixed 4 1 2 3 4 
Fabric S fixed | 1 2 3 4 5 


Analysis of Variance for strength 


Source DF SS MS F P 
Chemical 3 18.0440 6.0147 75.89 0.000 
Fabric S 4 6.6930 1.6733 21.11 0.000 
Error 12 0.9510 0.0792 

Total 19 25.6880 


F-test with denominator: Error 
Denominator MS = 0.079250 with 12 degrees of freedom 


Numerator DF MS F P 
Chemical 3 6.015 75.89 0.000 
Fabric S 4 1.673 21.11 0.000 


We will illustrate Fisher’s LSD method. The four chemical type averages from 
Example 13-5 are: 


Each treatment average uses b = 5 observations (one from each block). We will use a = 
0.05, so fo.925,12 = 2.179. Therefore the value of the LSD is 


2MS; 2(0.08) 
LSD = t0.025,12 i a = 2.179 5 = 0.39 


Any pair of treatment averages that differ by 0.39 or more indicates that this pair of treatment 
means is significantly different. The comparisons are shown below: 


Avs. 1 = Jy. — yy. = 3.56 — 1.14 = 2.42 > 0.39 
Avs. 3 = yy. — yz. = 3.56 — 1.38 = 2.18 > 0.39 
4vs.2 = Jy. — yo. = 3.56 — 1.76 = 1.80 > 0.39 
2vs.1 =>. — 3). = 1.76 — 1.14 = 0.62 > 0.39 
2 vs.3 = ¥. — yy. = 1.76 — 1.38 = 0.38 < 0.39 
1.38 — 1.14 = 0.24 < 0.39 


3 vs. 1 = y3. — yy. 


Figure 13-10 presents the results graphically. The underlined pairs of means are not different. 
The LSD procedure indicates that chemical type 4 results in significantly different strengths 
than the other three types do. Chemical types 2 and 3 do not differ, and types 1 and 3 do not 
differ. There may be a small difference in strength between types 1 and 2. 
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LSD method. — 


13-4.3 Residual Analysis and Model Checking 


In any designed experiment, it is always important to examine the residuals and to check for 
violation of basic assumptions that could invalidate the results. As usual, the residuals for the 
RCBD are just the difference between the observed and estimated (or fitted) values from the 
statistical model, say, 


ej = Yi — Diy (13-33) 
and the fitted values are 
Vg = Vi + VG — Y- 


The fitted value represents the estimate of the mean response when the ith treatment is run in 

the jth block. The residuals from the chemical type experiment are shown in Table 13-15. 
Figures 13-11, 13-12, 13-13, and 13-14 present the important residual plots for the 

experiment. These residual plots are usually constructed by computer software packages. 


Table 13-15 Residuals from the Randomized Complete Block Design 


Chenical Fabric Sample 
Type 1 2 3 4 3 
1 —0.18 —0.10 0.44 —0.18 0.02 
2 0.10 0.08 —0.28 0.00 0.10 
3 0.08 —0.24 0.30 0,12 —0.02 
4 0.00 0.28 —0.48 0.30 —0.10 
2 e 
8 
feet e 
a 8 
8 - 0.5 
c ad ; 3 
iS) e 
a ‘ i 
-1 we ety 0 : ele | if 
e el 2 63 t 
ele | 
20 |= @ e ° 
-0.50 -0.25 0 0.25 0.50 i 
Residual value -0.5 


Figure 13-11 Normal probability plot of 
residuals from the randomized complete 
block design. 


Figure 13-12 Residuals by treatment. 
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Figure 13-13 Residuals by block. 
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Figure 13-14 Residuals versus ,. 


There is some indication that fabric sample (block) 3 has greater variability in strength when 
treated with the four chemicals than the other samples. Chemical type 4, which provides the 
greatest strength, also has somewhat more variability in strength. Followup experiments may 
be necessary to confirm these findings, if they are potentially important. 


EXERCISES FOR SECTION 13-4 


13-34. Consider the computer output below. 


Randomized Block ANOVA: y versus Factor, Block 


Source DF SS MS F P 
Factor ? 193.800 64.600 ? ? 
Block 3 464.218 154.739 
Error ? ? 4.464 
Total 15 698.190 
(a) How many levels of the factor were used in this 
experiment? 
(b) How many blocks were used in this experiment? 
(c) Fill in the missing information. Use bounds for the 
P-value. 
(d) What conclusions would you draw if a = 0.05? What 


would you conclude if @ = 0.01? 


13-35. Exercise 13-2 introduced you to an experiment to in- 
vestigate the potential effect of consuming chocolate on cardio- 
vascular health. The experiment was conducted as a completely 
randomized design, and the exercise asked you to use the 
ANOVA to analyze the data and draw conclusions. Now assume 
that the experiment had been conducted as a RCBD, with the 
subjects considered as blocks. Analyze the data using this as- 
sumption. What conclusions would you draw (using a = 0.05) 
about the effect of the different types of chocolate on cardiovas- 
cular health? Would your conclusions change if a = 0.01? 


13-36. An article in Quality Engineering [Designed 
Experiment to Stabilize Blood Glucose Levels” (1999-2000, 
Vol. 12, pp. 83-87)] described an experiment to minimize 


variations in blood glucose levels. The treatment was the exer- 
cise time on a Nordic Track cross-country skier (10 or 20 min). 
The experiment was blocked for time of day. The data are as 
follows: 


Exercise (min) TimeofDay Average Blood Glucose 
10 pm 71.5 
10 am 103 
20 am 83.5 
20 pm 126 
10 am 125.5 
10 pm 129.5 
20 pm 95 
20 am 93 


(a) Is there an effect of exercise time on the average blood 
glucose? Use a = 0.05. 

(b) Find the P-value for the test in part (a). 

(c) Analyze the residuals from this experiment. 


13-37. In “The Effect of Nozzle Design on the Stability 
and Performance of Turbulent Water Jets” (Fire Safety 
Journal, Vol. 4, August 1981), C. Theobald described an 
experiment in which a shape measurement was determined 
for several different nozzle types at different levels of jet 
efflux velocity. Interest in this experiment focuses primarily 
on nozzle type, and velocity is a nuisance factor. The data are 
as follows: 
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Nozzle Jet Efflux Velocity (m/s) 
Type 11.73 14.37 16.59 20.43 23.46 28.74 
1 0.78 O80 081 0.75 0.77 0.78 
2 0.85 085 0.92 0.86 0.81 0.83 
3 0.93 0.92 0.95 0.89 0.89 0.83 
4 114 0.97 0.98 0.88 0.86 0.83 
5 0.97 0.86 0.78 0.76 0.76 0.75 


(a) Does nozzle type affect shape measurement? Compare the 
nozzles with box plots and the analysis of variance. 

(b) Use Fisher’s LSD method to determine specific differ- 
ences between the nozzles. Does a graph of the average 
(or standard deviation) of the shape measurements versus 
nozzle type assist with the conclusions? 

(c) Analyze the residuals from this experiment. 


13-38. In Design and Analysis of Experiments, 7th edition 
(John Wiley & Sons, 2009), D. C. Montgomery described an 
experiment that determined the effect of four different types 
of tips in a hardness tester on the observed hardness of a 
metal alloy. Four specimens of the alloy were obtained, and 
each tip was tested once on each specimen, producing the 
following data: 


Type of Specimen 
Tip 1 2 3 4 
1 9.3 9.4 9.6 10.0 
2 9.4 9.3 9.8 9.9 
3 Q)2 9.4 Oe) 9.7 
4 9.7 9.6 10.0 10.2 


(a) Is there any difference in hardness measurements between 
the tips? 

(b) Use Fisher’s LSD method to investigate specific differ- 
ences between the tips. 

(c) Analyze the residuals from this experiment. 


13-39. An article in the American Industrial Hygiene 
Association Journal (Vol. 37, 1976, pp. 418-422) described 
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a field test for detecting the presence of arsenic in urine samples. 
The test has been proposed for use among forestry workers 
because of the increasing use of organic arsenics in that 
industry. The experiment compared the test as performed by 
both a trainee and an experienced trainer to an analysis at a 
remote laboratory. Four subjects were selected for testing 
and are considered as blocks. The response variable is ar- 
senic content (in ppm) in the subject’s urine. The data are as 
follows: 


Subject 
Test 1 D 3 4 
Trainee 0.05 0.05 0.04 0.15 
Trainer 0.05 0.05 0.04 0.17 
Lab 0.04 0.04 0.03 0.10 


(a) Is there any difference in the arsenic test procedure? 
(b) Analyze the residuals from this experiment. 


13-40. An article in the Food Technology Journal (Vol. 10, 

1956, pp. 39-42) described a study on the protopectin content 

of tomatoes during storage. Four storage times were selected, 

and samples from nine lots of tomatoes were analyzed. The 
protopectin content (expressed as hydrochloric acid soluble 

fraction mg/kg) is in Table 13-16. 

(a) The researchers in this study hypothesized that mean pro- 
topectin content would be different at different storage 
times. Can you confirm this hypothesis with a statistical 
test using a = 0.05? 

(b) Find the P-value for the test in part (a). 

(c) Which specific storage times are different? Would you 
agree with the statement that protopectin content de- 
creases as storage time increases? 

(d) Analyze the residuals from this experiment. 


13-41. An experiment was conducted to investigate leak- 
ing current in a SOS MOSFETS device. The purpose of the 
experiment was to investigate how leakage current varies 
as the channel length changes. Four channel lengths were 
selected. For each channel length, five different widths were 


Table 13-16 Protopectin Content of Tomatoes in Storage 


Storage Lot 
Time 1 2 3 4 3 6 7 8 9 
0 days 1694.0 989.0 917.3 346.1 1260.0 965.6 1123.0 1106.0 1116.0 
7 days 1802.0 1074.0 278.8 1375.0 544.0 672.2 818.0 406.8 461.6 
14 days 1568.0 646.2 1820.0 1150.0 983.7 3I85),3) 422.3 420.0 409.5 
21 days 415.5 845.4 377.6 279.4 447.8 272.1 394.1 356.4 351.2 


also used, and width is to be considered a nuisance factor. 
The data are as follows: 


Channel Width 

Length 1 Y) 3 4 5 
1 0.7 0.8 0.8 0.9 1.0 
2 0.8 0.8 0.9 0.9 1.0 
3 0.9 1.0 lea 2.0 4.0 
4 1.0 1.5 2.0 3.0 20.0 


(a) Test the hypothesis that mean leakage voltage does not 
depend on the channel length, using « = 0.05. 

(b) Analyze the residuals from this experiment. Comment on 
the residual plots. 

(c) The observed leakage voltage for channel length 4 and 
width 5 was erroneously recorded. The correct observa- 
tion is 4.0. Analyze the corrected data from this experi- 
ment. Is there evidence to conclude that mean leakage 
voltage increases with channel length? 


Supplemental Exercises 


13-42. Consider the computer output below. 


One-way ANOVA: y versus Factor 


Source DF SS MS F P 
Factor ? ? ? ? ? 
Error 15 167.5 ? 
Total 19 326.2 

S = 3.342 R-Sq =? R-Sq(adj) = 34.96% 


(a) How many levels of the factor were used in this experiment? 

(b) How many replicates were used? 

(c) Fill in the missing information. Use bounds for the P-value. 

(d) What conclusions would you draw if a = 0.05? What if 
a = 0.01? 


13-43. Consider the computer output below. 


Randomized Block ANOVA: y versus Factor, Block 


Source DF SS MS FP 
Factor ? 126.880 63.4401 ? ? 
Block ? 54.825 18.2751 

Error 6 ? 2.7403 

Total 11 198.147 


(a) How many levels of the factor were used in this experiment? 
(b) How many blocks were used? 

(c) Fill in the missing information. Use bounds for the P-value. 
(d) What conclusions would you draw if a = 0.05? What if 

a = 0.01? 

13-44. An article in Lubrication Engineering (December 
1990) described the results of an experiment designed to 
investigate the effects of carbon material properties on the pro- 
gression of blisters on carbon face seals. The carbon face seals 
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are used extensively in equipment such as air turbine starters. 
Five different carbon materials were tested, and the surface 
roughness was measured. The data are as follows: 


Carbon 
Material 
Type Surface Roughness 
EC10 0.50 0.55 0.55 0.36 
ECI10A 0.31 0.07 0.25 0.18 0.56 0.20 
EC4 0.20 0.28 0.12 
ECI1 0.10 0.16 


(a) Does carbon material type have an effect on mean surface 
roughness? Use a = 0.05. 

(b) Find the residuals for this experiment. Does a normal 
probability plot of the residuals indicate any problem with 
the normality assumption? 

(c) Plot the residuals versus $;. Comment on the plot. 

(d) Find a 95% confidence interval on the difference between 
the mean surface roughness between the EC10 and the 
EC1 carbon grades. 

(e) Apply the Fisher LSD method to this experiment. 
Summarize your conclusions regarding the effect of mate- 
rial type on surface roughness. 

13-45. Anarticle in the EEE Transactions on Components, 

Hybrids, and Manufacturing Technology (Vol. 15, No. 2, 

1992, pp. 146-153) described an experiment in which the con- 

tact resistance of a brake-only relay was studied for three dif- 

ferent materials (all were silver-based alloys). The data are as 
follows. 


Alloy Contact Resistance 
1 95 97 99 98 99 
99 99 94 95 98 
2 104 102 102 105 99 
102 111 103 100 103 
3 119 130 132 136 141 
172 145 150 144 135 


(a) Does the type of alloy affect mean contact resistance? Use 
a = 0.01. 

(b) Use Fisher’s LSD method to determine which means differ. 

(c) Find a 99% confidence interval on the mean contact 
resistance for alloy 3. 

(d) Analyze the residuals for this experiment. 


13-46. An article in the Journal of Quality Technology 
(Vol. 14, No. 2, 1982, pp. 80-89) described an experiment 
in which three different methods of preparing fish are eval- 
uated on the basis of sensory criteria and a quality score is 


(_) 


i) 


s 
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assigned. Assume that these methods have been randomly 
selected from a large population of preparation methods. 
The data are in the following table: 


Method Score 
1 24.4 23.2 25.0 IGF 
Do) 24.4 23.8 18.0 
2 22.1 19.5 17.3 19.7 
22.3 23.2 21.4 22.6 
3 2373 22.8 22.4 Bol 
20.4 DEES 20.8 24.1 


(a) Is there any difference in preparation methods? Use 
a = 0.05. 

(b) Calculate the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment and comment 
on model adequacy. 

(d) Estimate the components of variance. 


13-47. An article in the Journal of Agricultural 
Engineering Research (Vol. 52, 1992, pp. 53-76) described 
an experiment to investigate the effect of drying temperature 
of wheat grain on the baking quality of bread. Three temper- 
ature levels were used, and the response variable measured 
was the volume of the loaf of bread produced. The data are 
as follows: 


Temperature (°C) Volume (CC) 
70.0 1245 1235 1285 1245 1235 
75.0 1235 1240 1200 1220 1210 
80.0 1225 1200 1170 1155 1095 


(a) Does drying temperature affect mean bread volume? Use 
a = 0.01. 

(b) Find the P-value for this test. 

(c) Use the Fisher LSD method to determine which means are 
different. 


Table 13-17 Software Development Costs 
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(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-48. An article in Agricultural Engineering (December 
1964, pp. 672-673) described an experiment in which 
the daily weight gain of swine is evaluated at different levels of 
housing temperature. The mean weight of each group of 
swine at the start of the experiment is considered to be a nui- 
sance factor. The data from this experiment are as follows: 


Housing Air Temperatures 


Mean 
Weight (degrees F) 
(Ibs) 50 60 70 80 90 100 
100 1.37 1.58 2.00 1.97 1.40 0.39 
150 1.47 1.75 2.16 1.82 1.14 —0.19 
200 1.19 1.91 Pee, 1.67 0.88 —0.77 


(a) Does housing air temperature affect mean weight gain? 
Use a = 0.05. 

(b) Use Fisher’s LSD method to determine which tempera- 
ture levels are different. 

(c) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-49. An article in Communications of the ACM (Vol. 30, 

No. 5, 1987) studied different algorithms for estimating 

software development costs. Six algorithms were applied to 

eight software development projects and the percent error in 

estimating the development cost was observed. The data are in 

Table 13-17. 

(a) Do the algorithms differ in their mean cost estimation 
accuracy? Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Which algorithm would you recommend for use in 
practice? 

13-50. An article in Nature Genetics (2003, Vol. 34(1), pp. 

85-90) “Treatment-specific changes in gene expression dis- 


criminate in vivo drug response in human leukemia cells” 
studied gene expression as a function of different treatments 


Project 

Algorithm 1 2 3 4 5 6 7 8 

1(SLIM) 1244 21 82 220M 905 839 527 122 
2(COCOMO-A) 281 129 396 1306 336 910 473 199 
3(COCOMO-R) 220 84 458 543 300 794 488 142 
4(COCOMO-C) 225 83 425 552 291 826 509 153 
S(FUNCTION POINTS) 19 11 —34 121 15 103 87 leh 
6(ESTIMALS) —20 35 —53 170 104 199 142 41 


s 


s 


s 


Table 13-18 Treatment-Specific Changes in Gene Expression 
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Treatments Observations 

MP ONLY 334.5 31.6 701 41.2 61.2 69.6 67.5 66.6 120.7 881.9 

MP+HDMTX 919.4 404.2 1024.8 54.1 62.8 671.6 882.1 354.2 321.9 91.1 

MP+LDMTX 108.4 26.1 240.8 191.1 69.7 242.8 62.7 396.9 23.6 290.4 
for leukemia. Three treatment groups are: mercaptopurine 13-51. Consider an ANOVA situation with a = 5 treat- 


(MP) only; low-dose methotrexate (LDMTX) and MP; and 
high-dose methotrexate (HDMTX) and MP. Each group con- 
tained ten subjects. The responses from a specific gene are 
shown in Table 13-18. 

(a) Check the normality of the data. Can we assume these 
samples are from normal populations? 

(b) Take the logarithm of the raw data and check the normal- 
ity of the transformed data. Is there evidence to support 
the claim that the treatment means differ for the trans- 
formed data? Use a = 0.1. 

(c) Analyze the residuals from the transformed data and com- 
ment on model adequacy. 


ments. Let o* = 9 and a = 0.05, and suppose that n = 4. 

(a) Find the power of the ANOVA F-test when p, = po = 
3 = 1, py = 3, and ps = 2. 

(b) What sample size is required if we want the power of the 
F-test in this situation to be at least 0.90? 


13-52. Consider an ANOVA situation with a = 4 means 

Mw, = 1, bo = 5, by = 8, and wy = 4. Suppose that o? = 4, 

n = 4, anda = 0.05. 

(a) Find the power of the ANOVA F-test. 

(b) How large would the sample size have to be if we want the 
power of the F-test for detecting this difference in means 
to be at least 0.90? 


MIND-EXPANDING EXERCISES 


13-53. Show that in the fixed-effects model analysis 
of variance E(MS,) = o*. How would your development 
change if the random-effects model had been specified? 


13-54. Consider testing the equality of the means of 
two normal populations where the variances are unknown 
but are assumed equal. The appropriate test procedure is 
the two-sample t-test. Show that the two-sample test is 
equivalent to the single-factor analysis of variance F-test. 
13-55. Consider the ANOVA with a = 2 treatments. 
Show that the MS, in this analysis is equal to the pooled 
variance estimate used in the two-sample f-test. 


13-56. Show that the variance of the linear combination 
a da 
> CiY;. is 07 > njC?. 
i=1 i=1 
13-57. Ina fixed-effects model, suppose that there are 


n observations for each of four treatments. Let Q7, Q3, 


and Q3 be single-degree-of-freedom sums of squares for 
orthogonal contrasts. A contrast is a linear combination of 
the treatment means with coefficients that sum to zero. 
The coefficient vectors of orthogonal contrasts are or- 
thogonal vectors. Prove that SStyeamens = Qt + O3 + Q3. 
13-58. Consider the single-factor completely ran- 
domized design with a treatments and n replicates. 
Show that if the difference between any two treatment 


means is as large as D, the minimum value that the OC 
curve parameter © can take is 


2 
2 nD 
2ac0 


13-59. Consider the single-factor completely ran- 
domized design. Show that a 100(1 — «@) percent confi- 
dence interval for o? is 


2 
Xa/2,N-a 


ee) = (N = a)MS_ 


2 
X1-a/2,N-a 


where JN is the total number of observations in the 
experimental design. 


13-60. Consider the random-effect model for the 
single-factor completely randomized design. Show that 
a 100(1 — a)% confidence interval on the ratio of vari- 
ance components o2/o” is given by 
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MIND-EXPANDING EXERCISES 


where 


he 1 | Sieaee x ( 1 ) 1 
i MS; VopaatN=a 


and 
= 1 | Area x ( 1 ) 1 
le MS; Jiao Livia 
13-61. Consider a random-effects model for the 


single-factor completely randomized design. (a) Show 
that a 100(1 — a)% confidence interval on the ratio 
o2/(o7 + 02) is 


D 
Le U 


= = 
1+L o+o2 14+U 


where L and U are as defined in Exercise 13-60. 
(b) Use the results of part (a) to finda 10011 — a)% confi- 
dence interval for o?/(a7 + 02). 

13-62. Consider the fixed-effect model of the com- 
pletely randomized single-factor design. The model 
parameters are restricted by the constraint }7_, 7; = 0. 
(Actually, other restrictions could be used, but this one is 
simple and results in intuitively pleasing estimates for 


IMPORTANT TERMS AND CONCEPTS 


the model parameters.) For the case of unequal sample 
Size nj, No, ..., Ng, the restriction is S‘_, nj; = 0, Use 
this to show that 


E(MStyeatments) a o ar 


Does this suggest that the null hypothesis in this model 
is Ho: 147, = MoT) = +** = Nyt, = 0? 

13-63. Sample Size Determination. In the single- 
factor completely randomized design, the accuracy of a 
100(1 — a)% confidence interval on the difference in 
any two treatment means is fy2,a(n— 1) V 2MS_/n. 

(a) Show that if A is the desired accuracy of the interval, 

the sample size required is 


2F o/2,1,a(n = )MS_ 

L = ——— 

A 

(b) Suppose that in comparing a = 5 means we have a 
preliminary estimate of o7 of 4. If we want the 95% 
confidence interval on the difference in means to 
have an accuracy of 2, how many replicates should 
we use? 
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Design of Experiments 
with Several Factors 


Carotenoids are fat-soluble pigments that occur naturally in fruits in vegetables that are rec- 
ommended for healthy diets. A well-known carotenoid is beta-carotene. Astaxanthin is 
another carotenoid that is a strong antioxidant and commercially produced. An exercise later 
in this chapter describes an experiment in Biotechnology Progress to promote astaxanthin 
production. Seven variables were considered important to production: photon flux density, 
and concentrations of nitrogen, phosphorous, magnesium, acetate, ferrous, NaCl. It was 
important to study the effects of these factors, but also the effects of combinations on the pro- 
duction. Even with only a high and low setting for each variable, an experiment that uses all 
possible combinations requires 2’ = 128 tests. There are a number of disadvantages of such 
a large experiment and a question is whether a fraction of the full set of tests can be selected 
to provide the most important information about the effects of these variables in many fewer 
runs. The example used a surprisingly small set of 16 runs (16/128 = 1/8 fraction). The 
design and analysis of experiments of this type is the focus of this chapter. Such experiments 
are widely-used throughout modern engineering development and scientific studies. 


CHAPTER OUTLINE 

14-1 INTRODUCTION 14-4 GENERAL FACTORIAL 

14-2. FACTORIAL EXPERIMENTS idee a 

14-3. TWO-FACTOR FACTORIAL 14-5 2' FACTORIAL DESIGNS 
EXPERIMENTS 14-5.1 2? Design 
14-3.1 Statistical Analysis of the 14-5.2 2" Design for k = 3 Factors 

Fixed-Effects Model 14-5.3 Single Replicate of the 2" Design 

14-3.2 Model Adequacy Checking 14-5.4 Addition of Center Points to a 
14-3.3 One Observation per Cell 2" Design 
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14-6 BLOCKING AND CONFOUNDING 14-7.2 Smaller Fractions: The 2'~? 
IN THE 2' DESIGN Fractional Factorial 
14-7 FRACTIONAL REPLICATION OF 14-8 RESPONSE SURFACE METHODS 
THE 2" DESIGN AND DESIGNS 
14-7.1 One-Half Fraction of the 2* 
Design 
LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 


1. Design and conduct engineering experiments involving several factors using the factorial 
design approach 


. Know how to analyze and interpret main effects and interactions 

. Understand how the ANOVA is used to analyze the data from these experiments 
. Assess model adequacy with residual plots 

. Know how to use the two-level series of factorial designs 

. Understand how two-level factorial designs can be run in blocks 

. Design and conduct two-level fractional factorial designs 


. Test for curvature in two-level factorial designs by using center points 


OMAN A NH BW WY 


. Use response surface methodology for process optimization experiments 


14-1 INTRODUCTION 


An experiment is just a test or series of tests. Experiments are performed in all engineering and 
scientific disciplines and are an important part of the way we learn about how systems and 
processes work. The validity of the conclusions that are drawn from an experiment depends to a 
large extent on how the experiment was conducted. Therefore, the design of the experiment 
plays a major role in the eventual solution of the problem that initially motivated the experiment. 
In this chapter we focus on experiments that include two or more factors that the experi- 
menter thinks may be important. A factorial experiment is a powerful technique for this type of 
problem. Generally, in a factorial experimental design, experimental trials (or runs) are per- 
formed at all combinations of factor levels. For example, if a chemical engineer is interested in 
investigating the effects of reaction time and reaction temperature on the yield of a process, and 
if two levels of time (1 and 1.5 hours) and two levels of temperature (125 and 150°F) are con- 
sidered important, a factorial experiment would consist of making experimental runs at each of 
the four possible combinations of these levels of reaction time and reaction temperature. 
Experimental design is an extremely important tool for engineers and scientists who are in- 
terested in improving the performance of a manufacturing process. It also has extensive applica- 
tion in the development of new processes and in new product design. We now give some examples. 


Process Characterization Experiment 

In an article in JEEE Transactions on “Electronics Packaging Manufacturing” (2001, Vol. 24(4), 
pp. 249-254), the authors discussed the change to lead-free solder in surface mount technology 
(SMT). SMT is a process to assemble electronic components to a printed circuit board. Solder 
paste is printed through a stencil onto the printed circuit board. The stencil-printing machine has 
squeegees; the paste rolls in front of the squeegee and fills the apertures in the stencil. The 
squeegee shears off the paste in the apertures as it moves over the stencil. Once the print stroke 


Figure 14-1 The 
flow solder 
experiment. 
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Uncontrollable (noise) factors 


is completed, the board is separated mechanically from the stencil. Electronic components are 
placed on the deposits and the board is heated so that the paste reflows to form the solder joints. 

The current SMT soldering process is based on tin—lead solders, and it has been well 
developed and refined over the years to operate at a competitive cost. The process will have 
several (perhaps many) variables, and all of them are not equally important. The initial list of 
candidate variables to be included in an experiment is constructed by combining the knowl- 
edge and information about the process from all team members. For example, engineers would 
conduct a brainstorming session and invite manufacturing personnel with SMT experience 
to participate. SMT has several variables that can be controlled. These include 


(1) squeegee speed, (2) squeegee pressure, (3) squeegee angle, (4) metal or polyurethane squeegee, 
(5) squeegee vibration, (6) delay time before the squeegee lifts from the stencil, (7) stencil separa- 
tion speed, (8) print gap, (9) solder paste alloy, (10) paste pretreatment (11) paste particle size, 
(12) flux type, (13) reflow temperature, (14) reflow time, and so forth. 


In addition to these controllable factors, there are several other factors that cannot be easily 
controlled during routine manufacturing, including 


(1) thickness of the printed circuit board, (2) types of components used on the board and aperture 
width and length, (3) layout of the components on the board, (4) paste density variation, (5) envi- 
ronmental factors, (6) squeegee wear, (7) cleanliness, and so forth. 


Sometimes we call the uncontrollable factors noise factors. A schematic representation of the 
process is shown in Fig. 14-1. In this situation, the engineer wants to characterize the SMT 
process; that is, to determine the factors (both controllable and uncontrollable) that affect the 
occurrence of defects on the printed circuit boards. To determine these factors, an experiment 
can be designed to estimate the magnitude and direction of the factor effects. Sometimes we 
call such an experiment a screening experiment. The information from this characterization 
study, or screening experiment, can help determine the critical process variables as well as the 
direction of adjustment for these factors in order to reduce the number of defects, and assist in 
determining which process variables should be carefully controlled during manufacturing to 
prevent high defect levels and erratic process performance. 


Optimization Experiment 
In a characterization experiment, we are interested in determining which factors affect the 
response. A logical next step is to determine the region in the important factors that leads to an opti- 
mum response. For example, if the response is cost, we will look for a region of minimum cost. 
As an illustration, suppose that the yield of a chemical process is influenced by the op- 
erating temperature and the reaction time. We are currently operating the process at 155°F 
and 1.7 hours of reaction time, and the current process yield is around 75%. Figure 14-2 
shows a view of the time-temperature space from above. In this graph we have connected 
points of constant yield with lines. These lines are yield contours, and we have shown the 
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Figure 14-2 Contour 
plot of yield as a func- 
tion of reaction time 
and reaction tempera- 
ture, illustrating an 
optimization 
experiment. 
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contours at 60, 70, 80, 90, and 95% yield. To locate the optimum, we might begin with a 
factorial experiment such as we describe below, with the two factors, time and temperature, 
run at two levels each at 10°F and 0.5 hours above and below the current operating condi- 
tions. This two-factor factorial design is shown in Fig. 14-2. The average responses observed 
at the four points in the experiment (145°F, 1.2 hours; 145°F, 2.2 hours; 165°F, 1.2 hours; and 
165°F, 2.2 hours) indicate that we should move in the general direction of increased temper- 
ature and lower reaction time to increase yield. A few additional runs could be performed in 
this direction to locate the region of maximum yield. 


A Product Design Example 

We can also use experimental design in the development of new products. For example, suppose 
that a group of engineers are designing a door hinge for an automobile. The product characteristic 
is the check effort, or the holding ability, of the latch that prevents the door from swinging closed 
when the vehicle is parked on a hill. The check mechanism consists of a leaf spring and a roller. 
When the door is opened, the roller travels through an arc causing the leaf spring to be com- 
pressed. To close the door, the spring must be forced aside, and this creates the check effort. The 
engineering team thinks that check effort is a function of the following factors: 


(1) roller travel distance, (2) spring height from pivot to base, (3) horizontal distance from pivot to 
spring, (4) free height of the reinforcement spring, (5) free height of the main spring. 


The engineers can build a prototype hinge mechanism in which all these factors can be 
varied over certain ranges. Once appropriate levels for these five factors have been identified, 
an experiment can be designed consisting of various combinations of the factor levels, and the 
prototype can be tested at these combinations. This will produce information concerning 
which factors are most influential on the latch check effort, and through analysis of this infor- 
mation, the latch design can be improved. 


14-2 
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Most of the statistical concepts introduced in Chapter 13 for single-factor experiments can 
be extended to the factorial experiments of this chapter. The analysis of variance (ANOVA), in 
particular, will continue to be used as a tool for statistical data analysis. We will also introduce 
several graphical methods that are useful in analyzing the data from designed experiments. 


FACTORIAL EXPERIMENTS 


Factorial 
Experiment 


When several factors are of interest in an experiment, a factorial experiment should be used. 
As noted previously, in these experiments factors are varied together. 


By a factorial experiment we mean that in each complete trial or replicate of the 
experiment all possible combinations of the levels of the factors are investigated. 


Thus, if there are two factors A and B with a levels of factor A and b levels of factor B, each 
replicate contains all ab treatment combinations. 

The effect of a factor is defined as the change in response produced by a change in the level 
of the factor. It is called a main effect because it refers to the primary factors in the study. For ex- 
ample, consider the data in Table 14-1. This is a factorial experiment with two factors, A and B, 
each at two levels (Ajow, Apighs ANd Bioy, Brigh). The main effect of factor A is the difference between 
the average response at the high level of A and the average response at the low level of A, or 


30+ 40 10+ 20 55 
7 ; aa 


That is, changing factor A from the low level to the high level causes an average response 
increase of 20 units. Similarly, the main effect of B is 


R 20+ 40 10 + 30 10 
= 5 5 = 


In some experiments, the difference in response between the levels of one factor is not the 
same at all levels of the other factors. When this occurs, there is an interaction between the fac- 
tors. For example, consider the data in Table 14-2. At the low level of factor B, the A effect is 


A = 30 — 10 = 20 
and at the high level of factor B, the A effect is 
A =0- 20 = —20 


Since the effect of A depends on the level chosen for factor B, there is interaction between A and B. 


Table 14-1 A Factorial Experiment with Table 14-2 A Factorial Experiment with 


Two Factors Interaction 
Factor B Factor B 
Factor A B low B high Factor A Brow B high 
Alow 10 20 Alow 10 20 


Ahigh 30 40 Ahigh 30 0 
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When an interaction is large, the corresponding main effects have very little practical 
meaning. For example, by using the data in Table 14-2, we find the main effect of A as 


30+0_ 10+20_, 
2 2 


and we would be tempted to conclude that there is no factor A effect. However, when we ex- 
amined the effects of A at different levels of factor B, we saw that this was not the case. The 
effect of factor A depends on the levels of factor B. Thus, knowledge of the AB interaction is 
more useful than knowledge of the main effect. A significant interaction can mask the signif- 
icance of main effects. Consequently, when interaction is present, the main effects of the 
factors involved in the interaction may not have much meaning. 

It is easy to estimate the interaction effect in factorial experiments such as those illus- 
trated in Tables 14-1 and 14-2. In this type of experiment, when both factors have two 
levels, the AB interaction effect is the difference in the diagonal averages. This represents 
one-half the difference between the A effects at the two levels of B. For example, in 
Table 14-1, we find the AB interaction effect to be 


20+ 30 10+ 40 
AB =0 
2 2, 
Thus, there is no interaction between A and B. In Table 14-2, the AB interaction effect is 


20+30 10+0_ 
2 2 


AB 20 


As we noted before, the interaction effect in these data is very large. 

The concept of interaction can be illustrated graphically in several ways. Figure 14-3 
plots the data in Table 14-1 against the levels of A for both levels of B. Note that the B,,,, and 
Bpign lines are approximately parallel, indicating that factors A and B do not interact signifi- 
cantly. Figure 14-4 presents a similar plot for the data in Table 14-2. In this graph, the B,,,, and 
Byigh lines are not parallel, indicating the interaction between factors A and B. Such graphical 
displays are called two-factor interaction plots. They are often useful in presenting the re- 
sults of experiments, and many computer software programs used for analyzing data from de- 
signed experiments will construct these graphs automatically. 


50 50 
40 a Bhigh 40 
S an ae Bus Ss 30 © Blow 
= 20 ° = 20 ° 
a 2 
rs} 10 e 8 10 e 
(0) 0 e Bhigh 
Alow Ahigh Atow Anigh 
Factor A Factor A 
Figure 14-3 Factorial experiment, no Figure 14-4 Factorial experiment, with 
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Figure 14-5  Three-dimensional surface plot of the data from Figure 14-6 Three-dimensional surface plot of the data from 
Table 14-1, showing the main effects of the two factors A and B. Table 14-2 showing the effect of the A and B interaction. 


Figures 14-5 and 14-6 present another graphical illustration of the data from Tables 14-1 and 
14-2. In Fig. 14-3 we have shown a three-dimensional surface plot of the data from Table 14-1. 
These data contain no interaction, and the surface plot is a plane lying above the A-B space. The 
slope of the plane in the A and B directions is proportional to the main effects of factors A and B, 
respectively. Figure 14-6 is a surface plot of the data from Table 14-2. Notice that the effect of the 
interaction in these data is to “twist” the plane, so that there is curvature in the response function. 
Factorial experiments are the only way to discover interactions between variables. 

An alternative to the factorial design that is (unfortunately) used in practice is to change 
the factors one at a time rather than to vary them simultaneously. To illustrate this one-factor-at- 
a-time procedure, suppose that an engineer is interested in finding the values of temperature and 
pressure that maximize yield in a chemical process. Suppose that we fix temperature at 155°F (the 
current operating level) and perform five runs at different levels of time, say, 0.5, 1.0, 1.5, 2.0, and 
2.5 hours. The results of this series of runs are shown in Fig. 14-7. This figure indicates that max- 
imum yield is achieved at about 1.7 hours of reaction time. To optimize temperature, the engineer 
then fixes time at 1.7 hours (the apparent optimum) and performs five runs at different tempera- 
tures, say, 140, 150, 160, 170, and 180°E The results of this set of runs are plotted in Fig. 14-8. 
Maximum yield occurs at about 155°E Therefore, we would conclude that running the process at 
155°F and 1.7 hours is the best set of operating conditions, resulting in yields of around 75%. 
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e 
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Figure 14-7 Yield versus reaction time with Figure 14-8 Yield versus temperature with reaction 


temperature constant at 155°F. time constant at 1.7 hours. 
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Figure 14-9 
Optimization 
experiment using the 
one-factor-at-a- 

time method. 


Temperature (°F) 


0.5 1.0 1:5 2.0 2.5 
Time (hr) 


Figure 14-9 displays the contour plot of actual process yield as a function of temperature 
and time with the one-factor-at-a-time experiments superimposed on the contours. Clearly, 
this one-factor-at-a-time approach has failed dramatically here, as the true optimum is at least 
20 yield points higher and occurs at much lower reaction times and higher temperatures. The 
failure to discover the importance of the shorter reaction times is particularly important be- 
cause this could have significant impact on production volume or capacity, production plan- 
ning, manufacturing cost, and total productivity. 

The one-factor-at-a-time approach has failed here because it cannot detect the interac- 
tion between temperature and time. Factorial experiments are the only way to detect inter- 
actions. Furthermore, the one-factor-at-a-time method is inefficient. It will require more 
experimentation than a factorial, and as we have just seen, there is no assurance that it will 
produce the correct results. 


14-3 TWO-FACTOR FACTORIAL EXPERIMENTS 


The simplest type of factorial experiment involves only two factors, say A, and B. There are a 
levels of factor A and b levels of factor B. This two-factor factorial is shown in Table 14-3. The 
experiment has n replicates, and each replicate contains all ab treatment combinations. The 
observation in the ijth cell for the Ath replicate is denoted by y;,. In performing the experi- 
ment, the abn observations would be run in random order. Thus, like the single-factor exper- 
iment studied in Chapter 13, the two-factor factorial is a completely randomized design. 

The observations may be described by the linear statistical model 


i=1,2,...,a 
Vg = ay Bee OB) ee hf = Le wg PD (14-1) 
k=1,2,....n 
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Table 14-3 Data Arrangement for a Two-Factor Factorial Design 
Factor B 
1 D} b Totals Averages 
1 Mitt» Yi12» Y121> Y122» Yibi> Yib2» 7 
+9 Vin +29 V12n +++ Yibn Viv Jie 
> Y211+ Y2125 Y221» Y2225 Y2b1> Y2b2s 7 
Factor A +5 Y2In +5 Y22n ++ Yobn Yo. yo. 
Yai» Yai2» Ya21» Ya22» Yabi> Yab2» 
a +9 Vain s+ Ya2n ++» Yabn Yas Yar 
Totals yy. yer. Y-p Vie: 
Averages y.. y.2. yp ve 


where p is the overall mean effect, 7; is the effect of the ith level of factor A, B; is the effect of 
the jth level of factor B, (tB),; is the effect of the interaction between A and B, and €;;, is a ran- 
dom error component having a normal distribution with mean zero and variance o*. We are 
interested in testing the hypotheses of no main effect for factor A, no main effect for B, and no 
AB interaction effect. As with the single-factor experiments of Chapter 13, the analysis of vari- 
ance (ANOVA) will be used to test these hypotheses. Since there are two factors in the 
experiment, the test procedure is sometimes called the two-way analysis of variance. 


14-3.1 Statistical Analysis of the Fixed-Effects Model 

Suppose that A and B are fixed factors. That is, the a levels of factor A and the b levels of fac- 
tor B are specifically chosen by the experimenter, and inferences are confined to these levels 
only. In this model, it is customary to define the effects 7;, B;, and (7B);; as deviations from the 
mean, so that Sas = 0, pe = 0, Di-1 (7B); = 0, and dj-1(48);j = 0. 

The analysis of variance can be used to test hypotheses about the main factor effects of 
A and B and the AB interaction. To present the ANOVA, we will need some symbols, some of 
which are illustrated in Table 14-3. Let y,. denote the total of the observations taken at the ith 
level of factor A; y.;. denote the total of the observations taken at the jth level of factor B; y;,. 
denote the total of the observations in the ijth cell of Table 14-3; and y... denote the grand total 
of all the observations. Define y,.., y.;.. yj, and y... as the corresponding row, column, cell, 
and grand averages. That is, 


Notation 
for Totals bon y; 
and Means y= > Vijk yj. = f= 1,2, ovo o@ 
j=l k=l bn 
a n * 
= Yap : 
Ve = D vie ip — Tai Ye Deer) 
Al i 
n 2 Yij- 
Yi = Dy Vij = a 
k=1 
a b n Vex 
You = Vie Fhe GH Unb 
i=l »> a abn 
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ANOVA 

Sum of Squares 
Identity: Two 
Factors 


The hypotheses that we will test are as follows: 


1. Apt) =% =: =7,=0 (no main effect of factor A) 
H,: at least one T; # 0 

2. Ao Bi = Bo =°'° =P, =0 (no main effect of factor B) (14-2) 
Hy: at least one B; # 0 

3. Ao: (TB), = (TB) .2 = +: = CB), = O (no interaction) 


H;: at least one (tB),; # 0 


As before, the ANOVA tests these hypotheses by decomposing the total variability in the data 
into component parts and then comparing the various elements in this decomposition. Total 
variability is measured by the total sum of squares of the observations 


a b 


ssr= SD Vows 


i=1 j=1 k=1 


and the sum of squares decomposition is defined below. 


The sum of squares identity for a two-factor ANOVA is 


a 


j=l 
b b n 


or symbolically, 


SSr = SS4 a SSz =k SSap =e SSE (14-4) 


Equations 14-3 and 14-4 state that the total sum of squares SS; is partitioned into a sum of 
squares for the row factor A (SS,), a sum of squares for the column factor B (SS,), a sum of 
squares for the interaction between A and B (SS,,), and an error sum of squares (SS;). There 
are abn — | total degrees of freedom. The main effects A and B have a — 1 and b — 1 degrees 
of freedom, while the interaction effect AB has (a — 1)(b — 1) degrees of freedom. Within 
each of the ab cells in Table 14-3, there are n — 1 degrees of freedom between the n replicates, 
and observations in the same cell can differ only because of random error. Therefore, there are 
ab(n — 1) degrees of freedom for error. Therefore, the degrees of freedom are partitioned 
according to 


abn —1=(a-—1)+(b- 1) + (a— 1)(6- 1) + ab(n — 1) 


If we divide each of the sum of squares on the right-hand side of Equation 14-4 by the 
corresponding number of degrees of freedom, we obtain the mean squares for A, B, the 
interaction, and error: 


ni 55; ye SSp 


SS 
MS, = MS; = MS = 
eI oe ped Sap (a — 1)(b — 1) 


Expected 
Values of Mean 
Squares: 

Two Factors 


F Test for 
Factor A 


F Test for 
Factor B 


F Test for AB 
Interaction 
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Assuming that factors A and B are fixed factors, it is not difficult to show that the expected 
values of these mean squares are 


From examining these expected mean squares, it is clear that if the null hypotheses about main 
effects Hy: 7; = 0, Ho: B; = 0, and the interaction hypothesis Hp: (TB), = 0 are all true, all four 
mean squares are unbiased estimates of o”. 

To test that the row factor effects are all equal to zero (Hy: 7; = 0), we would use the ratio 


_ MS, 


i= 
Oo MSE 


which has an F distribution with a — 1 and ab(n — 1) degrees of freedom if Ho: T; = 0 is true. This 
null hypothesis is rejected at the a level of significance if fo > fo.a—1abn—1) Similarly, to test the hy- 
pothesis that all the column factor effects are equal to zero (Ho: B; = 0), we would use the ratio 


_ MSz 


i= 
RS 


which has an F distribution with b — 1 and ab(n — 1) degrees of freedom if Hp: 8; = 0 is true. This 
null hypothesis is rejected at the a level of significance if fy > fo.p—1abin—1)- Finally, to test the hy- 
pothesis Hp: (+8); = 0, which is the hypothesis that all interaction effects are zero, we use the ratio 


MS 
Fo eae ‘AB 
MS, 


which has an F distribution with (a — 1)(b — 1) and ab(n — 1) degrees of freedom if the null 
hypothesis Hp: (7B);; = 0. This hypothesis is rejected at the a level of significance if 


fo > So(a-W(b—V),ab(n—-1)* 
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It is usually best to conduct the test for interaction first and then to evaluate the main 
effects. If interaction is not significant, interpretation of the tests on the main effects is 
straightforward. However, as noted in Section 14-3, when interaction is significant, the main 
effects of the factors involved in the interaction may not have much practical interpretative 
value. Knowledge of the interaction is usually more important than knowledge about the main 


effects. 


Computational formulas for the sums of squares are easily obtained. 


Computing 
ee ad Computing formulas for the sums of squares in a two-factor analysis of variance. 
NOVA: 
Two Factors ee nee 
Ss - ) S S (14-5) 
Al jl Zi abn 
Ged 2) 
Jive y- 
SS, = = a 14-6 
a = bn abn ( ) 
b Yn y2 
= ne ee 14- 
SSp= >i an ~ “abn (14-7) 
ee Vi cee 
SSap = yy — BS) 14-8 
AB >» 2 “bn 7 354 ~ SSe (14-8) 
SSe a SSp a SSap = SS4 as SSp (14-9) 


The computations are usually displayed in an ANOVA table, such as Table 14-4. 


EXAMPLE 14-1 Aircraft Primer Paint 

Aircraft primer paints are applied to aluminum surfaces by 
two methods: dipping and spraying. The purpose of the primer 
is to improve paint adhesion, and some parts can be primed 
using either application method. The process engineering 
group responsible for this operation is interested in learning 
whether three different primers differ in their adhesion proper- 


ties. A factorial experiment was performed to investigate the 
effect of paint primer type and application method on paint ad- 
hesion. For each combination of primer type and application 
method, three specimens were painted, then a finish paint was 
applied, and the adhesion force was measured. The data from 
the experiment are shown in Table 14-5. The circled numbers 


Table 14-4 ANOVA Table for a Two-Factor Factorial, Fixed-Effects Model 


Source of Sum of Degrees of 
Variation Squares Freedom Mean Square Fo 
nee SS, MS, 
A treatments SS4 a-1 A MS; 
B treatment SS b-1 Wipe a 
reatments a eg MS; 
f SSap MSap 
Interaction SS, (a — 1)\(b —- 1) MS, 
2 Gave) MS¢ 
Error SS ab(n — 1) 
SSE 
Total SS7 abn — 1 MS; = 
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Table 14-5 Adhesion Force Data for Example 14-1 


Primer Type Dipping Spraying jp 
1 4.0, 4.5, 4.3 5.4, 4.9, 5.6 28.7 
2 5.6, 4.9, 5.4 5.8, 6.1, 6.3 34.1 
3 3.8, 3.7, 4.0 (13) Sah oH), 0) (53) 27.0 
ye}: 40.2 49.6 89.8 = y... 


in the cells are the cell totals y,;.. The sums of squares required 
to perform the ANOVA are computed as follows: 


a b ye. 
3sr= >> Soh 
i=l j=l k= ab 
= (4.0)? + (4.5)? + = 
(89.8) 
5.0) = 10.72 
(5.0) is 
yee. 2 
View Vow 
SStypes = ES abn 
(28.7)° + (34.1)? + (27.0) 
6 
(89.8) 
_ = 4.58 
18 
b yy. ye. 
SSmethods = » ‘an. ~~ abn 
40.2)? + (49.6)? (89.8)? 
7 y+ (49.6) (89.8)° ha 
9 18 
ab Vie y?. 
SSinteraction = a » n abn SStypes SSimethods 
(12.8)? + (15.9)? + (11.5)? + (15.9)? + (18.2)? + (15.5)? 
~ 3 
(89.8) 
4.58 — 4.91 = 0.24 
18 
and 
SS = SS — SStypes ~ SStethods ~ SSinteraction 
= 10.72 — 4.58 — 4.91 — 0.24 = 0.99 


Tests on Individual Means 


The ANOVA is summarized in Table 14-6. The experimenter 
has decided to use a = 0.05. Since fy 959.12 = 3.89 and foos4.12 = 
4.75, we conclude that the main effects of primer type and ap- 
plication method affect adhesion force. Furthermore, since 1.5 < 
fo.os.2,12, there is no indication of interaction between these fac- 
tors. The last column of Table 14-6 shows the P-value for 
each F-ratio. Notice that the P-values for the two test statis- 
tics for the main effects are considerably less than 0.05, while 
the P-value for the test statistic for the interaction is greater 
than 0.05. 

Practical Interpretation: A graph of the cell adhesion 
force averages { ¥;;.} versus levels of primer type for each ap- 
plication method is shown in Fig. 14-10. The no-interaction 
conclusion is obvious in this graph, because the two lines are 
nearly parallel. Furthermore, since a large response indicates 
greater adhesion force, we conclude that spraying is the best 
application method and that primer type 2 is most effective. 
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Figure 14-10 Graph of average adhesion force ver- 
sus primer types for both application methods. 


When both factors are fixed, comparisons between the individual means of either factor may 
be made using any multiple comparison technique such as Fisher’s LSD method (described in 


564 


CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


Table 14-6 ANOVA for Example 14-1 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square ho P-Value 
Primer types 4.58 2, 2.29 28.63 2a Xx ES 
Application methods 4.91 1 4.91 61.38 4.7 X E-6 
Interaction 0.24 y} 0.12 1.50 0.2621 
Error 0.99 12 0.08 
Total 10.72 17 


Chapter 13). When there is no interaction, these comparisons may be made using either the 
row averages y;.. or the column averages y,;. However, when interaction is significant, 
comparisons between the means of one factor (say, A) may be obscured by the AB interaction. 
In this case, we could apply a procedure such as Fisher’s LSD method to the means of factor 
A, with factor B set at a particular level. 


Minitab Output 

Table 14-7 shows some of the output from the Minitab analysis of variance procedure for the 
aircraft primer paint experiment in Example 14-1. The upper portion of the table gives factor 
name and level information, and the lower portion of the table presents the analysis of vari- 
ance for the adhesion force response. The results are identical to the manual calculations dis- 
played in Table 14-6 apart from rounding. 


14-3.2 Model Adequacy Checking 


Just as in the single-factor experiments discussed in Chapter 13, the residuals from a factorial 
experiment play an important role in assessing model adequacy. The residuals from a 
two-factor factorial are 


Cijk = ijk — Viy- 


That is, the residuals are just the difference between the observations and the corresponding 
cell averages. 


Table 14-7 Analysis of Variance from Minitab for Example 14-1 


ANOVA (Balanced Designs) 


Factor Type Levels Values 
Primer fixed 3 1 2 3 
Method fixed 2 Dip Spray 


Analysis of Variance for Adhesion 


Source DF SS MS F P 
Primer 2 4.5811 2.2906 27.86 0.000 
Method 1 4.9089 4.9089 59.70 0.000 
Primer *Method 2 0.2411 0.1206 1.47 0.269 
Error 12 0.9867 0.0822 


Total 17 10.7178 
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Table 14-8 Residuals for the Aircraft Primer Experiment in Example 14-1 


Application Method 
Primer Type Dipping Spraying 
1 —0.27, 0.23, 0.03 0.10, —0.40, 0.30 
2 0.30, —0.40, 0.10 —0.27, 0.03, 0.23 
3 —0.03, —0.13, 0.17 0.33, —0.17, —0.17 


Table 14-8 presents the residuals for the aircraft primer paint data in Example 14-1. 
The normal probability plot of these residuals is shown in Fig. 14-11. This plot has tails 
that do not fall exactly along a straight line passing through the center of the plot, indicat- 
ing some potential problems with the normality assumption, but the deviation from nor- 
mality does not appear severe. Figures 14-12 and 14-13 plot the residuals versus the levels 
of primer types and application methods, respectively. There is some indication that primer 
type 3 results in slightly lower variability in adhesion force than the other two primers. The 
graph of residuals versus fitted values in Fig. 14-14 does not reveal any unusual or diag- 
nostic pattern. 
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In some cases involving a two-factor factorial experiment, we may have only one replicate—that 
is, only one observation per cell. In this situation, there are exactly as many parameters in the 
analysis of variance model as observations, and the error degrees of freedom are zero. Thus, we 
cannot test hypotheses about the main effects and interactions unless some additional 
assumptions are made. One possible assumption is to assume the interaction effect is negligi- 
ble and use the interaction mean square as an error mean square. Thus, the analysis is equivalent 
to the analysis used in the randomized block design. This no-interaction assumption can be dan- 
gerous, and the experimenter should carefully examine the data and the residuals for indications 
as to whether or not interaction is present. For more details, see Montgomery (2009). 


EXERCISES FOR SECTION 14-3 


14-1. An article in Industrial Quality Control (1956, pp. 
5-8) describes an experiment to investigate the effect of two 
factors (glass type and phosphor type) on the brightness of a 
television tube. The response variable measured is the current 
(in microamps) necessary to obtain a specified brightness 
level. The data are shown in the following table: 

(a) State the hypotheses of interest in this experiment. 

(b) Test the above hypotheses and draw conclusions using the 

analysis of variance with a = 0.05. 
(c) Analyze the residuals from this experiment. 


Glass Phosphor Type 
Type 1 2, 3 
1 280 300 290 
290 310 285 
285 295 290 
2 230 260 220 
235) 240 225 
240 235 230 


14-2. An engineer suspects that the surface finish of metal 
parts is influenced by the type of paint used and the drying time. 
He selected three drying times—20, 25, and 30 minutes—and 
used two types of paint. Three parts are tested with each combi- 
nation of paint type and drying time. The data are as follows: 


Drying Time (min) 


Paint 20 23) 30 
1 74 73 78 
64 61 85 

50 44 92 

2 92 98 66 
86 73 45 

68 88 85 


(a) State the hypotheses of interest in this experiment. 

(b) Test the above hypotheses and draw conclusions using the 
analysis of variance with a = 0.05. 

(c) Analyze the residuals from this experiment. 

14-3. Inthe book Design and Analysis of Experiments, 7th edi- 

tion (2009, John Wiley & Sons), the results of an experiment in- 

volving a storage battery used in the launching mechanism of a 

shoulder-fired ground-to-air missile were presented. Three mate- 

rial types can be used to make the battery plates. The objective is 

to design a battery that is relatively unaffected by the ambient 

temperature. The output response from the battery is effective life 

in hours. Three temperature levels are selected, and a factorial ex- 

periment with four replicates is run. The data are as follows: 


Temperature (°F) 
Material Low Medium High 

1 130 155 34 40 20 70 

74 180 80 75 82 58 

2 150 188 136 122 25 70 

159 126 106 115 58 45 

3 138 110 174 120 96 104 
168 160 150 139 82 60 


(a) Test the appropriate hypotheses and draw conclusions 
using the analysis of variance with a = 0.05. 

(b) Graphically analyze the interaction. 

(c) Analyze the residuals from this experiment. 

14-4. An experiment was conducted to determine whether 

either firing temperature or furnace position affects the baked 

density of a carbon anode. The data are as follows: 


Temperature (°C) 
Position 800 825 850 
1 570 1063 565 


565 1080 510 


s 


s 


583 1043 590 
2 528 988 526 
547 1026 538 
521 1004 532 


(a) State the hypotheses of interest. 

(b) Test the above hypotheses using the analysis of variance 
with a = 0.05. What are your conclusions? 

(c) Analyze the residuals from this experiment. 

(d) Using Fisher’s LSD method, investigate the differences 
between the mean baked anode density at the three differ- 
ent levels of temperature. Use a = 0.05. 


14-5. An article in Technometrics (“Exact Analysis of 
Means with Unequal Variances” (2002, Vol. 44, pp. 152—160)] 
described the technique of the analysis of means (ANOM) and 
presented the results of an experiment on insulation. Four 
insulation types were tested at three different temperatures. 
The data are as follows: 


Temperature (F) 


Insulation 1 2, 3 

6.6 4 4.5 2) 2.3 0.9 

2.7 6.2 5.5 2:7 5.6 49 

1 6 5 4.8 5.8 DD) 3.4 
3 3.2 3 1.5 1.3 3.3 

Dll 4.1 2S) 2.6 0.5 1.1 

2 5.9 2.5 0.4 3.5 1.7 0.1 
Sal 44 8.9 Wed 2.6 9.9 

3.2 3.2 7 73 11.5 10.5 

3 5.3 9.7 8 DD, 3.4 6.7 
7 8.9 12 9.7 8.3 8 

V3 9 8.5 10.8 10.4 oF 

4 8.6 11.3 79 7.3 10.6 TA 


(a) Write down a model for this experiment. 

(b) Test the appropriate hypotheses and draw conclusions 
using the analysis of variance with a = 0.05. 

(c) Graphically analyze the interaction. 

(d) Analyze the residuals from the experiment. 

(e) Use Fisher’s LSD method to investigate the differences 
between mean effects of insulation type. Use a = 0.05. 


14-6. Johnson and Leone (Statistics and Experimental 
Design in Engineering and the Physical Sciences, John 
Wiley, 1977) described an experiment conducted to investi- 
gate warping of copper plates. The two factors studied were 
temperature and the copper content of the plates. The re- 
sponse variable is the amount of warping. The data are as 
follows: 
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Temperature Copper Content (%) 
CC) 40 60 80 100 
50 17, 20 16, 21 24, 22 28, 27 
75 12,9 18, 13 17, 12 27, 31 
100 16, 12 18, 21 25, 23 30, 23 
125 21,17 23, 21 23, 22 29, 31 


(a) Is there any indication that either factor affects the amount 
of warping? Is there any interaction between the factors? 
Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Plot the average warping at each level of copper content 
and compare the levels using Fisher’s LSD method. 
Describe the differences in the effects of the different 
levels of copper content on warping. If low warping is 
desirable, what level of copper content would you specify? 

(d) Suppose that temperature cannot be easily controlled in 
the environment in which the copper plates are to be used. 
Does this change your answer for part (c)? 


14-7. Anarticle in the ZEEE Transactions on Electron Devices 
(November 1986, p. 1754) describes a study on the effects of 
two variables—polysilicon doping and anneal conditions (time 
and temperature)—on the base current of a bipolar transistor. 
The data from this experiment follows below. 

(a) Is there any evidence to support the claim that either 
polysilicon doping level or anneal conditions affect base 
current? Do these variables interact? Use a = 0.05. 

(b) Graphically analyze the interaction. 

(c) Analyze the residuals from this experiment. 

(d) Use Fisher’s LSD method to isolate the effects of anneal 
conditions on base current, with a = 0.05. 

14-8. An article in the Journal of Testing and Evaluation 

(1988, Vol. 16, pp. 508-515) investigated the effects of cyclic 

loading frequency and environment conditions on fatigue crack 

growth at a constant 22 MPa stress for a particular material. 


Environment 

Air H,O Salt H,O 
2.29 2.06 1.90 
10 2.47 2.05 1.93 
2.48 2.23 1.75 
2.12 2.03 2.06 
2.65 3.20 3.10 
2.68 3.18 3.24 
prequeney. Li!) ge ||| 306 3.98 
2.38 3.64 3.24 
2.24 11.00 9.96 
01 2.71 11.00 10.01 
: 2.81 9.06 9.36 
2.08 11.30 10.40 


s 
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The data from the experiment follow. The response variable is 

fatigue crack growth rate. 

(a) Is there indication that either factor affects crack growth 
rate? Is there any indication of interaction? Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Repeat the analysis in part (a) using /n(y) as the response. 
Analyze the residuals from this new response variable and 
comment on the results. 


14-9. Consider a two-factor factorial experiment. Develop a 
formula for finding a 100(1 — a)% confidence interval on 
the difference between any two means for either a row or 
column factor. Apply this formula to find a 95% CI on the 
difference in mean warping at the levels of copper content 60 
and 80% in Exercise 14-6. 


Anneal (temperature/time) 


900 900 950 1000 1000 
60 180 60 15 30 

1x 19% | 4:40 8.30 10.15 10.29 11.01 

Polysilicon 4.60 8.90 10.20 | 10.30 10.58 
doping > x 192 | 3:20 7.81 9.38 10.19 10.81 

3.50 7.75 10.02 | 10.10 10.60 


14-4 GENERAL FACTORIAL EXPERIMENTS 


Many experiments involve more than two factors. In this section we introduce the case where 
there are a levels of factor A, b levels of factor B, c levels of factor C, and so on, arranged 
in a factorial experiment. In general, there will be abc... total observations, if there are 
n replicates of the complete experiment. 

For example, consider the three-factor-factorial experiment, with underlying model 


Yon = eee er Bp Be rye + (BY 


$2 Donat 
=e 

+ (tBy die + ei) = 9 (-10) 
P22 cman 


> 


Notice that the model contains three main effects, three two-factor interactions, a three-factor 
interaction, and an error term. Assuming that A, B, and C are fixed factors, the analysis of vari- 
ance is shown in Table 14-9. Note that there must be at least two replicates (n = 2) to compute 
an error sum of squares. The F-test on main effects and interactions follows directly from 
the expected mean squares. These ratios follow F distributions under the respective null 


hypotheses. 


EXAMPLE 14-2 Surface Roughness 
A mechanical engineer is studying the surface roughness of a 
part produced in a metal-cutting operation. Three factors, feed 
rate (A), depth of cut (B), and tool angle (C), are of interest. 
All three factors have been assigned two levels, and two 
replicates of a factorial design are run. The coded data are 
shown in Table 14-10. 

The ANOVA is summarized in Table 14-11. Since 
manual ANOVA computations are tedious for three-factor ex- 


periments, we have used Minitab for the solution of this prob- 
lem. The F-ratios for all three main effects and the interac- 
tions are formed by dividing the mean square for the effect of 
interest by the error mean square. Since the experimenter has 
selected a = 0.05, the critical value for each of these 
F-ratios is fos, = 5.32. Alternately, we could use the P- 
value approach. The P-values for all the test statistics are 
shown in the last column of Table 14-11. Inspection of these 


— 


i) 
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Table 14-9 Analysis of Variance Table for the Three-Factor Fixed Effects Model 
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Source of Sum of Degrees of Expected 
Variation Squares Freedom Mean Square Mean Squares Fo 
A SS 1 MS Bae a ae 
se, Oo gesey aoe 
a = . a | MS; 
B SS b=1 MS, a+ len ae 
? e b-1 MS; 
abn di. MSc 
C SS, = il MS, as 
¢ f c a= MS; 
cn>X(7B)j MS xB 
AB SS —1\(b-1 MS Pio oe sae 
AB (a \( ) AB g (a — 1)(b - 1) MS; 
bn=X(ry) ik MSac 
AC SS, ict MS. eae — 
AC (a \le ) AC (op = MS; 
an=X(By) ji MSpc 
BC SS b= 1le=1 MS, ee ae 
BC vow y Bc oO Ga ieot) MS; 
n==X(sBy)ix MSapc 
ABC SS Hee t MS. - a 
‘ABC (a \( Mle ) ABC o Go ie=ae=s) MS; 
Error SS abc(n — 1) MS_ o 
Total SSp abcn — 1 


P-values is revealing. There is a strong main effect of feed 
rate, since the F-ratio is well into the critical region. However, 


there is some indication of an effect due to the depth of cut, 


since P = 0.0710 is not much greater than a = 0.05. The next 
largest effect is the AB or feed rate < depth of cut interaction. 


Most likely, both feed rate and depth of cut are important 
process variables. 

Practical Interpretation: Further experiments might study 
the important factors in more detail to improve the surface 
roughness. 


Obviously, factorial experiments with three or more factors can require many runs, par- 
ticularly if some of the factors have several (more than two) levels. This point of view leads 
us to the class of factorial designs considered in Section 14-5 with all factors at two levels. 
These designs are easy to set up and analyze, and they may be used as the basis of many other 
useful experimental designs. 


Table 14-10 Coded Surface Roughness Data for Example 14-2 


Depth of Cut (B) 
0.025 inch 0.040 inch 
Feed Rate Tool Angle (C) Tool Angle (C) 
(A) 159° DEF Se 25y Th ooc 
9 11 9 10 
20 inches per minute 7 10 11 8 75 
10 10 12 16 
30 inches per minute 12 13 15 14 102 
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Minitab ANOVA for Example 14-2 


ANOVA (Balanced Designs) 


Factor Type Levels 
Feed fixed 2 
Depth fixed 2 
Angle fixed 2 


Analysis of Variance for Roughness 


we) 
a 


Source 

Feed 

Depth 

Angle 

Feed*Depth 

Feed* Angle 
Depth*Angle 
Feed*Depth* Angle 
Error 

Total 


OR RP ee eee 


— 
Nn 


Values 
20 30 
0.025 0.040 
15 25 
SS MS F P 
45.563 45.563 18.69 0.003 
10.563 10.563 4.33 0.071 
3.063 3.063 1.26 0.295 
7.563 7.563 3.10 0.116 
0.062 0.062 0.03 0.877 
1.563 1.563 0.64 0.446 
5.062 5.062 2.08 0.188 
19.500 2.437 
92.938 


EXERCISES FOR SECTION 14-4 


14-10. The quality control department of a fabric finishing 
plant is studying the effects of several factors on dyeing for a 
blended cotton/synthetic cloth used to manufacture shirts. 
Three operators, three cycle times, and two temperatures were 
selected, and three small specimens of cloth were dyed under 
each set of conditions. The finished cloth was compared to a 
standard, and a numerical score was assigned. The results are 
shown in the following table. 


(a) State and test the appropriate hypotheses using the analysis 
of variance with a = 0.05. 

(b) The residuals may be obtained from ey; = yijur — Vije.. 
Graphically analyze the residuals from this experiment. 

14-11. The percentage of hardwood concentration in raw 

pulp, the freeness, and the cooking time of the pulp are being 

investigated for their effects on the strength of paper. The data 


Temperature 
300° 350° 
Operator Operator 

Cycle Time 1 2 3 1 2 3 
23 Pil 31 24 38 34 

40 24 28 32 23 36 36 
DS 26 28 28 35 39 

36 34 33 37 34 34 

50 35 38 34 3) 38 36 
36 39 35 35 36 31 

28 35 26 26 36 28 

60 24 35 27 29 37 26 
Dil 34 DD) 25 34 34 
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from a three-factor factorial experiment are shown in the fol- (b) Find P-values for the F-ratios in part (a). _ 

lowing table. (c) The residuals are found by @iki ~ Yijkt ~ Yijk-- Graphically 
am) (a) Analyze the data using the analysis of variance assuming analyze the residuals from this experiment. 
SS that all factors are fixed. Use a = 0.05. 


Cooking Time 1.5 hours Cooking Time 2.0 hours 
Percentage 
of Hardwood Freeness Freeness 
Concentration 350 500 650 350 500 650 
10 96.6 97.7 99.4 98.4 99.6 100.6 
96.0 96.0 99.8 98.6 100.4 100.9 
15 98.5 96.0 98.4 97.5 98.7 99.6 
97.2 96.9 97.6 98.1 96.0 99.0 
20 97.5 95.6 97.4 97.6 97.0 98.5 
96.6 96.2 98.1 98.4 97.8 99.8 


14-5 2* FACTORIAL DESIGNS 


14-5.1 2” Design 


Factorial designs are frequently used in experiments involving several factors where it is 
necessary to study the joint effect of the factors on a response. However, several special 
cases of the general factorial design are important because they are widely employed 
in research work and because they form the basis of other designs of considerable practical 
value. 

The most important of these special cases is that of k factors, each at only two levels. 
These levels may be quantitative, such as two values of temperature, pressure, or time; or 
they may be qualitative, such as two machines, two operators, the “high” and “low” lev- 
els of a factor, or perhaps the presence and absence of a factor. A complete replicate of 
such a design requires 2 X 2 X -- + X 2 = 2* observations and is called a 2* factorial 
design. 

The 2‘ design is particularly useful in the early stages of experimental work, when many 
factors are likely to be investigated. It provides the smallest number of runs for which k fac- 
tors can be studied in a complete factorial design. Because there are only two levels for each 
factor, we must assume that the response is approximately linear over the range of the factor 
levels chosen. 


The simplest type of 2‘ design is the 2*—that is, two factors A and B, each at two levels. We 
usually think of these levels as the low and high levels of the factor. The 2” design is shown in 
Fig. 14-15. Note that the design can be represented geometrically as a square with the 2” = 4 
runs, or treatment combinations, forming the corners of the square. In the 2? design it is cus- 
tomary to denote the low and high levels of the factors A and B by the signs — and +, respec- 
tively. This is sometimes called the geometric notation for the design. 

A special notation is used to label the treatment combinations. In general, a treatment 
combination is represented by a series of lowercase letters. If a letter is present, the corre- 
sponding factor is run at the high level in that treatment combination; if it is absent, the factor 
is run at its low level. For example, treatment combination a indicates that factor A is at the 
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Main Effect 
of Factor A: 
2? Design 


Main Effect 
of Factor B: 
2? Design 


Interaction 
Effect AB: 
2? Design 


High 
(+) |b ab 
Treatment A B 
Q): =< 
B a aos 
6 - + 
ab + + 
Low }(1) a 
© Low A High 


(-) (+) 
Figure 14-15 The 2? factorial design. 


high level and factor B is at the low level. The treatment combination with both factors at the 
low level is represented by (1). This notation is used throughout the 2‘ design series. For ex- 
ample, the treatment combination in a 2‘ with A and C at the high level and B and D at the low 
level is denoted by ac. 

The effects of interest in the 2” design are the main effects A and B and the two-factor in- 
teraction AB. Let the letters (1), a, b, and ab also represent the totals of all n observations taken 
at these design points. It is easy to estimate the effects of these factors. To estimate the main 
effect of A, we would average the observations on the right side of the square in Fig. 14-15 
where A is at the high level, and subtract from this the average of the observations on the left 
side of the square, where A is at the low level, or 


Ss Se Se me ea lesb ea =o (0) (14-11) 


Similarly, the main effect of B is found by averaging the observations on the top of the square, 
where B is at the high level, and subtracting the average of the observations on the bottom of 
the square, where B is at the low level: 


BS vax aa = 5 [b + ab -—a— (1)] (14-12) 


Finally, the AB interaction is estimated by taking the difference in the diagonal averages 
in Fig. 14-15, or 


nS [ab + (1) —a-}] (14-13) 


Relationship 
Between a 
Contrast and 
an Effect 


Sum of Squares 
for an Effect 
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Table 14-12 Signs for Effects in the 2” Design 


ieatanent Factorial Effect 
Combination I A B AB 
(1) i = = : 
i 4 4 _ _ 
b als = oily — 
ab + + + + 


The quantities in brackets in Equations 14-11, 14-12, and 14-13 are called contrasts. For 
example, the A contrast is 


Contrast, = a + ab — b — (1) 


In these equations, the contrast coefficients are always either +1 or —1. A table of plus and 
minus signs, such as Table 14-12, can be used to determine the sign on each treatment 
combination for a particular contrast. The column headings for Table 14-12 are the main ef- 
fects A and B, the AB interaction, and J, which represents the total. The row headings are the 
treatment combinations. Note that the signs in the AB column are the product of signs from 
columns A and B. To generate a contrast from this table, multiply the signs in the appropriate 
column of Table 14-12 by the treatment combinations listed in the rows and add. For example, 
contrast,z = [(1)] + [—a] + [-b] + [ab] = ab + (1)-—a—b. 

Contrasts are used in calculating both the effect estimates and the sums of squares for A, 
B, and the AB interaction. For any 2* design with n replicates, the effect estimates are com- 
puted from 


Contrast 
Effect = ei (14-14) 
and the sum of squares for any effect is 
(Contrast)? 
SS = S-. (14-15) 


There is one degree of freedom associated with each effect (two levels minus one) so 
that the mean square error of each effect equals the sum of squares. The analysis of variance 
is completed by computing the total sum of squares SS (with 4n — 1 degrees of freedom) 
as usual, and obtaining the error sum of squares SS; (with 4(n — 1) degrees of freedom) by 
subtraction. 
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EXAMPLE 14-3 Wafer Layer 

An article in the AT&T Technical Journal (Vol. 65, 
March/April 1986, pp. 39-50) describes the application of 
two-level factorial designs to integrated circuit manufacturing. 
A basic processing step in this industry is to grow an epitaxial 
layer on polished silicon wafers. The wafers are mounted on a 
susceptor and positioned inside a bell jar. Chemical vapors are 
introduced through nozzles near the top of the jar. The suscep- 
tor is rotated, and heat is applied. These conditions are main- 
tained until the epitaxial layer is thick enough. 

Table 14-13 presents the results of a 2° factorial design 
with n = 4 replicates using the factors A = deposition time 
and B = arsenic flow rate. The two levels of deposition time 
are — =short and + =long, and the two levels of arsenic flow 
rate are — =55% and + =59%. The response variable is epi- 
taxial layer thickness (um). We may find the estimates of the 
effects using Equations 14-11, 14-12, and 14-13 as follows: 


A= : [a + ab — b— (1)] 
2n 
1 
gaye + 59.156 — 55.686 — 56.081] = 0.836 
1 
B= [b + ab—a-—(l)] 
2n 
1 
=p eee + 59.156 — 59.299 — 56.081] 
= —0.067 
1 
AB ee [ab + (1) -—a-—)] 
1 
AB = 4) [59.156 + 56.081 — 59.299 — 55.686] 
= 0.032 


Models and Residual Analysis 


CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


The numerical estimates of the effects indicate that the effect 
of deposition time is large and has a positive direction (in- 
creasing deposition time increases thickness), since changing 
deposition time from low to high changes the mean epitaxial 
layer thickness by 0.836 jm. The effects of arsenic flow rate 
(B) and the AB interaction appear small. 

The importance of these effects may be confirmed with 
the analysis of variance. The sums of squares for A, B, and AB 
are computed as follows: 


[atab—b-—(1)P [6.688] 


SS, = 6 = 46 = 2.7956 
[b+ab-—a-(1)}? [-0.538]7 
SSp 16 e 0.0181 
[ab+(1)-—a—b}? [0.252]? 
SSaz = 7G = —F6~ = 0.0040 


SSp = 14.037? + ++ + 14.932? 


(56.081 + --- + 59.156)? 
16 


= 3.0672 


Practical Interpretation: The analysis of variance is sum- 
marized in Table 14-14 and confirms our conclusions obtained 
by examining the magnitude and direction of the effects. 
Deposition time is the only factor that significantly affects 
epitaxial layer thickness, and from the direction of the effect 
estimates we know that longer deposition times lead to thicker 
epitaxial layers. 


It is easy to obtain a model for the response and residuals from a 2* design by fitting a regres- 
sion model to the data. For the epitaxial process experiment, the regression model is 


Y= £8, + B x, +e 


Table 14-13. The 2? Design for the Epitaxial Process Experiment 


Gireatment Design Factors Thickness (wm) 
Combination A B AB Thickness (1m) Total Average 
(1) = = al 14.037 14.165 13972 13.907 56.081 14.020 
a fe = = 14.821 14.757 14.843 14.878 59.299 14.825 
b = at = 13.880 13.860 14.032 13.914 55.686 13920) 
ab 14.888 14.921 14.415 14.932 59.156 14.789 


Coefficient 
and Effect 


Standard 
Error of a 
Coefficient 


t-statistic for a 
Coefficient 
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Table 14-14 Analysis of Variance for the Epitaxial Process Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square ho P-Value 
A (deposition time) 2.7956 1 2.7956 134.40 7.07 E-8 
B (arsenic flow) 0.0181 1 0.0181 0.87 0.38 
AB 0.0040 1 0.0040 0.19 0.67 
Error 0.2495 12 0.0208 
Total 3.0672 15 


since the only active variable is deposition time, which is represented by a coded variable x,. 
The low and high levels of deposition time are assigned values x; = —1 and x, = +1, 
respectively. The least squares fitted model is 


0.836 
} = 14.389 + (23) x 


where the intercept Sy is the grand average of all 16 observations (y) and the slope , is one- 
half the effect estimate for deposition time. The regression coefficient is one-half the effect es- 
timate because regression coefficients measure the effect of a unit change in x, on the mean of 
Y, and the effect estimate is based on a two-unit change from —1 to +1. 

A coefficient relates a factor to the response and, similar to regression analysis, interest 
centers on whether or not a coefficient estimate is significantly different from zero. A f-test for 
a coefficient can also be used to test the significance of an effect. Each effect estimate in 
Equations 14-11 through 14-13 is the difference between two averages (that we denote in gen- 
eral as y, — y_). Ina 2* experiment with n replicates, half the observations appear in each av- 
erage so that there are n2‘~ ! observations in each average. The associated coefficient estimate, 
say B, equals half the associated effect estimate so that 


~ effect y,—y_ 
= = 14-1 
tS 5 (14-16) 


The standard error of 8 equals half the standard error of the effect and an effect is simply the 
difference between two averages. Therefore, 


ie © an 1 1 
standard error 6 = ae += o/s (14-17) 


where G is estimated from the square root of mean square error. The t-statistic to test Hp: B = 0 
in a 2‘ experiment is 


pb Ga 
standard error B 1 


f= (14-18) 
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with degrees of freedom equal to those associated with mean square error. This statistic is sim- 
ilar to a two-sample t-test, but o is estimated from root mean square error. The estimate G ac- 
counts for the multiple treatments in an experiment and generally differs from the estimate 
used in a two-sample t-test. 

Some algebra can be used to show that for a 2* experiment the square of the f-statistic for 
the coefficient test equals the F-statistic used for the effect test in the analysis of variance. 
Also, the square of a t random variable with d degrees of freedom is an F random variable with | 
numerator and d denominator degrees of freedom. Thus, the test that compares the absolute 
value of the t-statistic to the ¢ distribution is equivalent to the F-test, and either method may be 
used to test an effect. 

This model can also be used to obtain the predicted values at the four points that form the 
corners of the square in the design. For example, consider the point with low deposition time 
(x, = —1) and low arsenic flow rate. The predicted value is 


0.836 
$ = 14.389 + (O82) —1 = 13.971 pm 


and the residuals for the four runs at that design point are 


e, = 14.037 — 13.971 = 0.066 


ey = 14.165 — 13.971 = 0.194 
e; = 13.972 — 13.971 = 0.001 
é, = 13.907 — 13.971 = —0.064 


The remaining predicted values and residuals at the other three design points are calculated in 
a similar manner. 

A normal probability plot of these residuals is shown in Fig. 14-16. This plot indicates 
that one residual e,; = —0.392 is an outlier. Examining the four runs with high deposition 
time and high arsenic flow rate reveals that observation y,,; = 14.415 is considerably smaller 


99 


95 
90 


80 
70 


60 


Normal probability 


-0.392 -0.294 -0.197 -0.099 -0.001 0.096 0.194 
Residual 


Figure 14-16 Normal probability plot of residuals for the 
epitaxial process experiment. 
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3 Low e High Deposition time, A $ Low ; High Arsenic flow rate, B 
6 e 
3 
e 
e 
-0.5 -0.5 
Figure 14-17 Plot of residuals versus deposition time. Figure 14-18 Plot of residuals versus arsenic flow 
rate. 


than the other three observations at that treatment combination. This adds some additional 
evidence to the tentative conclusion that observation 15 is an outlier. Another possibility is 
that some process variables affect the variability in epitaxial layer thickness. If we could 
discover which variables produce this effect, we could perhaps adjust these variables to levels 
that would minimize the variability in epitaxial layer thickness. This could have important im- 
plications in subsequent manufacturing stages. Figures 14-17 and 14-18 are plots of residuals 
versus deposition time and arsenic flow rate, respectively. Apart from that unusually large 
residual associated with y,;, there is no strong evidence that either deposition time or arsenic 
flow rate influences the variability in epitaxial layer thickness. 

Figure 14-19 shows the standard deviation of epitaxial layer thickness at all four runs in 
the 2? design. These standard deviations were calculated using the data in Table 14-13. Notice 
that the standard deviation of the four observations with A and B at the high level is 
considerably larger than the standard deviations at any of the other three design points. Most 
of this difference is attributable to the unusually low thickness measurement associated with 
y,5- The standard deviation of the four observations with A and B at the low level is also some- 
what larger than the standard deviations at the remaining two runs. This could indicate that 
other process variables not included in this experiment may affect the variability in epitaxial 
layer thickness. Another experiment to study this possibility, involving other process variables, 
could be designed and conducted. (The original paper in the AT&T Technical Journal shows 
that two additional factors, not considered in this example, affect process variability.) 


14-5.2 2" Design for k = 3 Factors 


The methods presented in the previous section for factorial designs with k = 2 factors each at 
two levels can be easily extended to more than two factors. For example, consider k = 3 
factors, each at two levels. This design is a 2° factorial design, and it has eight runs or treat- 
ment combinations. Geometrically, the design is a cube as shown in Fig. 14-20(a), with the 
eight runs forming the corners of the cube. Figure 14-20(b) lists the eight runs in a table, with 
each row representing one of the runs and the — and + settings indicating the low and high 
levels for each of the three factors. This table is sometimes called the design matrix. This 
design allows three main effects to be estimated (A, B, and C) along with three two-factor in- 
teractions (AB, AC, and BC) and a three-factor interaction (ABC). 

The main effects can easily be estimated. Remember that the lowercase letters (1), a, b, 
ab, c, ac, bc, and abc represent the total of all n replicates at each of the eight runs in the 
design. As seen in Fig. 14-21(a), the main effect of A can be estimated by averaging the four 
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0.250 
ab 
+ 
Cc 
a 
A 0.051 —_ 
+ (a) Geometric view 


Figure 14-19 The standard deviation Figure 14-20 The 2° design. 
of epitaxial layer thickness at the four 
runs in the 2” design. 


Main effects 


Two-factor interactions 
(b) 


e=+ runs 
B o =-— runs 


ABC 
Three-factor interaction 
(c) 
Figure 14-21 Geometric presentation of contrasts corresponding 


to the main effects and interaction in the 2° design. (a) Main effects. 
(b) Two-factor interactions. (c) Three-factor interaction. 


Run A B Cc 
1 L = = 
2 + - = 
3 - + 
4 + + 
5 - - + 
6 + - + 
7 - + + 
8 + + + 


(b) The 23 design matrix 


Main Effect 
of Factor A: 
23 Design 


Main Effect 
of Factor B: 
23 Design 


Main Effect 
of Factor C: 
23 Design 
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treatment combinations on the right-hand side of the cube, where A is at the high level, and by 
subtracting from this quantity the average of the four treatment combinations on the left-hand 
side of the cube where A is at the low level. This gives 


A= Jay Fa 
atabt+actabe (I) +tbt+c+be 
4n 4n 


This equation can be rearranged as 


A 


Ya+ 
1 
gn [a t+ ab + ac + abc — (1) — b — € — be] 


In a similar manner, the effect of B is the difference in averages between the four treatment 
combinations in the back face of the cube [Fig. 14-19(a)], and the four in the front. This yields 


B= yp — Ye- 
1 


Sg eae bear ac 


The effect of C is the difference in average response between the four treatment combinations 
in the top face of the cube in Figure 14-19(a) and the four in the bottom, that is, 


C= Vou. = Me 


1 
= let de these abe — la = bab) 


The two-factor interaction effects may be computed easily. A measure of the AB interaction 
is the difference between the average A effects at the two levels of B. By convention, one-half 
of this difference is called the AB interaction. Symbolically, 


s Average A Effect 
High (+) [(abc — bc) + (ab — b)] 
2n 
Low (—) {wie =) + [a= UF 
2n 


Difference [abe — be + ab -—b-ac+c-at(I)] 
2n 
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Because the AB interaction is one-half of this difference, 


Two-Factor 
Interaction 1 
Effect: AB = 7 [abe — be + ab — b—ac +c —a + (1)] 
23 Design i‘ 


We could write the AB effect as follows: 


abo +ab+c+t+(1) be+bt+acta 
4n 4n 


AB = 


In this form, the AB interaction is easily seen to be the difference in averages between runs 
on two diagonal planes in the cube in Fig. 14-19(b). Using similar logic and referring to 
Fig. 14-19(b), we find that the AC and BC interactions are 


Two-Factor 
Interaction 1 
Effect: AC = 7 [(1) —a + b—ab—c + ac — be + abc} 
23 Design fe 
BC = 7 [(1) +a — b — ab — c — ac + be + abc] 
n 


The ABC interaction is defined as the average difference between the AB interaction for 
the two different levels of C. Thus, 


ABC = 7 {abe — be| — [ac — c] — [ab — b] + [a — (1)]} 


or 


Three-Factor 


Interaction 1 
Effect: ABC = 7 [abe — be — ac +c — ab + b + a ~ (1)] 
n 


23 Design 


As before, we can think of the ABC interaction as the difference in two averages. If the runs 
in the two averages are isolated, they define the vertices of the two tetrahedra that comprise 
the cube in Fig. 14-21(c). 

In the equations for the effects, the quantities in brackets are contrasts in the treatment 
combinations. A table of plus and minus signs can be developed from the contrasts and is 
shown in Table 14-15. Signs for the main effects are determined directly from the test matrix 
in Figure 14-20(b). Once the signs for the main effect columns have been established, the 
signs for the remaining columns can be obtained by multiplying the appropriate main effect 
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Table 14-15 Algebraic Signs for Calculating Effects in the 2? Design 


Treatment Factorial Effect 
Combination I A B AB C AC BC ABC 


(1) ‘ = 


row by row. For example, the signs in the AB column are the products of the A and B column 
signs in each row. The contrast for any effect can easily be obtained from this table. 
Table 14-15 has several interesting properties: 


1. Except for the identity column /, each column has an equal number of plus and minus 
signs. 

2. The sum of products of signs in any two columns is zero; that is, the columns in the 
table are orthogonal. 

3. Multiplying any column by column / leaves the column unchanged; that is, J is an 
identity element. 

4. The product of any two columns yields a column in the table, for example A X B = 
AB, and AB X ABC = A°B’C = C, since any column multiplied by itself is the 
identity column. 


The estimate of any main effect or interaction in a 2* design is determined by multiplying 
the treatment combinations in the first column of the table by the signs in the corresponding 
main effect or interaction column, by adding the result to produce a contrast, and then by di- 
viding the contrast by one-half the total number of runs in the experiment. 


EXAMPLE 14-4 Surface Roughness 


Consider the surface roughness experiment originally de- and the sum of squares for A is found using Equation 14-15: 
scribed in Example 14-2. This is a 2° factorial design in the 2 2 

factors feed rate (A), depth of cut (B), and tool angle (C), with ss, = (Contrast) = (27) = 45.5625 

n = 2 replicates. Table 14-16 presents the observed surface n2k 2(8) 


roughness data. 


; It is easy to verify that the other effects are 
The effect of A, for example, is 


B = 1.625 
I C = 0.875 
A ri [a+ ab + ac + abc — (1) -—b-c — bc] AB = = 1.375 
” AC = 0125 
1 = 
= 2-208 427 £03 440 = 16 = 20= 21 = 19] Be bees 
4(2) ABG= 135 
= 1 [27] = 3.375 Examining the magnitude of the effects clearly shows that 


feed rate (factor A) is dominant, followed by depth of cut (B) 
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Table 14-16 Surface Roughness Data for Example 14-4 


Design Factors 


Treatment Surface 
Combinations A B Ge Roughness Totals 

(1) il al = Q, 7 16 

a 1 =1 =] 10, 12 22 

b il 1 el 9,11 20 
ab 1 1 = 12,15 27 

c il al 1 11, 10 21 
ac 1 =1 1 10, 13 23 
be =il 1 1 10, 8 18 
abc 1 1 1 16, 14 30 


and the AB interaction, although the interaction effect is rela- 
tively small. The analysis of variance, summarized in Table 
14-17, confirms our interpretation of the effect estimates. 

Minitab will analyze 2“ factorial designs. The output 
from the Minitab DOE (Design of Experiments) module for 
this experiment is shown in Table 14-18. The upper portion of 
the table displays the effect estimates and regression coeffi- 
cients for each factorial effect. However, the t-statistic com- 
puted from Equation 14-18 is reported for each effect instead 
of the F-statistic used in Table 14-17. To illustrate, for the 
main effect of feed Minitab reports tf = 4.32 (with eight 
degrees of freedom), and t* = (4.32)* = 18.66, which is 
approximately equal to the F-ratio for feed reported in Table 
14-17 (F = 18.69). This F-ratio has one numerator and eight 
denominator degrees of freedom. 

The lower panel of the Minitab output in Table 14-18 is 
an analysis of variance summary focusing on the types of 
terms in the model. A regression model approach is used in the 
presentation. You might find it helpful to review Section 12-2.2, 
particularly the material on the partial F-test. The row entitled 
“main effects” under source refers to the three main 
effects feed, depth, and angle, each having a single degree of 


freedom, giving the total 3 in the column headed “DE” The 
column headed “Seq SS” (an abbreviation for sequential sum 
of squares) reports how much the model sum of squares in- 
creases when each group of terms is added to a model that 
contains the terms listed above the groups. The first number in 
the “Seq SS” column presents the model sum of squares for 
fitting a model having only the three main effects. The row la- 
beled “‘2-Way Interactions” refers to AB, AC, and BC, and the 
sequential sum of squares reported here is the increase in the 
model sum of squares if the interaction terms are added to a 
model containing only the main effects. Similarly, the sequen- 
tial sum of squares for the three-way interaction is the increase 
in the model sum of squares that results from adding the term 
ABC to a model containing all other effects. 

The column headed “Adj SS” (an abbreviation for ad- 
justed sum of squares) reports how much the model sum of 
squares increases when each group of terms is added to a 
model that contains all the other terms. Now since any 2* de- 
sign with an equal number of replicates in each cell is an 
orthogonal design, the adjusted sum of squares will equal the se- 
quential sum of squares. Therefore, the F-tests for each row in 
the Minitab analysis of variance table are testing the significance 


Table 14-17 Analysis of Variance for the Surface Finish Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square to P-Value 
A 45.5625 1 45.5625 18.69 0.0025 
B 10.5625 1 10.5625 4.33 0.0709 
Cc 3.0625 1 3.0625 1.26 0.2948 
AB 7.5625 1 7.5625 3.10 0.1162 
AC 0.0625 1 0.0625 0.03 0.8784 
BC 1.5625 1 1.5625 0.64 0.4548 
ABC 5.0625 1 5.0625 2.08 0.1875 
Error 19.5000 8 2.4375 

Total 92.9375 15 


Table 14-18 Minitab Analysis for Example 14-4 
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Estimated Effects and Coefficients for Roughness 


Term Effect 
Constant 

Feed 3.3750 
Depth 1.6250 
Angle 0.8750 
Feed*Depth 1.3750 
Feed* Angle 0.1250 
Depth*Angle —0.6250 
Feed*Depth* Angle 1.1250 


Analysis of Variance for Roughness 


Source DF 
Main Effects 3 
2-Way Interactions 3 
3-Way Interactions 1 
Residual Error 8 

Pure Error 8 
Total 15 


Coef 


11.0625 


1.6875 
0.8125 
0.4375 
0.6875 
0.0625 


—0.3125 


Seq SS 
59.188 

9.187 

5.062 
19.500 
19.500 
92.938 


0.5625 


Adj SS 
59.188 
9.187 
5.062 
19.500 
19.500 


StDev Coef 
0.3903 
0.3903 
0.3903 
0.3903 
0.3903 
0.3903 
0.3903 
0.3903 


Adj MS 
19.729 
3.062 
5.062 
2.437 
2.437 


T P 
28.34 0.000 
4.32 0.003 
2.08 0.071 
1.12 0.295 
1.76 0.116 
0.16 0.877 
—0.80 0.446 
1.44 0.188 
F P 
8.09 0.008 
1.26 0.352 
2.08 0.188 


of each group of terms (main effects, two-factor interactions, 
and three-factor interactions) as if they were the last terms to 
be included in the model. Clearly, only the main effect terms are 
significant. The t-tests on the individual factor effects indicate 


Models and Residual Analysis 


that feed rate and depth of cut have large main effects, and 
there may be some mild interaction between these two factors. 
Therefore, the Minitab output is in agreement with the results 
given previously. 


We may obtain the residuals from a 2‘ design by using the method demonstrated earlier for 
the 2? design. As an example, consider the surface roughness experiment. The three largest 
effects are A, B, and the AB interaction. The regression model used to obtain the predicted 


values is 


Y = Bo + Byxy + Box + Byoxpx + € 


where x, represents factor A, x represents factor B, and x,x, represents the AB interaction. The 
regression coefficients B,, B2, and B,, are estimated by one-half the corresponding effect esti- 


mates, and By is the grand average 


SS 
II 


11.0625 + ( 


. Thus, 


3.375 1.625 
ae, xy + a X2 + 


11.0625 + 1.6875x, + 0.8125x) + 0.6875x,x» 


1.375 


xen 


Note that the regression coefficients are presented by Minitab in the upper panel of Table 14-18. 
The predicted values would be obtained by substituting the low and high levels of A and B into 
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Figure 14-22 
Normal probability 
plot of residuals from 
the surface roughness 
experiment. 


Normal probability 


-2.250 -1.653 -0.917 -0.250 0.417 1.083 1.750 
Residual 


this equation. To illustrate this, at the treatment combination where A, B, and C are all at the low 
level, the predicted value is 


} = 11.0625 + 1.6875(—1) + 0.8125(—1) + 0.6875(—1)(—1) = 9.25 


Since the observed values at this run are 9 and 7, the residuals are 9 — 9.25 = —0.25 and 
7 — 9.25 = —2.25. Residuals for the other 14 runs are obtained similarly. 

A normal probability plot of the residuals is shown in Fig. 14-22. Since the residuals lie 
approximately along a straight line, we do not suspect any problem with normality in the data. 
There are no indications of severe outliers. It would also be helpful to plot the residuals 
versus the predicted values and against each of the factors A, B, and C. 


Projection of 2 Designs 

Any 2* design will collapse or project into another 2‘ design in fewer variables if one or more of 
the original factors are dropped. Sometimes this can provide additional insight into the 
remaining factors. For example, consider the surface roughness experiment. Since factor C and 
all its interactions are negligible, we could eliminate factor C from the design. The result is to 
collapse the cube in Fig. 14-20 into a square in the A — B plane; therefore, each of the four runs 
in the new design has four replicates. In general, if we delete h factors so that r = k — h factors 
remain, the original 2* design with n replicates will project into a 2’ design with n2" replicates. 


14-5.3 Single Replicate of the 2 Design 


As the number of factors in a factorial experiment grows, the number of effects that can be 
estimated also grows. For example, a 2* experiment has 4 main effects, 6 two-factor 
interactions, 4 three-factor interactions, and 1 four-factor interaction, while a 2° experiment 
has 6 main effects, 15 two-factor interactions, 20 three-factor interactions, 15 four-factor 
interactions, 6 five-factor interactions, and 1 six-factor interaction. In most situations the 
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sparsity of effects principle applies; that is, the system is usually dominated by the main ef- 
fects and low-order interactions. The three-factor and higher order interactions are usually 
negligible. Therefore, when the number of factors is moderately large, say, k = 4 or 5, a com- 
mon practice is to run only a single replicate of the 2" design and then pool or combine the 
higher order interactions as an estimate of error. Sometimes a single replicate of a 2" design is 
called an unreplicated 2‘ factorial design. 

When analyzing data from unreplicated factorial designs, occasionally real high-order 
interactions occur. The use of an error mean square obtained by pooling high-order interactions 
is inappropriate in these cases. A simple method of analysis can be used to overcome this prob- 
lem. Construct a plot of the estimates of the effects on a normal probability scale. The effects 
that are negligible are normally distributed, with mean zero and variance o” and will tend to fall 
along a straight line on this plot, whereas significant effects will have nonzero means and will 


not lie along the straight line. We will illustrate this method in the next example. 


EXAMPLE 14-5 Plasma Etch 

An article in Solid State Technology [“Orthogonal Design for 
Process Optimization and Its Application in Plasma Etching” 
(May 1987, pp. 127—132)] describes the application of facto- 
rial designs in developing a nitride etch process on a single- 
wafer plasma etcher. The process uses C,F, as the reactant 
gas. It is possible to vary the gas flow, the power applied to the 
cathode, the pressure in the reactor chamber, and the spacing 
between the anode and the cathode (gap). Several response 


variables would usually be of interest in this process, but 
in this example we will concentrate on etch rate for silicon 
nitride. 

We will use a single replicate of a 2* design to investigate 
this process. Since it is unlikely that the three- and four-factor 
interactions are significant, we will tentatively plan to com- 
bine them as an estimate of error. The factor levels used in the 
design are shown below: 


Design Factor 


Gap Pressure C.F, Flow Power 
Level (cm) (mTorr) (SCCM) (w) 
Low (—) 0.80 450 125 DUIS) 
High (+) 1.20 550 200 325 


Table 14-19 presents the data from the 16 runs of the 2* design. 
Table 14-20 is the table of plus and minus signs for the 24 de- 
sign. The signs in the columns of this table can be used to esti- 
mate the factor effects. For example, the estimate of factor A is 


A [a + ab + ac + abc + ad + abd + acd 


abcd — (1) —~b —c — be — d — bd — cd — bcd] 


1 
= — [669 + 650 + 642 + 635 + 749 + 868 + 860 


8 
729 — 550 — 604 — 633 — 601 — 1037 
— 1052 — 1075 — 1063] 
= —101.625 


Thus, the effect of increasing the gap between the anode and 
the cathode from 0.80 to 1.20 centimeters is to decrease the 
etch rate by 101.625 angstroms per minute. 


It is easy to verify (using Minitab, for example) that the 
complete set of effect estimates is 


A = —101.625 AD = —153.625 
B = —1.625 BD = —0.625 
AB = —7.875 ABD = 4.125 
Cc = 7.375 CD = —2.125 
AC = = —24.875 ACD = 5.625 
BC = —43.875 BCD = —25.375 
ABC = ~—15.625 ABCD = —40.125 
D = 306.125 


The normal probability plot of these effects from the plasma 
etch experiment is shown in Fig. 14-23. Clearly, the main ef- 
fects of A and D and the AD interaction are significant, be- 
cause they fall far from the line passing through the other 
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Table 14-19 The 24 Design for the Plasma Etch Experiment 


A B Cc D Etch Rate 
(Gap) (Pressure) (C,F, Flow) (Power) (A/min) 

-1 =i -1 =il 550 

1 —1 —l —1 669 
=I 1 =i = 604 

1 1 -1 —1 650 
-1 =i 1 =i 633 

1 —1 1 —1 642 
=I 1 1 =I 601 

1 1 1 —1 635 
-1 =i =i 1 1037 

1 -1 —l1 1 7TA9 
=I! 1 =I 1 1052 

1 1 —1 1 868 
=! =Il 1 1 1075 

1 -1 1 1 860 
=I 1 1 1 1063 

1 1 1 1 729 


points. The analysis of variance summarized in Table 14-21 
confirms these findings. Notice that in the analysis of vari- 
ance we have pooled the three- and four-factor interactions 
to form the error mean square. If the normal probability plot 
had indicated that any of these interactions were important, 
they would not have been included in the error term. 
Practical Interpretation: Since A = — 101.625, the ef- 
fect of increasing the gap between the cathode and anode 


Table 14-20 Contrast Constants for the 2+ Design 


is to decrease the etch rate. However, D = 306.125; thus, 
applying higher power levels will increase the etch rate. 
Figure 14-24 is a plot of the AD interaction. This plot indi- 
cates that the effect of changing the gap width at low 
power settings is small, but that increasing the gap at high 
power settings dramatically reduces the etch rate. High 
etch rates are obtained at high power settings and narrow 
gap widths. 


A B AB C AC BC ABC D AD BD ABD CD ACD BCD ABCD 
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Figure 14-23 Normal probability plot of effects Figure 14-24 AD (Gap-Power) interaction from the 
from the plasma etch experiment. plasma etch experiment. 


The residuals from the experiment in Example 14-5 can be obtained from the regression 
model 


101.625 306.125 153.625 
$ = 776.0625 — (a) x, + (2) x4 — (Be) X4X4 


For example, when both A and D are at the low level, the predicted value is 


9 = 776.0625 — (2) (-1 . (2) -n - (2) 1-1 


Table 14-21 Analysis of Variance for the Plasma Etch Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square to P-Value 
A 41,310.563 1 41,310.563 20.28 0.0064 
B 10.563 1 10.563 <1 — 
Cc 217.563 1 217.563 <1 — 
D 374,850.063 1 374,850.063 183.99 0.0000 
AB 248.063 1 248.063 <1 — 
AC 2,475.063 1 2,475.063 1.21 0.3206 
AD 94,402.563 1 94,402.563 46.34 0.0010 
BC 7,700.063 1 7,700.063 3.78 0.1095 
BD 1.563 1 1.563 <1 — 
CD 18.063 1 18.063 <1 —_— 
Error 10,186.813 5 2,037.363 


Total 531,420.938 


— 
Nn 
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99 


Normal probability 


-72.50 -49.33 -26.17 -3.00 20.17. 43.33 66.50 
Residual 
Figure 14-25 Normal probability plot of residuals from the 
plasma etch experiment. 


and the four residuals at this treatment combination are 
ey, = 550 — 597 = —47 é, = 604 — 597 = 7 
e3 = 633 —597 = 36 e, = 601 — 597 = 4 


The residuals at the other three treatment combinations (A high, D low), (A low, D high), and 
(A high, D high) are obtained similarly. A normal probability plot of the residuals is shown in 
Fig. 14-25. The plot is satisfactory. 


14-5.4 Addition of Center Points to a 2‘ Design 


A potential concern in the use of two-level factorial designs is the assumption of linearity in 
the factor effects. Of course, perfect linearity is unnecessary, and the 2‘ system will work quite 
well even when the linearity assumption holds only approximately. However, there is a method 
of replicating certain points in the 2‘ factorial that will provide protection against curvature as 
well as allow an independent estimate of error to be obtained. The method consists of adding 
center points to the 2" design. These consist of n¢ replicates run at the point x; = 0 (i = 1, 
2,...,). One important reason for adding the replicate runs at the design center is that cen- 
ter points do not affect the usual effects estimates in a 2 design. We assume that the k factors 
are quantitative. 

To illustrate the approach, consider a 2? design with one observation at each of the 
factorial points (—, —), (+, —), (—, +), and (+, +) and n¢ observations at the center points 
(0, 0). Figure 14-26 illustrates the situation. Let y, be the average of the four runs at the four 
factorial points, and let yc be the average of the n- run at the center point. If the difference 
Yr — Ycis small, the center points lie on or near the plane passing through the factorial points, 


Curvature Sum 
of Squares 
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+1 


Figure 14-26 A 2? design with center points. 


and there is no curvature. On the other hand, if y- — yc is large, curvature is present. A single- 
degree-of-freedom sum of squares for curvature is given by 


npnc(Yr — Ye) 
Np + Nc 


S S@urvanire 7 


=) Sunes (14-19) 


where, in general, n, is the number of factorial design points. This quantity may be compared 
to the error mean square to test for curvature. Notice that when Equation 14-19 is divided by 
6° = MS,, the result is similar to the square of the f statistic used to compare two means. 

More specifically, when points are added to the center of the 2* design, the model we may 
entertain is 


k k 
Y=Bot+ > Bix +>) > Byxig + > Bit re 
j= i<j j= 


where the 8; are pure quadratic effects. The test for curvature actually tests the hypotheses 


k 

Ho: >» Bij =0 
j=l 
k 

A: > Bi, #0 
= 


Furthermore, if the factorial points in the design are unreplicated, we may use the - center 
points to construct an estimate of error with nc — | degrees of freedom. 
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+1 


40 


A = Reaction time (min) 


Figure 14-27 The 2? design with five center 


points for Example 14-6. 


EXAMPLE 14-6 Process Yield 
A chemical engineer is studying the percentage of conversion or 
yield of a process. There are two variables of interest, reaction 
time and reaction temperature. Because she is uncertain about 
the assumption of linearity over the region of exploration, the en- 
gineer decides to conduct a 2” design (with a single replicate of 
each factorial run) augmented with five center points. The design 
and the yield data are shown in Fig. 14-27. 

Table 14-22 summarizes the analysis of variance for this 
experiment. The mean square error is calculated from the cen- 
ter points as follows: 


(yi — y y 
SSE Center points 7 
MS; = = 
nc — 1 nc — 1 
5 
; — 40.46) 
i=l (y ) 0.1720 
= 4 = = 0.0430 


The average of the points in the factorial portion of the design is 
Yr = 40.425, and the average of the points at the center is 
¥c = 40.46. The difference yr — yo = 40.425 — 40.46 = 
—0.035 appears to be small. The curvature sum of squares in the 
analysis of variance table is computed from Equation 14-19 as 
follows: 


ngnc(Yr — Yo" 


SScurvature _ np + 1c 
4)(5)(—0.035)" 
2uh ae Y = 0.0027 


Practical Interpretation: The analysis of variance indi- 
cates that both factors exhibit significant main effects, that 
there is no interaction, and that there is no evidence of curvature 
in the response over the region of exploration. That is, the null 
hypothesis Ho: a Bj, = 0 cannot be rejected. 


Table 14-22 Analysis of Variance for Example 14-6 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square ho P-Value 
A (Time) 2.4025 1 2.4025 55.87 0.0017 
B (Temperature) 0.4225 1 0.4225 9.83 0.0350 
AB 0.0025 1 0.0025 0.06 0.8237 
Curvature 0.0027 1 0.0027 0.06 0.8163 
Error 0.1720 4 0.0430 
Total 3.0022 8 


i) 
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14-12. An engineer is interested in the effect of cutting 
speed (A), metal hardness (B), and cutting angle (C) on the life 
of a cutting tool. Two levels of each factor are chosen, and two 
replicates of a 2? factorial design are run. The tool life data (in 
hours) are shown in the table at right. 


Treatment Replicate 
Combination I II 
(1) 221 311 
a 325 435 
b 354 348 
ab 552 472 
c 440 453 
ac 406 377 
be 605 500 
abc 392 419 


(a) Analyze the data from this experiment. 

(b) Find an appropriate regression model that explains tool 
life in terms of the variables used in the experiment. 

(c) Analyze the residuals from this experiment. 


14-13. Four factors are thought to influence the taste of a 
soft-drink beverage: type of sweetener (A), ratio of syrup to 
water (B), carbonation level (C), and temperature (D). Each 
factor can be run at two levels, producing a 2‘ design. At each 
run in the design, samples of the beverage are given to a test 


Treatment ___Replicate 
Combination I Il 
(1) 159 163 
a 168 175 
b 158 163 
ab 166 168 
c 175 178 
ac 179 183 
be 173 168 
abc 179 182 
d 164 159 
ad 187 189 
bd 163 159 
abd 185 191 
ed 168 174 
acd 197 199 
bcd 170 174 
abcd 194 198 


panel consisting of 20 people. Each tester assigns the beverage 
a point score from | to 10. Total score is the response variable, 
and the objective is to find a formulation that maximizes total 
score. Two replicates of this design are run, and the results are 
shown in the table. Analyze the data and draw conclusions. 
Use a = 0.05 in the statistical tests. 


14-14. The data shown here represent a single replicate of a 
2° design that is used in an experiment to study the compressive 
strength of concrete. The factors are mix (A), time (B), labora- 
tory (C), temperature (D), and drying time (E). 


(1) = 700 e = 800 
a = 900 ae = 1200 
b = 3400 be = 3500 
ab = 5500 abe = 6200 
c = 600 ce = 600 
ac = 1000 ace = 1200 
be = 3000 bce = 3006 
abc = 5300 abce = 5500 
d = 1000 de = 1900 
ad = 1100 ade = 1500 
bd = 3000 bde = 4000 
abd = 6100 abde = 6500 
cd = 800 cde = 1500 
acd = 1100 acde = 2000 
bcd = 3300 bcde = 3400 
abcd = 6000 abcde = 6800 


(a) Estimate the factor effects. 

(b) Which effects appear important? Use a normal probability 
plot. 

(c) If it is desirable to maximize the strength, in which 
direction would you adjust the process variables? 

(d) Analyze the residuals from this experiment. 


14-15. An article in JEEE Transactions on Semiconduc-tor 
Manufacturing (Vol. 5, 1992, pp. 214-222) describes an ex- 
periment to investigate the surface charge on a silicon wafer. 
The factors thought to influence induced surface charge are 
cleaning method (spin rinse dry or SRD and spin dry or SD) 
and the position on the wafer where the charge was measured. 
The surface charge (X10'! q/cm*) response data are as shown. 


Test Position 


L R 
1.66 1.84 

sD 1.90 1.84 

Cleaning 1.92 1.62 
Method At ~7.58 
SRD] —1.35 ~2,90 

~2.08 —5.36 


= 


i) 
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(a) Estimate the factor effects. 
(b) Which factors appear important? Use a = 0.05. 
(c) Analyze the residuals from this experiment. 


14-16. Anarticle in Oikos: A Journal of Ecology (“Regulation 
of Root Vole Population Dynamics by Food Supply and 
Predation: A Two-Factor Experiment” (2005, Vol. 109, pp. 
387-395)] investigated how food supply interacts with pre- 
dation in the regulation of root vole (Microtus oeconomus 
Pallas) population dynamics. A replicated two-factor field 
experiment manipulating both food supply and predation 
condition for root voles was conducted. Four treatments 
were applied: —P, +F (no-predator, food-supplemented); 
+P, +F (predator-access, food-supplemented); —P, —F 
(no-predator, nonsupplemented); +P, —F (predator-access, 
food-supplemented). The population density of root voles 
(voles ha_') for each treatment combination in each is shown 
below. 


Food Supply Predation 


(F) (P) Replicates 

+1 =I 88.589 114.059 200.979 
+1 +1 56.949 97.079 78.759 
= =I 65.439 89.089 172.339 
—l1 +1 40.799 47.959 74.439 


(a) What is an appropriate statistical model for this experiment? 
(b) Analyze the data and draw conclusions. 
(c) Analyze the residuals from this experiment. Are there any 


(a) Construct a normal probability plot of the effect estimates. 
Which effects appear to be large? 

(b) Conduct an analysis of variance to confirm your findings 
for part (a). 

(c) Construct a normal probability plot of the residuals. Is the 
plot satisfactory? 

(d) Plot the residuals versus the predicted yields and versus 
each of the five factors. Comment on the plots. 

(e) Interpret any significant interactions. 

(f) What are your recommendations regarding process 
operating conditions? 

(g) Project the 2° design in this problem into a 2” for r <5 
design in the important factors. Sketch the design and 
show the average and range of yields at each run. Does 
this sketch aid in data interpretation? 


14-18. An experiment described by M. G. Natrella in the 
National Bureau of Standards’ Handbook of Experimental 
Statistics (No. 91, 1963) involves flame-testing fabrics after 
applying fire-retardant treatments. The four factors considered 
are type of fabric (A), type of fire-retardant treatment (B), 
laundering condition (C—the low level is no laundering, the 
high level is after one laundering), and method of conducting 
the flame test (D). All factors are run at two levels, and the 
response variable is the inches of fabric burned on a standard 
size test sample. The data are: 


problems with model adequacy? (1) = 42 d =40 

14-17. An experiment was run in a semiconductor a =31 ad =30 
fabrication plant in an effort to increase yield. Five factors, b = 45 bd =50 

—~ each at two levels, were studied. The factors (and levels) were ab =29 abd = 25 
@ A = aperture setting (small, large), B = exposure time (20% c = 39 cd =40 
below nominal, 20% above nominal), C = development time ac = 28 acd = 25 

(30 and 45 seconds), D = mask dimension (small, large), and be = 46 bcd = 50 

E = etch time (14.5 and 15.5 minutes). The following abc = 32 abcd = 23 


unreplicated 2° design was run: 


1) = 7 e = 8 
” = 9 de = 12 (a) Estimate the effects and prepare a normal plot of the 
b = 34 be = 35 effects. 
ab =55 abe =52 (b) Construct an analysis of variance table based on the model 
Cc = 16 ce = 15 tentatively identified in part (a). 
ac = 20 ace =22 (c) Construct a normal probability plot of the residuals and 
be = 40 bce = 45 comment on the results. 
ae = 00 ahee = 85 14-19. Consider the data from Exercise 14-12. Suppose 
oe oe that the data from the second replicate was not available. 
ad = 10 ade = 10 Analyze the data from replicate I only and comment on your 
bd = 32 bde = 30 Splines. 

g 
abd = 50 abde = 53 
cd =18 cde =15 14-20. A 2* factorial design was run in a chemical 
acd = 21 acde = 20 process. The design factors are A = time, B = concentration, 
bcd = 44 bcde = 41 C = pressure, and D = temperature. The response variable is 
abcd = 61 abcde = 63 yield. The data follow: 


Yield Factor Levels 
Run A BC OD _ (pounds) = + 
1 = = = = 12 A (hours) 2 3 
a 18 B(%) 14 18 
3 = fF = = 13 C(psi) 60 80 
4 16 DCC) 200 250 
5 SS a = 17 
6 15 
7 20 
8 15 
9 = = = + 10 
10 25 
11 13 
12 24 
13 = =|} = =F 19 
14 21 
15 17 
16 23 


(a) Estimate the factor effects. Based on a normal probability 
plot of the effect estimates, identify a model for the data 
from this experiment. 

(b) Conduct an ANOVA based on the model identified in 
part (a). What are your conclusions? 

(c) Analyze the residuals and comment on model adequacy. 

(d) Find a regression model to predict yield in terms of the 
actual factor levels. 

(e) Can this design be projected into a 2? design with two 
replicates? If so, sketch the design and show the average 
and range of the two yield values at each cube corner. 
Discuss the practical value of this plot. 


14-21. An experiment has run a single replicate of a 2* 

design and calculated the following factor effects: 

A = 80.25 AB = 53.25 ABC = —2.95 

B = —65.50 AC = 11.00 ABD =-—8.00 

C= —9,.25 AD = 9.75 ACD = 10.25 

D = —20.50 BC = 18.36 BCD =—7.95 
BD = 15.10 ABCD = —6.25 
CD = —1.25 


(a) Construct a normal probability plot of the effects. 

(b) Identify a tentative model, based on the plot of effects in 
part (a). 

(c) Estimate the regression coefficients in this model, assum- 
ing that y = 400. 

14-22. A two-level factorial experiment in four factors was 

conducted by Chrysler and described in the article “Sheet 

Molded Compound Process Improvement” by P. I. Hsieh and 

D. E. Goodwin (Fourth Symposium on Taguchi Methods, 
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American Supplier Institute, Dearborn, MI, 1986, pp. 13-21). 

The purpose was to reduce the number of defects in the finish 

of sheet-molded grill opening panels. A portion of the experi- 

mental design, and the resulting number of defects, y, observed 
on each run is shown in the table following. This is a single 
replicate of the 2* design. 

(a) Estimate the factor effects and use a normal probability 
plot to tentatively identify the important factors. 

(b) Fit an appropriate model using the factors identified in 
part (a) above. 

(c) Plot the residuals from this model versus the predicted num- 
ber of defects. Also, prepare a normal probability plot of the 
residuals. Comment on the adequacy of these plots. 

(d) The table also shows the square root of the number of 
defects. Repeat parts (a) and (c) of the analysis using the 
square root of the number of defects as the response. Does 
this change the conclusions? 


The Gril Defects Experiment 


Run A B C D y Vy 
1 56 7.48 

2 17 4.12 

3 2, 1.41 

4 2.00 

5 3 1.73 

6 2.00 

7 50 7.07 

8 2 1.41 

9 1.00 
10 0.00 
11 3 1.73 
12 12 3.46 
13 3 1.73 
14 4 2.00 
15 0 0.00 
16 0 0.00 

14-23. Consider a 2? factorial experiment with four center 


points. The data are (1) = 21, a = 125, b = 154, ab = 352, 
and the responses at the center point are 92, 130, 98, 152. 
Compute an ANOVA with the sum of squares for curvature 
and conduct an F-test for curvature. Use a = 0.05. 


14-24. Consider the experiment in Exercise 14-14. Suppose 
that a center point with five replicates is added to the factorial 
runs and the responses are 2800, 5600, 4500, 5400, 3600. 
Compute an ANOVA with the sum of squares for curvature 
and conduct an F-test for curvature. Use a = 0.05. 

14-25. Consider the experiment in Exercise 14-17. Suppose 
that a center point with five replicates is added to the factorial 
runs and the responses are 45, 40, 41, 47, and 43. 


s 
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(a) Estimate the experimental error using the center points. -l -1 1 -1 —1l 703.67 
Compare this to the estimate obtained originally in Exercise 1 may 1 = -| 642.14 
14-17 by pooling apparently nonsignificant effects. =4 1 1 -1 a] 692.98 

(b) Test for curvature with a = 0.05. ‘ 

Me, 1 1 1 = =1 669.26 

14-26. An article in Talanta (2005, Vol. 65, pp. 895-899) 

eo presented a 2° factorial design to find lead level by using flame l | | : a 718 
«= atomic absorption spectrometry (FAAS). The data are shown 1 =f = | Zz! 475.52 
in the following table. —1 1 -l 1 —l 478.76 
1 1 =ll 1 =I 568.23 
-l —l 1 1 —l 444.72 
Factors Lead Recovery (%) 
a Gi 7 AE al ©) 1 =Il 1 1 al 410.37 
= — 1 1 1 1 = 428.51 
: ae eae 1 1 1 1 = 491.47 
2 51.3 48 é 

3 57.9 58.1 1 1 1 1 1 607.34 

4 78.9 85.9 1 1 1 1 1 620.8 
5 78.9 84.2 -l 1 —1 —1 1 610.55 
6 84.2 84.2 1 1 = =| 1 638.04 
7 94.4 90.9 = -1 1 =I 1 585.19 
8 94.7 105.3 1 = 1 Al 1 586.17 
—l 1 1 =! 1 601.67 
: . 1 1 1 =I 1 608.31 

The factors and levels are shown in the following table. 

1 1 1 1 1 442.9 
1 =ll =ll 1 1 434.41 
Factor Low (—) High (+) -] 1 -| l 1 417.66 
Reagent concentration (RC) 5x 10° 5x 10° 1 1 —| 1 1 510.84 
1 
pnolla) - -1 1 1 1 392.11 
pH oe oe 1 = 1 1 1 343.22 
Shaking time (ST) (min) 10 30 
—l 1 1 1 1 385.52 
1 1 1 1 1 446.73 


(a) Construct a normal probability plot of the effect estimates. 
Which effects appear to be large? 

(b) Conduct an analysis of variance to confirm your findings 
for part (a). 

(c) Analyze the residuals from this experiment. Are there any 
problems with model adequacy? 


14-27. An experiment to study the effect of machining fac- 
tors on ceramic strength was described at http://www.itl. 
nist.gov/div898/handbook/. Five factors were considered at 
— two levels each: A = Table Speed, B = Down Feed Rate, C = 
() Wheel Grit, D = Direction, E = Batch. The response is the av- 
erage of the ceramic strength over 15 repetitions. The following 
data are from a single replicate of a 2° factorial design. 


A B Cc D E Strength 
1 1 1 1 1 680.45 
1 1 1 1 1 722.48 

—1 1 1 1 1 702.14 
1 1 1 1 1 666.93 


(a) Estimate the factor effects and use a normal probability 
plot of the effects. Identify which effects appear to be 
large. 

(b) Fit an appropriate model using the factors identified in 
part (a) above. 

(c) Prepare a normal probability plot of the residuals. Also, 
plot the residuals versus the predicted ceramic strength. 
Comment on the adequacy of these plots. 

(d) Identify and interpret any significant interactions. 

(e) What are your recommendations regarding process oper- 
ating conditions? 


14-28. Consider the following Minitab output for a 2? fac- 

torial experiment. 

(a) How many replicates were used in the experiment? 

(b) Use Equation 14-17 to calculate the standard error of a 
coefficient. 

(c) Calculate the entries marked with “‘?” in the output. 


Factorial Fit: y versus A, B, C 
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Estimated Effects and Coefficients for y (coded units) 


Term Effect Coef SE Coef T P 
Constant 579.33 38.46 15.06 0.000 
A 2.95 1.47 38.46 0.04 0.970 
B 15.92 } 38.46 0.21 0.841 
C —37.87 —18.94 38.46 —0.49 0.636 
A*B 20.43 10.21 38.46 2 0.797 
A*C -17.11 —8.55 38.46 —0.22 0.830 
B*C 441 2.21 38.46 0.06 0.956 
A*B*C 13.35 6.68 2? 0.17 0.866 


S = 153.832 R-Sq = 5.22% R-—Sq (adj) = 0.00% 


Analysis of Variance for y (coded units) 


Source DF SeqSS AdjSS Adj MS F P 
Main Effects 3 6785 6785 2261.8 ? 0.960 
2-Way 3 2 2918 972.5 0.04 0.988 
Interactions 
3-Way 1 ? 713 713.3 0.03 0.866 
Interactions 
Residual 8 189314 189314 23664.2 
Error 
Pure Error 8 189314 189314 23664.2 
Total 15 199730 


14-29. An article in Analytica Chimica Acta [“Design- 
of-Experiment Optimization of Exhaled Breath Condensate 
Analysis Using a Miniature Differential Mobility Spectrometer 
(DMS)” (2008, Vol. 628, No. 2, pp. 155—-161)] examined four 
parameters that affect the sensitivity and detection of the ana- 
lytical instruments used to measure clinical samples. They op- 
timized the sensor function using EBC samples spiked with 
acetone, a known clinical biomarker in breath. The following 
table shows the results for a single replicate of a 2* factorial 
experiment for one of the outputs, the average amplitude of 
acetone peak over three repetitions. 
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Configuration A B C D Y 

1 ap aa 0.12 

2 0.1193 
3 0.1196 
4 0.1192 
5 0.1186 
6 0.1188 
7 0.1191 
8 0.1186 
9 0.121 

10 0.1195 
11 = oh = ata 0.1196 
12 0.1191 


oO 
| 
| 
dt 


eee 
ans 

| 

| 

| 

+ 

S So 
aan aan 
= = 
oo ‘o 
es) & 


The factors and levels are shown in the following table. 


A_ RF voltage of the DMS sensor (1200 or 1400 V) 
B_ Nitrogen carrier gas flow rate (250 or 500mLmin“') 


C_ Solid phase microextraction (SPME) filter type 
(polyacrylate or PDMS—DVB) 


D_ GC cooling profile (cryogenic and noncryogenic) 


(a) Estimate the factor effects and use a normal probability plot 
of the effects. Identify which effects appear to be large, and 
identify a model for the data from this experiment. 

(b) Conduct an ANOVA based on the model identified in part 
(a). What are your conclusions? 

(c) Analyze the residuals from this experiment. Are there any 
problems with model adequacy? 

(d) Project the design in this problem into a 2’ design for r < 4 
in the important factors. Sketch the design and show the 
average and range of yields at each run. Does this sketch 
aid in data representation? 


14-6 BLOCKING AND CONFOUNDING IN THE 2‘ DESIGN 


It is often impossible to run all the observations in a 2 factorial design under homogeneous 
conditions. Blocking is the design technique that is appropriate for this general situation. 
However, in many situations the block size is smaller than the number of runs in the complete 
replicate. In these cases, confounding is a useful procedure for running the 2 design in 2” blocks 
where the number of runs in a block is less than the number of treatment combinations in one 
complete replicate. The technique causes certain interaction effects to be indistinguishable from 
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Figure 14-28 A 2? 
design in two blocks. 
(a) Geometric view. 

(b) Assignment of the 
four runs to two blocks. 


e= Run inblock 1 Block 1 Block 2 
o = Run in block 2 (1) a 
ab b 
(1) 
oa Od 
- + 
A Assignment of the four 
Geometric view runs to two blocks 
(a) (b) 


blocks or confounded with blocks. We wiil illustrate confounding in the 2‘ factorial design in 
2” blocks, where p < k. 

Consider a 2? design. Suppose that each of the 2” = 4 treatment combinations requires four 
hours of laboratory analysis. Thus, two days are required to perform the experiment. If days are 
considered as blocks, we must assign two of the four treatment combinations to each day. 

This design is shown in Fig. 14-28. Notice that block | contains the treatment combina- 
tions (1) and ab and that block 2 contains a and b. The contrasts for estimating the main 
effects of factors A and B are 


Contrast, = ab + a — b — (1) 


Contrastg = ab + b— a — (1) 


Note that these contrasts are unaffected by blocking since in each contrast there is one plus 
and one minus treatment combination from each block. That is, any difference between block 1 
and block 2 that increases the readings in one block by an additive constant cancels out. The 
contrast for the AB interaction is 


Contrasts; = ab + (1) -a-—b 


Since the two treatment combinations with the plus signs, ab and (1), are in block | and the 
two with the minus signs, a and J, are in block 2, the block effect and the AB interaction are 
identical. That is, the AB interaction is confounded with blocks. 

The reason for this is apparent from the table of plus and minus signs for the 2? design 
shown in Table 14-12. From the table we see that all treatment combinations that have a plus 
on AB are assigned to block 1, whereas all treatment combinations that have a minus sign on 
AB are assigned to block 2. 

This scheme can be used to confound any 2* design in two blocks. As a second example, 
consider a 2? design, run in two blocks. From the table of plus and minus signs, shown in Table 
14-15, we assign the treatment combinations that are minus in the ABC column to block 1 and 
those that are plus in the ABC column to block 2. The resulting design is shown in Fig. 14-29. 

There is a more general method of constructing the blocks. The method employs a 
defining contrast, say, 


L = 4x, + 9X. +0 + Oxy (14-20) 


where x; is the level of the ith factor appearing in a treatment combination and a; is the expo- 
nent appearing on the ith factor in the effect that is to be confounded with blocks. For the 2‘ 
system, we have either a; = O or 1, and either x; = O (low level) or x; = 1 (high level). 


Figure 14-29 The 2? 
design in two blocks 
with ABC confounded. 
(a) Geometric view. 
(b) Assignment of the 
eight runs to two 
blocks. 
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e@ = Run in block 1 


Block 1 Block 2 
o = Run in block 2 
(1) a 
ab 6 
C 
ac Cc 
B bc abc 
A 
Assignment of the eight 
Geometric view runs to two blocks 
(a) (b) 


Treatment combinations that produce the same value of L (modulus 2) will be placed in the 
same block. Since the only possible values of L (mod 2) are 0 and 1, this will assign the 2* 
treatment combinations to exactly two blocks. 

As an example, consider the 2? design with ABC confounded with blocks. Here x, corre- 
sponds to A, x, to B, x3 to C, and a; = a, = a; = 1. Thus, the defining contrast that would be 
used to confound ABC with blocks is 


L=x, + x, + x3 


To assign the treatment combinations to the two blocks, we substitute the treatment combina- 
tions into the defining contrast as follows: 


(1): L = 1(0) + 1(0) + 1(0) = 0 = 0 (mod 2) 
a: L= (1) + 1(0) + 1(0) = 1 = 1 (mod 2) 
b: L=1(0) + 1(1) + 1(0) = 1 = 1 (mod 2) 
ab: L = 1(1) + 1(1) + 1(0) = 2 = 0 (mod 2) 
c: L=1(0) + 1(0) + 101) = 1 = 1 (mod 2) 
ac. L = 1(1) + 1(0) + 1(1) = 2 = 0(mod 2) 
be: L = 1(0) + 1(1) + 1(1) = 2 = 0 (mod 2) 
abe: L = 1(1) + 1(1) + 1(1) = 3 = 1 (mod 2) 


Thus (1), ab, ac, and be are run in block 1, and a, b, c, and abc are run in block 2. This same 
design is shown in Fig. 14-29. 

A shortcut method is useful in constructing these designs. The block containing the treat- 
ment combination (1) is called the principal block. Any element [except (1)] in the principal 
block may be generated by multiplying two other elements in the principal block modulus 2 
on the exponents. For example, consider the principal block of the 2° design with ABC 
confounded, shown in Fig. 14-29. Note that 


ab + ac = a’bc = be 
ab + bc = ab’c = ac 


ac * bc = abc* = ab 
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Treatment combinations in the other block (or blocks) may be generated by multiplying 
one element in the new block by each element in the principal block modulus 2 on the 
exponents. For the 2? with ABC confounded, since the principal block is (1), ab, ac, and bc, 
we know that the treatment combination b is in the other block. Thus, elements of this 


second block are 


EXAMPLE 14-7 Missile Miss Distance 

An experiment is performed to investigate the effect of four 
factors on the terminal miss distance of a shoulder-fired 
ground-to-air missile. The four factors are target type (A), 
seeker type (B), target altitude (C), and target range (D). Each 
factor may be conveniently run at two levels, and the optical 
tracking system will allow terminal miss distance to be meas- 
ured to the nearest foot. Two different operators or gunners are 
used in the flight test and, since there may be differences be- 
tween operators, the test engineers decided to conduct the 24 
design in two blocks with ABCD confounded. Thus, the defin- 
ing contrast is 


L=x, + xX. +x, + xy 


b+ (1) =b 
b+ ab = ab’ =a 
b:+ac = abc 
b-bc =b’e=c 


The experimental design and the resulting data are shown 
in Fig. 14-30. The effect estimates obtained from Minitab are 
shown in Table 14-23. A normal probability plot of the effects 
in Fig. 14-31 reveals that A (target type), D (target range), 
AD, and AC have large effects. A confirming analysis of vari- 
ance, pooling the three-factor interactions as error, is shown 
in Table 14-24. 

Practical Interpretation: Since the AC and AD interactions 
are significant, it is logical to conclude that A (target type), C 
(target altitude), and D (target range) all have important effects 
on the miss distance and that there are interactions between tar- 
get type and altitude and target type and range. Notice that the 
ABCD effect is treated as blocks in this analysis. 


e = Run in block 1 B 


o = Run in block 2 


Geometric view 


(a) 


Block 1 Block 2 
(1) = a=7 
ab = b =5 
ac = c =6 
be =8 d =4 
ad =10 abc =6 
bd = bed =7 
ed = acd = 
abcd = abd =12 


Assignment of the sixteen 
runs to two blocks 


(b) 


Figure 14-30 The 2* design in two blocks for Example 14-7. (a) Geometric view. (b) Assignment of the 


16 runs to two blocks. 
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Table 14-23 Minitab Effect Estimates for 


Example 14-7 


Estimated Effects and Coefficients for Distance 


Term 
Constant 
Block 

A 


ABC 
ABD 
ACD 
BCD 


Effect 


2.625 
0.625 
0.875 
1.875 
—0.125 
=2:315 
1.625 
—0.375 
—0.375 
—0.125 
—0.125 
0.875 
—0.375 
—0.375 


Coef 
6.938 
0.063 
1.312 
0.313 
0.438 
0.938 
—0.063 
—1.187 

0.813 
—0.188 
—0.187 
—0.062 
—0.063 

0.438 
—0.187 
—0.187 


Normal score 


Effect 


Figure 14-31 Normal probability plot of the 
effects from Minitab, Example 14-6. 
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It is possible to confound the 2‘ design in four blocks of 2‘? observations each. To con- 
struct the design, two effects are chosen to confound with blocks, and their defining contrasts 
are obtained. A third effect, the generalized interaction of the two effects initially chosen, is 
also confounded with blocks. The generalized interaction of two effects is found by multiply- 
ing their respective letters and reducing the exponents modulus 2. 

For example, consider the 2* design in four blocks. If AC and BD are confounded with 
blocks, their generalized interaction is (AC)(BD) = ABCD. The design is constructed by using 


Table 14-24 Analysis of Variance for Example 14-7 


Source of Sum of Degrees of Mean 
Variation Squares Freedom Square to P-Value 
Blocks (ABCD) 0.0625 1 0.0625 0.06 — 
A 27.5625 1 27.5625 25.94 0.0070 
B 1.5625 1 1.5625 1.47 0.2920 
C 3.0625 1 3.0625 2.88 0.1648 
D 14.0625 1 14.0625 13.24 0.0220 
AB 0.0625 1 0.0625 0.06 — 
AC 22.5625 1 22.5625 21.24 0.0100 
AD 10.5625 1 10.5625 9.94 0.0344 
BC 0.5625 1 0.5625 0.53 — 
BD 0.5625 1 0.5625 0.53 — 
CD 0.0625 1 0.0625 0.06 — 
Error (ABC + ABD + ACD + BCD) 4.2500 4 1.0625 
Total 84.9375 15 
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the defining contrasts for AC and BD: 
5 =X) + X3 
LI, =X. + X4 


It is easy to verify that the four blocks are 


Block 1 Block 2 Block 3 Block 4 
L, =0,L, =0 L, =1,L,=0 L,=0,L, =1 L,=1,L,=1 
(1) a b a 
ac Cc abc bc 
bd abd d ad 
abcd bcd acd cd 


This general procedure can be extended to confounding the 2‘ design in 2? blocks, where 
p <-k. Start by selecting p effects to be confounded, such that no effect chosen is a general- 
ized interaction of the others. Then the blocks can be constructed from the p defining contrasts 
L,, Lo,..., L, that are associated with these effects. In addition to the p effects chosen to be 
confounded, exactly 2? — p — 1 additional effects are confounded with blocks; these are the 
generalized interactions of the original p effects chosen. Care should be taken so as not to con- 
found effects of potential interest. 

For more information on confounding in the 2* factorial design, refer to Montgomery 
(2009). This book contains guidelines for selecting factors to confound with blocks so that 
main effects and low-order interactions are not confounded. In particular, the book contains a 
table of suggested confounding schemes for designs with up to seven factors and a range of 


block sizes, some of which are as small as two runs. 


EXERCISES FOR SECTION 14-6 


14-30. Consider the data from the first replicate of Ex- 

ercise 14-12. 

(a) Suppose that these observations could not all be run under 
the same conditions. Set up a design to run these observa- 
tions in two blocks of four observations each, with ABC 
confounded. 

(b) Analyze the data. 


14-31. Consider the data from the first replicate of Ex- 

ercise 14-13. 

(a) Construct a design with two blocks of eight observations 
each, with ABCD confounded. 

(b) Analyze the data. 


14-32. Consider the data from Exercise 14-18. 

(a) Construct the design that would have been used to run this 
experiment in two blocks of eight runs each. 

(b) Analyze the data and draw conclusions. 

14-33. Construct a 2° design in two blocks. Select the 

ABCDE interaction to be confounded with blocks. 

14-34. Consider the data from the first replicate of Exercise 

14-13, assuming that four blocks are required. Confound ABD 

and ABC (and consequently CD) with blocks. 


(a) Construct a design with four blocks of four observations each. 
(b) Analyze the data. 


14-35. Construct a 2° design in four blocks. Select the appro- 
priate effects to confound so that the highest possible interactions 
are confounded with blocks. 


14-36. Consider the 2° factorial design. Set up a design to 
be run in four blocks of 16 runs each. Show that a design that 
confounds three of the four-factor interactions with blocks is 
the best possible blocking arrangement. 


14-37. Anarticle in Quality Engineering [“Designed Experi- 
ment to Stabilize Blood Glucose Levels” (1999-2000, Vol. 12, 
pp. 83-87)] reported on an experiment to minimize variations in 
blood glucose levels. The factors were: volume of juice intake 
before exercise (4 or 8 0z), amount of exercise on a Nordic 
Track cross-country skier (10 or 20 min), and delay between 
the time of juice intake (0 or 20 min) and the beginning of the 
exercise period. The experiment was blocked for time of day. 
The data follow. 

(a) What effects are confounded with blocks? Comment on 

any concerns with the confounding in this design. 

(b) Analyze the data and draw conclusions. 
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Average 
Juice Exercise Delay Time of Blood 
Run (0z) (min) (min) Day Glucose 
1 4 10 0 pm ales) 
2 8 10 0 am 103 
3 4 20 0 am 83.5 
4 8 20 0 pm 126 
5 4 10 20 am 125.5 
6 8 10 20 pm 129.5 
7 4 20 20 pm 95 
8 8 20 20 am 93 
14-38. An article in Industrial and Engineering Chemistry 


Factorial Fit: y versus Block, A, B, C, D 
Estimated Effects and Coefficients for y (coded units) 


(‘Factorial Experiments in Pilot Plant Studies” (1951, 
pp. 1300—1306)] reports on an experiment to investigate 
the effect of temperature (A), gas throughput (B), and con- 
centration (C) on the strength of product solution in a re- 
circulation unit. Two blocks were used with ABC con- 
founded, and the experiment was replicated twice. The data 
follow. 

(a) Analyze the data from this experiment. 


Replicate 1 
Block 1 Block 2 
(1) = 99 a= 18 
ab = 52 b= 51 
ac = 42 c =108 
be = 95 abc = 35 
Replicate 2 
Block 3 Block 4 
(1) = 46 a= 18 
ab = 47 b= 62 
ac = 22 c=104 
bc = 67 abc = 36 


Term Effect Coef Se Coef T P 
Constant 579.33 9.928 58.35 0.000 
Block 105.68 9.928 10.64 0.000 
A —15.41 —7.70 9.928  —0.78 0.481 
B 2.95 1.47 9.928 0.15 0.889 
Cc 15.92 7.96 9.928 0.80 0.468 
D —37.87 —18.94 9.928 -—1.91 0.129 
A*B —8.16 —4.08 9.928 —0.41 0.702 
A*C 5.91 2.95 9.928 0.30 0.781 
A*D 30.28 ? 9.928 2? 0.202 
B*C 20.43 10.21 9.928 1.03 0.362 
B*D -17.11  —8.55 9.928 —0.86 0.437 
C*D 4.41 2.21 9.928 0.22 0.835 
S = 39.7131 R-Sq = 96.84% R-Sq (adj) = 88.16% 
Analysis of Variance for y (coded units) 
Source DF SeqSS AdjSS_ Adj MS F P 
Blocks ? 178694 178694 178694 113.30 0.000 
Main Effects 4 77135 7735 1934 1.23 0.424 
2-Way 6 6992 6992 ? 0.74 0.648 
Interactions 
Residual 4 6309 6309 1577 
Error 
Total 15 199730 


14-40. An article in Advanced Semiconductor Manufacturing 
Conference (ASMC) (May 2004, pp. 325-29) stated that dis- 
patching rules and rework strategies are two major operational 
elements that impact productivity in a semiconductor fabrication 
plant (fab). A four-factor experiment was conducted to determine 
the effect of dispatching rule time (5 or 10 min), rework delay 
(0 or 15 min), fab temperature (60 or 80°F), and rework levels 
(level 0 or level 1) on key fab performance measures. The per- 
formance measure that was analyzed was the average cycle time. 
The experiment was blocked for the fab temperature. Data modi- 
fied from the original study are shown in the following table. 


(b) Analyze the residuals and comment on model adequacy. 

(c) Comment on the efficiency of this design. Note that we 
have replicated the experiment twice, yet we have no 
information on the ABC interaction. 

(d) Suggest a better design, specifically, one that would 
provide some information on all interactions. 


14-39. Consider the following Minitab output from a single 

replicate of a 2* experiment in two blocks with ABCD con- 

founded. 

(a) Comment on the value of blocking in this experiment. 

(b) What effects were used to generate the residual error in 
the ANOVA? 

(c) Calculate the entries marked with “?” in the output. 


Dispatching Rework Fab Average 
Rule Time Delay Rework Temperature Cycle Time 
Run (min) (min) Level (GR) (min) 
1 5 0 0 60 218 
2 10 0 0 80 256.5 
3 5) 0 1 80 Pel 
4 10 0 1 60 302.5 
5) 5 15 0 80 298.5 
6 10 15 0 60 314 
7 5 15 1 60 249 
8 10 15 1 80 241 


(a) What effects are confounded with blocks? Do you find any 
concerns with confounding in this design? If so, comment 


on it. 


(b) Analyze the data and draw conclusions. 
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14-7 FRACTIONAL REPLICATION OF THE 2' DESIGN 


As the number of factors in a 2* factorial design increases, the number of runs required 
increases rapidly. For example, a 2° requires 32 runs. In this design, only 5 degrees of free- 
dom correspond to main effects, and 10 degrees of freedom correspond to two-factor 
interactions. Sixteen of the 31 degrees of freedom are used to estimate high-order interactions— 
that is, three-factor and higher order interactions. Often there is little interest in these high- 
order interactions, particularly when we first begin to study a process or system. If we can 
assume that certain high-order interactions are negligible, a fractional factorial design in- 
volving fewer than the complete set of 2‘ runs can be used to obtain information on the main 
effects and low-order interactions. In this section, we will introduce fractional replications of 
the 2* design. 

A major use of fractional factorials is in screening experiments. These are experiments 
in which many factors are considered with the purpose of identifying those factors (if any) that 
have large effects. Screening experiments are usually performed in the early stages of a 
project when it is likely that many of the factors initially considered have little or no effect 
on the response. The factors that are identified as important are then investigated more 
thoroughly in subsequent experiments. 


14-7.1  One-Half Fraction of the 2" Design 


A one-half fraction of the 2‘ design contains 2‘! runs and is often called a 2‘! fractional fac- 
torial design. As an example, consider the 2°’ design—that is, a one-half fraction of the 2°. 
This design has only four runs, in contrast to the full factorial that would require eight runs. 
The table of plus and minus signs for the 2° design is shown in Table 14-25. Suppose we se- 
lect the four treatment combinations a, b, c, and abc, as our one-half fraction. These treatment 
combinations are shown in the top half of Table 14-25 and in Fig. 14-32(a). 

Notice that the 2*~' design is formed by selecting only those treatment combinations that 
yield a plus on the ABC effect. Thus, ABC is called the generator of this particular fraction. 
Furthermore, the identity element / is also plus for the four runs, so we call 


I= ABC 


the defining relation for the design. 
The treatment combinations in the 2*' design yields three degrees of freedom associated 
with the main effects. From the upper half of Table 14-25, we obtain the estimates of the main 


Table 14-25 Plus and Minus Signs for the 2? Factorial Design 


Treatment Factorial Effect 
Combination I A B C AB AC BC ABC 


a = aie ang, 


Figure 14-32 The 
one-half fractions of 
the 23 design. (a) The 
principal fraction, 

I = +ABC. (b) The 
alternate fraction, 
I= —ABC. 
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The principal fraction, J = +ABC The alternate fraction, J =-ABC 
(a) (b) 


effects as linear combinations of the observations, say, 
A= [a —b—c + abc] 
B= \,[-a + b-c + abc] 


C= %[-a-b+c + abc] 


It is also easy to verify that the estimates of the two-factor interactions should be the follow- 
ing linear combinations of the observations: 


BC = bla —b-—c + abc] 
AC = '4[-a + b-—c + abc] 
AB 


%[-a — b+ c + abc] 


Thus, the linear combination of observations in column A, €,, estimates both the main effect 
of A and the BC interaction. That is, the linear combination €, estimates the sum of these two 
effects A + BC. Similarly, €, estimates B + AC, and f¢ estimates C + AB. Two or more 
effects that have this property are called aliases. In our 2?~' design, A and BC are aliases, B 
and AC are aliases, and C and AB are aliases. Aliasing is the direct result of fractional replica- 
tion. In many practical situations, it will be possible to select the fraction so that the main 
effects and low-order interactions that are of interest will be aliased only with high-order 
interactions (which are probably negligible). 

The alias structure for this design is found by using the defining relation J = ABC. 
Multiplying any effect by the defining relation yields the aliases for that effect. In our 
example, the alias of A is 


A =A: ABC = A’BC = BC 
since A + J = A and A” = IJ. The aliases of B and C are 

B = B- ABC = AB’C = AC 
and 

C = C: ABC = ABC’ = AB 


Now suppose that we had chosen the other one-half fraction, that is, the treatment com- 
binations in Table 14-25 associated with minus on ABC. These four runs are shown in the 
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lower half of Table 14-25 and in Fig. 14-32(b). The defining relation for this design is 
I = —ABC. The aliases are A = —BC, B = —AC, and C = —AB. Thus, estimates of A, B, and 
C that result from this fraction really estimate A — BC, B — AC, and C — AB. In practice, it 
usually does not matter which one-half fraction we select. The fraction with the plus sign in 
the defining relation is usually called the principal fraction, and the other fraction is usually 
called the alternate fraction. 

Note that if we had chosen AB as the generator for the fractional factorial, 


A=A-+AB=B8B 


and the two main effects of A and B would be aliased. This typically loses important 
information. 

Sometimes we use sequences of fractional factorial designs to estimate effects. For 
example, suppose we had run the principal fraction of the 2*' design with generator ABC. 
From this design we have the following effect estimates: 


Suppose that we are willing to assume at this point that the two-factor interactions are negligible. 
If they are, the 21 design has produced estimates of the three main effects A, B, and C. However, 
if after running the principal fraction we are uncertain about the interactions, it is possible to esti- 
mate them by running the alternate fraction. The alternate fraction produces the following effect 
estimates: 


€, =A — BC 
€; = B — AC 
c= C— AB 


We may now obtain de-aliased estimates of the main effects and two-factor interactions 
by adding and subtracting the linear combinations of effects estimated in the two individual 
fractions. For example, suppose we want to de-alias A from the two-factor interaction BC. 
Since £4 = A + BC and £4 = A — BC, we can combine these effect estimates as follows: 


1 i 1 
5 Ea + f4) = 5(A + BC + A — BC) =A 


and 


1 i 1 
3 ba — &4) = 5 (A + BC — A + BC) = BC 


For all three pairs of effect estimates, we would obtain the following results: 


Effect, i from ¥, (i, + I) from ¥, (i, — I) 

oa Y(A+BC+A-BC)=A %[A + BC — (A — BC)] = BC 
i=B %4(B+AC+B-AC)=B %[B + AC — (B— AC)] = AC 
i=C %(C + AB + C — AB) =C %[C + AB — (C— AB)] = AB 
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Thus, by combining a sequence of two fractional factorial designs, we can isolate both the 
main effects and the two-factor interactions. This property makes the fractional factorial de- 
sign highly useful in experimental problems since we can run sequences of small, efficient ex- 
periments, combine information across several experiments, and take advantage of learning 
about the process we are experimenting with as we go along. This is an illustration of the 
concept of sequential experimentation. 

A 2*"! design may be constructed by writing down the treatment combinations for a full 
factorial with k — 1 factors, called the basic design, and then adding the kth factor by identi- 
fying its plus and minus levels with the plus and minus signs of the highest order interaction. 
Therefore, a 2*”' fractional factorial is constructed by writing down the basic design as a full 
2° factorial and then equating factor C with the +AB interaction. Thus, to construct the 
principal fraction, we would use C = +AB as follows: 


Basic Design Fractional Design 
Full 2? 23-11 = +ABC 
A B A B C=AB 
To obtain the alternate fraction we would equate the last column to C = —AB. 
EXAMPLE 14-8 Plasma Etch 
To illustrate the use of a one-half fraction, consider the plasma In this design, the main effects are aliased with the three- 
etch experiment described in Example 14-5. Suppose that we factor interactions; note that the alias of A is 


decide to use a 2*"! design with J = ABCD to investigate the 

four factors gap (A), pressure (B), C.F, flow rate (C), and power A-I=A- ABCD OF A = A°BCD = BCD 
setting (D). This design would be constructed by writing down and similarly B = ACD, C = ABD, and D = ABC. 

as the basic design a 2} in the factors A, B, and C and then set- 
ting the levels of the fourth factor D = ABC. The design and the 
resulting etch rates are shown in Table 14-26. The design is 
shown graphically in Fig. 14-33. AB-I=AB- ABCD or AB = A’B’?CD = CD 


The two-factor interactions are aliased with each other. For 
example, the alias of AB is CD: 


Table 14-26 The 24”! Design with Defining Relation J = ABCD 


Treatment Etch 

A B Cc D = ABC Combination Rate 
= = = = (1) 550 
te - - + ad 749 
= ot trl ar bd 1052 
= ab 650 

= = aP oF cd 1075 
= ac 642 

= bc 601 

oe abcd 729 
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(1) = 550 


abcd = 729 


I 
l 
I B 
I 
I 


= A 
bd = 10524 
oc 


o 


oc 


ad = 749 


Figure 14-33 The 2*”' design for the experiment of Example 14-8. 


The other aliases are AC = BD and AD = BC. 

The estimates of the main effects and their aliases are 
found using the four columns of signs in Table 14-26. For ex- 
ample, from column A we obtain the estimated effect 


£, =A + BCD = ¢(-550 + 749 — 1052 + 650 — 1075 
+ 642 — 601 + 729) 
= 127.00 


The other columns produce 
€; = B+ ACD=4.00 f¢= C+ ABD = 11.50 
and 
£p = D + ABC = 290.50 


Clearly, £4 and fp are large, and if we believe that the three- 
factor interactions are negligible, the main effects A (gap) and 
D (power setting) significantly affect etch rate. 

The interactions are estimated by forming the AB, AC, 
and AD columns and adding them to the table. For example, 


Computer Solution 


i) 
5 
a 


the signs in the AB column are +, —, —, 
this column produces the estimate 


fag = AB + CD = (550 — 749 — 1052 + 650 + 1075 
642 — 601 + 729) = —10 


From the AC and AD columns we find 

fac = AC + BD = —25.50 
and 

fap = AD + BC = —197.50 


The 4p estimate is large; the most straightforward interpreta- 
tion of the results is that since A and D are large, this is the AD 
interaction. Thus, the results obtained from the 2+! design 
agree with the full factorial results in Example 14-5. 

Practical Interpretation: Often a fraction of a 2* design 
is satisfactory when an experiment uses four or more 
factors. 


Fractional factorial designs are usually analyzed with a software package. Table 14-26 shows 
the effect estimates obtained from Minitab for Example 14-8. They are in agreement with the 


hand calculation reported earlier. 


Normal Probability Plot of Effects 


The normal probability plot is very useful in assessing the significance of effects from a frac- 
tional factorial design, particularly when many effects are to be estimated. We strongly recom- 
mend examining this plot. Figure 14-34 presents the normal probability plot of the effects 
from Example 14-8. This plot was obtained from Minitab. Notice that the A, D, and AD inter- 
action effects stand out clearly in this graph. 


Figure 14-35 
Projection of a 27"! 
design into three 2? 
designs. 
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Table 14-27 Effect Estimates from Minitab, 
Example 14-8 


Fractional Factorial Fit 


1.5 

10 = Estimated Effects and Coefficients for Etch Rt 
£05 Term Effect Coef 
ti 0.0 ‘ Constant 756.00 
5 6 ' . Gap —127.00 —63.50 
= oA Pressure 4.00 2.00 
a F 11.50 515 
ze ci Power 290.50 145.25 
a ee Oe eae veuu: Gaee Gap*Pressure —10.00 —5.00 
lise Gap*F ~25.50 ~12.75 
Figure 14-34 Normal probability plot of Gap*Power ~197.50 ~98.75 


the effects from Minitab, Example 14-8. 


Residual Analysis 

The residuals can be obtained from a fractional factorial by the regression model method 
shown previously. Note that the Minitab output for Example 14-8 in Table 14-27 shows the 
regression coefficients. The residuals should be graphically analyzed as we have discussed 
before, both to assess the validity of the underlying model assumptions and to gain additional 
insight into the experimental situation. 


Projection of the 2‘! Design 

If one or more factors from a one-half fraction of a 2‘ can be dropped, the design will project 
into a full factorial design. For example, Fig. 14-35 presents a 2°! design. Notice that this 
design will project into a full factorial in any two of the three original factors. Thus, if we think 
that at most two of the three factors are important, the 2?! design is an excellent design for iden- 
tifying the significant factors. This projection property is highly useful in factor screening, 
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Figure 14-36 The 2? 
design obtained by 
dropping factors B and 
C from the plasma 
etch experiment in 
Example 14-7. 


(1052, 1075) (749, 729) 
+1 


D (Power) 


(550, 601) 


=i 
- ® (650, 642) 


A (Gap) 


because it allows negligible factors to be eliminated, resulting in a stronger experiment in the 
active factors that remain. 

In the 2*' design used in the plasma etch experiment in Example 14-8, we found that two 
of the four factors (B and C) could be dropped. If we eliminate these two factors, the remain- 
ing columns in Table 14-24 form a 2” design in the factors A and D, with two replicates. This 
design is shown in Fig. 14-36. The main effects of A and D and the strong two-factor AD 
interaction are clearly evident from this graph. 


Design Resolution 

The concept of design resolution is a useful way to catalog fractional factorial designs accord- 
ing to the alias patterns they produce. Designs of resolution II, IV, and V are particularly 
important. The definitions of these terms and an example of each follow. 


1. Resolution III Designs. These are designs in which no main effects are aliased 
with any other main effect, but main effects are aliased with two-factor interac- 
tions and some two-factor interactions may be aliased with each other. The 2°"! 
design with J = ABC is a resolution III design. We usually employ a Roman nu- 
meral subscript to indicate design resolution; thus, this one-half fraction is a 2};! 
design. 


2. Resolution IV Designs. These are designs in which no main effect is aliased with any 
other main effect or two-factor interactions, but two-factor interactions are aliased 
with each other. The 2*"' design with J = ABCD used in Example 14-8 is a resolution 
IV design (2}y!). 

3. Resolution V Designs. These are designs in which no main effect or two-factor 
interaction is aliased with any other main effect or two-factor interaction, but two- 
factor interactions are aliased with three-factor interactions. The 2>' design with 
I = ABCDE is a resolution V design (2? '). 


Resolution III and IV designs are particularly useful in factor screening experiments. A reso- 
lution IV design provides good information about main effects and will provide some infor- 
mation about all two-factor interactions. 


14-7.2 Smaller Fractions: The 2‘? Fractional Factorial 


Although the 2‘! design is valuable in reducing the number of runs required for an experiment, 
we frequently find that smaller fractions will provide almost as much useful information at even 
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greater economy. In general, a 2 design may be run in a 1/2? fraction called a 2” fractional fac- 
torial design. Thus, a 1/4 fraction is called a 2 * design, a 1/8 fraction is called a 2‘ 3 design, 
a 1/16 fraction a 2* * design, and so on. 

To illustrate the 1/4 fraction, consider an experiment with six factors and suppose that the 
engineer is primarily interested in main effects but would also like to get some information 
about the two-factor interactions. A 2°"! design would require 32 runs and would have 
31 degrees of freedom for estimating effects. Since there are only six main effects and 15 two- 
factor interactions, the one-half fraction is inefficient—it requires too many runs. Suppose we 
consider a 1/4 fraction, or a 2° ? design. This design contains 16 runs and, with 15 degrees of 
freedom, will allow all six main effects to be estimated, with some capability for examining the 
two-factor interactions. 

To generate this design, we would write down a 2* design in the factors A, B, C, and D 
as the basic design and then add two columns for E and F. To find the new columns we 
could select the two design generators J = ABCE and J = BCDF. Thus, column E would 
be found from E = ABC, and column F would be F = BCD. That is, columns ABCE and 
BCDF are equal to the identity column. However, we know that the product of any two 
columns in the table of plus and minus signs for a 2* design is just another column in the 
table; therefore, the product of ABCE and BCDF or ABCE(BCDF) = AB?C’DEF = ADEF 
is also an identity column. Consequently, the complete defining relation for the 2° * 
design is 


I = ABCE = BCDF = ADEF 


We refer to each term in a defining relation (such as ABCE above) as a word. To find the alias 
of any effect, simply multiply the effect by each word in the foregoing defining relation. For 
example, the alias of A is 


A = BCE = ABCDF = DEF 


The complete alias relationships for this design are shown in Table 14-28. In general, the 
resolution of a 2‘? design is equal to the number of letters in the shortest word in the com- 
plete defining relation. Therefore, this is a resolution IV design; main effects are aliased with 
three-factor and higher interactions, and two-factor interactions are aliased with each other. 
This design would provide good information on the main effects and would give some idea 
about the strength of the two-factor interactions. The construction and analysis of the design 
are illustrated in Example 14-9. 


Table 14-28 Alias Structure for the 2%)? Design with I = ABCE = 
BCDF = ADEF 


A = BCE = DEF = ABCDF 
B = ACE = CDF = ABDEF 
C = ABE = BDF = ACDEF 
D = BCF = AEF = ABCDE 
E = ABC = ADF = BCDEF 
F = BCD = ADE = ABCEF 
ABD = CDE = ACF = BEF 
ACD = BDE = ABF = CEF 


AB = CE = ACDF = BDEF 
AC = BE = ABDF = CDEF 
AD = EF = BCDE = ABCF 
AE = BC = DF = ABCDEF 
AF = DE = BCEF = ABCD 
BD = CF = ACDE = ABEF 
BF = CD = ACEF = ABDE 
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Table 14-29 A 25? Design for the Injection-Molding Experiment 


Observed 
Shrinkage 
Run A B C D E = ABC F = BCD (X10) 
1 = = 6 
2 + = 10 
3} oF ap 32 
4 = + 60 
5 oF ar 4 
6 - + 15 
7 = = 26 
8 + = 60 
9 = ar 8 
10 + + 12 
11 oF = 34 
12 = = 60 
13 oF = 16 
14 - = 5 
15 = ap 37 
16 + + 52 


EXAMPLE 14-9 Injection Molding 

Parts manufactured in an injection-molding process are show- 
ing excessive shrinkage, which is causing problems in assem- 
bly operations upstream from the injection-molding area. In 
an effort to reduce the shrinkage, a quality-improvement team 
has decided to use a designed experiment to study the injection- 
molding process. The team investigates six factors—mold 
temperature (A), screw speed (B), holding time (C), cycle time 
(D), gate size (E), and holding pressure (F')—each at two lev- 
els, with the objective of learning how each factor affects 
shrinkage and obtaining preliminary information about how 
the factors interact. 

The team decides to use a 16-run two-level fractional fac- 
torial design for these six factors. The design is constructed by 
writing down a 2* as the basic design in the factors A, B, C, 
and D and then setting E = ABC and F = BCD as discussed 
above. Table 14-29 shows the design, along with the observed 
shrinkage (X10) for the test part produced at each of the 16 
runs in the design. 

A normal probability plot of the effect estimates from this 
experiment is shown in Fig. 14-37. The only large effects are A 
(mold temperature), B (screw speed), and the AB interac- 
tion. In light of the alias relationship in Table 14-28, it 
seems reasonable to tentatively adopt these conclusions. 
The plot of the AB interaction in Fig. 14-38 shows that the 
process is insensitive to temperature if the screw speed is at 
the low level but sensitive to temperature if the screw speed 
is at the high level. With the screw speed at a low level, the 


process should produce an average shrinkage of around 10% 
regardless of the temperature level chosen. 

Based on this initial analysis, the team decides to set 
both the mold temperature and the screw speed at the low 
level. This set of conditions should reduce the mean shrink- 
age of parts to around 10%. However, the variability in 
shrinkage from part to part is still a potential problem. In 
effect, the mean shrinkage can be adequately reduced by 
the above modifications; however, the part-to-part variabil- 
ity in shrinkage over a production run could still cause 
problems in assembly. One way to address this issue is to 
see if any of the process factors affect the variability in 
parts shrinkage. 

Figure 14-39 presents the normal probability plot of the 
residuals. This plot appears satisfactory. The plots of residuals 
versus each factor were then constructed. One of these plots, 
that for residuals versus factor C (holding time), is shown in 
Fig. 14-40. The plot reveals much less scatter in the residuals 
at the low holding time than at the high holding time. These 
residuals were obtained in the usual way from a model for pre- 
dicted shrinkage 


3 = Bo + Bix + Box + Brox 
= 27.3125 + 6.9375x, + 17.8125x, + 5.9375x,%) 


where x), x, and xx, are coded variables that correspond to 
the factors A and B and the AB interaction. The regression 


Normal probability 
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Figure 14-37 Normal probability plot of effects 


for Example 14-9. 


model used to produce the residuals essentially removes 
the location effects of A, B, and AB from the data; the 
residuals therefore contain information about unexplained 
variability. Figure 14-40 indicates that there is a pattern in 
the variability and that the variability in the shrinkage of 
parts may be smaller when the holding time is at the low 
level. 

Practical Interpretation: Figure 14-41 shows the data 
from this experiment projected onto a cube in the factors A, 
B, and C. The average observed shrinkage and the range of 
observed shrinkage are shown at each corner of the cube. 
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Figure 14-39 
uals for Example 14-9. 


Normal probability plot of resid- 
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Figure 14-38 Plot of AB (mold temperature— 
screw speed) interaction for Example 14-9. 


From inspection of this figure, we see that running the 
process with the screw speed (B) at the low level is the key to 
reducing average parts shrinkage. If B is low, virtually any 
combination of temperature (A) and holding time (C) will re- 
sult in low values of average parts shrinkage. However, from 
examining the ranges of the shrinkage values at each corner 
of the cube, it is immediately clear that setting the holding 
time (C) at the low level is the most appropriate choice if we 
wish to keep the part-to-part variability in shrinkage low during 
a production run. 


iN 
ee 


Residuals 
ro) 


6 Low High 
Holding time (C) 

Figure 14-40 Residuals versus holding time 

(C) for Example 14-9. 
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B, screw speed 


Figure 14-41 Average 

shrinkage and range of - 
shrinkage in factors A, B, = + 
and C for Example 14-9. A, mold temperature 


The concepts used in constructing the 2° * fractional factorial design in Example 14-9 
can be extended to the construction of any 2” fractional factorial design. In general, a 2* 
fractional factorial design containing 2‘? runs is called a 1/2? fraction of the 2‘ design or, 
more simply, a 2‘? fractional factorial design. These designs require the selection of p inde- 
pendent generators. The defining relation for the design consists of the p generators initially 
chosen and their 2? — p — 1 generalized interactions. 

The alias structure may be found by multiplying each effect column by the defining 
relation. Care should be exercised in choosing the generators so that effects of potential inter- 
est are not aliased with each other. Each effect has 2’ — 1| aliases. For moderately large values 
of k, we usually assume higher order interactions (say, third- or fourth-order or higher) to be 
negligible, and this greatly simplifies the alias structure. 

It is important to select the p generators for the 2‘ ” fractional factorial design in such 
a way that we obtain the best possible alias relationships. A reasonable criterion is to 
select the generators so that the resulting 2“” design has the highest possible design res- 
olution. Montgomery (2009) presented a table of recommended generators for 2‘? frac- 
tional factorial designs for k = 15 factors and up to as many as n = 128 runs. A portion 
of his table is reproduced here as Table 14-30. In this table, the generators are shown with 
either + or — choices; selection of all generators as + will give a principal fraction, while 
if any generators are — choices, the design will be one of the alternate fractions for the 
same family. The suggested generators in this table will result in a design of the highest 
possible resolution. Montgomery (2009) also provided a table of alias relationships for 
these designs. 


EXAMPLE 14-10 Aliases with Seven Factors 


To illustrate the use of Table 14-30, suppose that we have 
seven factors and that we are interested in estimating the seven 
main effects and obtaining some insight regarding the two- 
factor interactions. We are willing to assume that three-factor 
and higher interactions are negligible. This information sug- 
gests that a resolution IV design would be appropriate. 

Table 14-30 shows that two resolution IV fractions are 
available: the 2/y? with 32 runs and the 2/y? with 16 runs. The 


aliases involving main effects and two- and three-factor inter- 
actions for the 16-run design are presented in Table 14-31. 
Notice that all seven main effects are aliased with three-factor 
interactions. All the two-factor interactions are aliased in 
groups of three. Therefore, this design will satisfy our objec- 
tives; that is, it will allow the estimation of the main effects, 
and it will give some insight regarding two-factor interactions. 
It is not necessary to run the 2/y 7 design, which would require 
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Table 14-30 Selected 2? Fractional Factorial Designs 


Number of Number of 
Factors Number of Design Factors Number of Design 
k Fraction Runs Generators k Fraction Runs Generators 
3 Dis 4 C = +AB 10 H = ABCG 
Ny! 8 = +ABC = +ACDE 
5 Oe 16 = +ABCD pes 128 K = +ACDF 
Qy? 8 = +AB = +BCDF 
= ENC H = +ACDF 
6 er} 32 F = +ABCDE = +ABDE 
a? 16 = +ABC oS 64 = +ABCE 
= +BCD = +ABCD 
Onn 8 = +AB = +ABCE 
= +AC = +ABDE 
= EAC = +ACDE 
7. Va! 64 = +ABCDEF eg 32 = +BCDE 
Diag 32 = +ABCD = +ABC 
= +ABDE = +BCD 
Die 16 = +ABC = +ACD 
= +BCD = +ABD 
= +ACD = +ABCD 
Qa? 8 = +AB giao 16 = +AB 
= +AC 11 = +CDE 
F = <+BC H = +ABCD 
G = +ABC = +ABF 
8 2? 64 = +ABCD = +BDEF 
= +ABEF oe 64 = +ADEF 
23 32 = +ABC = +ABC 
= +ABD = +BCD 
= +BCDE = +CDE 
Deve 16 = +BCD = +ACD 
= +ACD = +ADE 
= +ABC ee 32 = +BDE 
= +ABD = +ABC 
9 Ma 128 = +ACDFG = +BCD 
= +BCEFG = +ACD 
Dime 64 = +ABCD = +ABD 
= +ACEF J = +ABCD 
= +CDEF K = +AB 
24 32 = +BCDE prin 16 L= +AC 
= +ACDE 
= +ABDE 
J = +ABCE 
pa 16 = +ABC 
= +BCD 
= +ACD 
= +ABD 
J = ABCD 


Source: Montgomery (2009) 
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Table 14-31 Generators, Defining Relation, and Aliases for the 24,7 
Fractional Factorial Design 


Generators and Defining Relation 


E = ABC, F = BCD, G = ACD 
I = ABCE = BCDF = ADEF = ACDG = BDEG = ABFG = CEFG 
Aliases 
A = BCE = DEF = CDG = BFG AB = CE = FG 
B = ACE = CDF = DEG = AFG AC = BE = DG 
C = ABE = BDF = ADG = EFG AD = EF = CG 
D = BCF = AEF = ACG = BEG AE = BC = DF 
E = ABC = ADF = BDG = CFG AF = DE = BG 
F = BCD = ADE = ABG = CEG AG = CD = BF 
G = ACD = BDE = ABF = CEF BD = CF = EG 


ABD = CDE = ACF = BEF = BCG = AEG = DFG 


32 runs. The construction of the 2/y? design is shown in Table 
14-32. Notice that it was constructed by starting with the 16- 
run 2+ design in A, B, C, and D as the basic design and then 
adding the three columns E = ABC, F = BCD, and G = ACD 
as suggested in Table 14-30. Thus, the generators for this de- 
sign are ] = ABCE, I = BCDF, and I = ACDG. The complete 
defining relation is J = ABCE = BCDF = ADEF = ACDG 

BDEG = CEFG = ABFG. This defining relation was used to 


produce the aliases in Table 14-31. For example, the alias rela- 
tionship of A is 


A = BCE = ABCDF = DEF = CDG 
= ABDEG = ACEFG = BFG 


which, if we ignore interactions higher than three factors, 
agrees with Table 14-31. 


For seven factors, we can reduce the number of runs even further. The 2’ * design is an 
eight-run experiment accommodating seven variables. This is a 1/16th fraction and is ob- 
tained by first writing down a 2? design as the basic design in the factors A, B, and C, and then 


Table 14-32 A 2/7? Fractional Factorial Design 


Basic Design 
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Table 14-33 A 2/74 Fractional Factorial Design 


A B C D=AB E = AC F=BC G = ABC 


forming the four new columns from J = ABD, I = ACE, I = BCF, and I = ABCG, as sug- 
gested in Table 14-30. The design is shown in Table 14-33. 

The complete defining relation is found by multiplying the generators together two, three, 
and finally four at a time, producing 


I = ABD = ACE = BCF = ABCG = BCDE = ACDF = CDG = ABEF 
BEG = AFG = DEF = ADEG = CEFG = BDFG = ABCDEFG 


The alias of any main effect is found by multiplying that effect through each term in the 
defining relation. For example, the alias of A is 


A = BD = CE = ABCF = BCG = ABCDE = CDF = ACDG 
= BEF = ABEG = FG = ADEF = DEG = ACEFG = ABDFG = BCDEFG 


This design is of resolution III, since the main effect is aliased with two-factor interactions. If 
we assume that all three-factor and higher interactions are negligible, the aliases of the seven 
main effects are 


€,=A+BD+CE+FG 
(3 = B+ AD+CF+EG 
(co =C + AE + BF + DG 
€p = D+ AB+CG+EF 
(; =E+ AC+BG+ DF 
(fp =F +BC+AG+ DE 
(g =G+CD+ BE+ AF 


This 27,* design is called a saturated fractional factorial, because all the available 
degrees of freedom are used to estimate main effects. It is possible to combine sequences of 
these resolution III fractional factorials to separate the main effects from the two-factor 
interactions. The procedure is illustrated in Montgomery (2009) and in Box, Hunter, and 
Hunter (2005). 


@ @ 
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EXERCISES FOR SECTION 14-7 
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14-41. Consider the problem in Exercise 14-17. Suppose 

that only half of the 32 runs could be made. 

(a) Choose the half that you think should be run. 

(b) Write out the alias relationships for your design. 

(c) Estimate the factor effects. 

(d) Plot the effect estimates on normal probability paper and 
interpret the results. 

(e) Set up an analysis of variance for the factors identified as 
potentially interesting from the normal probability plot in 
part (d). 

(f) Analyze the residuals from the model. 

(g) Provide a practical interpretation of the results. 


14-42. Suppose that in Exercise 14-20 it was possible to 
run only a } fraction of the 2* design. Construct the design and 
use only the data from the eight runs you have generated to 
perform the analysis. 


14-43. Anarticle by L. B. Hare [“In the Soup: A Case Study 
to Identify Contributors to Filling Variability,’ Journal of 
Quality Technology (Vol. 20, pp. 36-43)] describes a factorial 
experiment used to study filling variability of dry soup mix 
packages. The factors are A = number of mixing ports through 
which the vegetable oil was added (1, 2), B = temperature 
surrounding the mixer (cooled, ambient), C = mixing time 
(60, 80 sec), D = batch weight (1500, 2000 Ib), and E = number 
of days of delay between mixing and packaging (1, 7). Between 
125 and 150 packages of soup were sampled over an eight-hour 
period for each run in the design, and the standard deviation of 
package weight was used as the response variable. The design 
and resulting data follow. 


A B C D E v 
Std Mixer Batch Std 
Order Ports Temp Time Weight Delay Dev 
1 1.13 
2 1.25 
3 0.97 
4 1.7 
5 1.47 
6 1.28 
7 1.18 
8 0.98 
9 0.78 
10 1.36 
11 1.85 
12 0.62 
13 1.09 
14 1.1 
15 0.76 
16 2.1 


(a) What is the generator for this design? 

(b) What is the resolution of this design? 

(c) Estimate the factor effects. Which effects are large? 

(d) Does a residual analysis indicate any problems with the 
underlying assumptions? 

(e) Draw conclusions about this filling process. 

14-44. Montgomery (2009) described a 2*7! fractional fac- 

torial design used to study four factors in a chemical process. 

The factors are A = temperature, B = pressure, C = concen- 

tration, and D = stirring rate, and the response is filtration 

rate. The design and the data are as follows: 


Treatment _ Filtration 

Run A B C D=ABC_ Combination Rate 

1 = = = = (1) 45 

2 ad 100 

3 bd 45 

4 ab 65 

3) cd 75 

6 ac 60 

7 bc 80 

8 + abcd 96 


(a) Write down the alias relationships. 

(b) Estimate the factor effects. Which factor effects appear 
large? 

(c) Project this design into a full factorial in the three apparently 
important factors and provide a practical interpretation of 
the results. 


14-45. R.D. Snee (“Experimenting with a Large Number of 
Variables,” in Experiments in Industry: Design, Analysis and 
Interpretation of Results, Snee, Hare, and Trout, eds., ASQC, 
1985) described an experiment in which a 2° design with J = 
ABCDE was used to investigate the effects of five factors on 
the color of a chemical product. The factors are A = 
solvent/reactant, B = catalyst/reactant, C = temperature, D = 
reactant purity, and E = reactant pH. The results obtained are 
as follows: 


e = — 0.63 d = 6.79 
a= 251 ade = 6.47 
b = — 2.68 bde = 3.45 
abe = 1.66 abd = 5.68 
c= 2.06 cde = 5.22 
ace= 1.22 acd = 4,38 
bce = — 2.09 bcd = 4.30 
abc = 1.93 abcde = 4.05 


(a) Prepare a normal probability plot of the effects. Which 
factors are active? 


(b) Calculate the residuals. Construct a normal probability 
plot of the residuals and plot the residuals versus the fitted 
values. Comment on the plots. 

(c) If any factors are negligible, collapse the 2°"! design into 
a full factorial in the active factors. Comment on the re- 
sulting design, and interpret the results. 

14-46. An article in Quality Engineering [“A Com- 

parison of Multi-response Optimization: Sensitivity to 

Parameter Selection” (1999, Vol. 11, pp. 405-—415)] con- 

ducted a half replicate of a 2° factorial design to optimize 

the retort process of beef stew MREs, a military ration. 

The design factors are x, = Sauce Viscosity, x. = Residual 

Gas, x3 = Solid/Liquid Ratio, x, = Net Weight, x, = 

Rotation Speed. The response variable is the heating rate 

index, a measure of heat penetration, and there are two 

replicates. 


Heating Rate 
Index 
Run xX} Xp X3 X4 Xs I Il 

1 1 1 1 1 1 8.46 9.61 
2 1 1 1 1 1 15.68 14.68 
3 =Al ial =i —-1 14.94 13.09 
4 1 1 =1 =I 1 12.52 12.71 
5 —1 | eal 1 =i = ily 16.36 
6 1 =I 1 = 1 1144 = 11.83 
q ell 1 1 =il 1 10.45 Qa 
8 1 1 1 = —-1 19.73 16.94 
9) Sl al =i 1 -1l 17.37 = 16.36 
10 1 =I =! 1 1 1498 11.93 
11 = 1 =i 1 1 8.4 8.16 
12 1 1 =! 1 —1 19.08 15.40 
13 Sle el 1 1 1 13.07 10.55 
14 1 =1 1 1 —-1 1857 = 20.53 
15 al 1 1 i =i 20S9 Bile 
16 1 1 1 1 1 1403 11.31 


(a) Estimate the factor effects. Based on a normal probability 
plot of the effect estimates, identify a model for the data 
from this experiment. 

(b) Conduct an ANOVA based on the model identified in part 
(a). What are your conclusions? 

(c) Analyze the residuals and comment on model ade- 
quacy. 

(d) Find a regression model to predict yield in terms of the 
coded factor levels. 

(e) This experiment was replicated, so an ANOVA could have 
been conducted without using a normal plot of the effects 
to tentatively identify a model. What model would be 
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appropriate? Use the ANOVA to analyze this model and 
compare the results with those obtained from the normal 
probability plot approach. 


14-47. An article in Industrial and Engineering Chemistry 
(“More on Planning Experiments to Increase Research 
Efficiency” (1970, pp. 60-65)] uses a 2° -* design to investigate 
the effect on process yield of A = condensation temperature, 
B = amount of material 1, C = solvent volume, D = conden- 
sation time, and E = amount of material 2. The results obtained 
are as follows: 


e = 23.2 cd = 23.8 
ab = 15.5 ace = 23.4 
ad = 16.9 bde = 16.8 
be = 16.2 abcde = 18.1 


(a) Verify that the design generators used were / = ACE and 
I= BDE. 

(b) Write down the complete defining relation and the aliases 
from the design. 

(c) Estimate the main effects. 

(d) Prepare an analysis of variance table. Verify that the AB 
and AD interactions are available to use as error. 

(e) Plot the residuals versus the fitted values. Also construct a 
normal probability plot of the residuals. Comment on the 
results. 


14-48. Suppose that in Exercise 14-14 only a % fraction 
of the 2° design could be run. Construct the design and analyze 
the data that are obtained by selecting only the response for 
the eight runs in your design. 


14-49. For each of the following designs write down the 
aliases, assuming that only main effects and two factor inter- 
actions are of interest. 

(a) 2? (b) 2iv* 

14-50. Consider the 2°? design in Table 14-29. 

(a) Suppose that after analyzing the original data, we find that 
factors C and E can be dropped. What type of 2* design is 
left in the remaining variables? 

(b) Suppose that after the original data analysis, we find that 
factors D and F can be dropped. What type of 2* design 
is left in the remaining variables? Compare the results 
with part (a). Can you explain why the answers are dif- 
ferent? 


14-51. An article in the Journal of Radioanalytical and 
Nuclear Chemistry (2008, Vol. 276, No. 2, pp. 323-328) 
presented a 2°‘ fractional factorial design to identify 
sources of Pu contamination in the radioactivity material 
analysis of dried shellfish at the National Institute of 
Standards and Technology (NIST). The data are shown in 
the following table. No contamination occurred at runs 1, 4, 
and 9. 


s 
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2°4 Glassware Reagent SamplePrep Tracer Dissolution Hood Chemistry Ashing Response, mBq 

Run xX, Xp X3 ey Xs5 X6 x7 Xg y 

1 —1 =1 =] =] =] =] =] —-1 0 

2, aril =I =I =I = +1 +1 =P il 3),3)l 

3 =1 +1 =] =] +1 =] +1 +1 0.0373 
4 ail +1 el a1 qe aril = = 0 

5 =1 =1 +1 = +1 +1 +1 =1 0.0649 
6 +1 = +1 Al atl =l sal +1 0.133 
7 1 +1 +1 =] =1 +1 -1 +1 0.0461 

8 aril +1 +1 =I = =I +1 =Il 0.0297 
9 =1 =] =] +1 +1 +1 =] +1 0 
10 ail el = +1 +1 = 1 +1 Sl 0.287 
11 =1 +1 = +1 -1 +1 +1 =! 0.133 
12 aril +1 =1 +1 =1 al al +P il 0.0476 
13 =1 =1 +1 +1 =1 =] +1 +1 0.133 
14 +1 =l +1 +1 il aril al =l Sls 
15 =1 +1 +1 +1 +1 —1 =] =I 0.0153 
16 +1 aril +1 +1 4 ll +1 +1 +1 2.47 


The factors and levels are shown in the following table. 5 = S25 sb Sa Sh 14.7 
6 1 =1 1-1 1-1 =1 22.3 
Hace sae ae 7-1 1 d-1-1 1-1 16.1 
Glassware Distilled water Soap, acid, stored 8 1 1.4.4 | 02.1 
Reagent New Old 
Sample prep Co-precipitation Electrodeposition 
Tauer Stouk Fresh The factors and levels are shown in the following table. 
Dissolution Without With 
Hood B A Factor ll +1 
Chemistry Without With A _ Television No advertising Advertising 
Ashing Without With eerste 
B Billboard No advertising Advertising 
; ; ; ; advertising 
(a) Was down the alas TelaHOnAHIDS C Newspaper No advertising Advertising 
(b) Estimate the main effects. advertising 
(c) Prepare a normal probability plot for the effects and inter- . . 
Candy wrapper Conservative Flashy design 
pret the results. : é 
Soa ; design design 
14-52. An article in the Journal of Marketing Research = : 1 shelf el 
(1973, Vol. 10, No. 3, pp. 270-276) presented a 2’~* fractional Disp ayeasign INGA Ge SIP SCEN GUSTS 
display display 


factorial design to conduct marketing research: 


F Free sample No free samples Free samples 


program 
Sa Sales for a 6-week eae: ean ea 
Runs A B C D E F G_ period (in $1000) ane pas cea ta 
candy bar 
LS SiS a it a a 8.7 
2 Pa fata : 4 15.1 (a) Write down the alias relationships. 
(b) Estimate the main effects. 
3 bose ee 9.7 (c) Prepare a normal probability plot for the effects and inter- 
4 1 1-1 1-1 -1-1 11.3 pret the results. 
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14-8 RESPONSE SURFACE METHODS AND DESIGNS 


Response surface methodology, or RSM, is a collection of mathematical and statistical tech- 
niques that are useful for modeling and analysis in applications where a response of interest is 
influenced by several variables and the objective is to optimize this response. For example, 
suppose that a chemical engineer wishes to find the levels of temperature (x,) and feed 
concentration (x,) that maximize the yield (y) of a process. The process yield is a function of 
the levels of temperature and feed concentration, say, 


Y = f (x1, x2) +e 


where € represents the noise or error observed in the response Y. If we denote the expected 
response by E(Y) = f(x), x2) = 1, then the surface represented by 


1 =f (x1, %2) 


is called a response surface. 

We may represent the response surface graphically as shown in Fig. 14-42, where 1 is 
plotted versus the levels of x; and x». Notice that the response is represented as a surface plot 
in a three-dimensional space. To help visualize the shape of a response surface, we often plot 
the contours of the response surface as shown in Fig. 14-43. In the contour plot, lines of con- 
stant response are drawn in the x,, x, plane. Each contour corresponds to a particular height of 
the response surface. The contour plot is helpful in studying the levels of x, and x, that result 
in changes in the shape or height of the response surface. 

In most RSM problems, the form of the relationship between the response and the inde- 
pendent variables is unknown. Thus, the first step in RSM is to find a suitable approximation for 
the true relationship between Y and the independent variables. Usually, a low-order polynomial 
in some region of the independent variables is employed. If the response is well modeled by a lin- 
ear function of the independent variables, the approximating function is the first-order model 


Y= Bo + Bx, + Box a BX, +e (14-21) 


If there is curvature in the system, then a polynomial of higher degree must be used, such as 
the second-order model 


Y= Bo >) Bix; >) Bux? + SD Byxix; + € (14-22) 


i<j 
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Figure 14-42 A three-dimensional response surface showing the Figure 14-43 A contour plot of the yield response 


expected yield as a function of temperature and feed concentration. surface in Figure 14-42. 
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Many RSM problems use one or both of these approximating polynomials. Of course, it is 
unlikely that a polynomial model will be a reasonable approximation of the true functional re- 
lationship over the entire space of the independent variables, but for a relatively small region 
they usually work quite well. 

The method of least squares, discussed in Chapters 11 and 12, is used to estimate the pa- 
rameters in the approximating polynomials. The response surface analysis is then done in 
terms of the fitted surface. If the fitted surface is an adequate approximation of the true 
response function, analysis of the fitted surface will be approximately equivalent to analysis of 
the actual system. 

RSM is a sequential procedure. Often, when we are at a point on the response surface 
that is remote from the optimum, such as the current operating conditions in Fig. 14-43, 
there is little curvature in the system and the first-order model will be appropriate. Our 
objective here is to lead the experimenter rapidly and efficiently to the general vicinity of 
the optimum. Once the region of the optimum has been found, a more elaborate model 
such as the second-order model may be employed, and an analysis may be performed to 
locate the optimum. From Fig. 14-43, we see that the analysis of a response surface can 
be thought of as “climbing a hill,’ where the top of the hill represents the point of maxi- 
mum response. If the true optimum is a point of minimum response, we may think of 
“descending into a valley.” 

The eventual objective of RSM is to determine the optimum operating conditions for the 
system or to determine a region of the factor space in which operating specifications are 
satisfied. Also, note that the word “optimum” in RSM is used in a special sense. The “hill 
climbing” procedures of RSM guarantee convergence to a local optimum only. 


Method of Steepest Ascent 

Frequently, the initial estimate of the optimum operating conditions for the system will be far 
from the actual optimum. In such circumstances, the objective of the experimenter is to move 
rapidly to the general vicinity of the optimum. We wish to use a simple and economically ef- 
ficient experimental procedure. When we are remote from the optimum, we usually assume 
that a first-order model is an adequate approximation to the true surface in a small region of 
the x’s. 

The method of steepest ascent is a procedure for moving sequentially along the path of 
steepest ascent, that is, in the direction of the maximum increase in the response. Of course, if 
minimization is desired, we are talking about the method of steepest descent. The fitted 
first-order model is 


k 
3 = Bo + Dd Bix (14-23) 
i=l 


and the first-order response surface, that is, the contours of ¥, is a series of parallel lines such as 
that shown in Fig. 14-44. The direction of steepest ascent is the direction in which y increases 
most rapidly. This direction is normal to the fitted response surface contours. We usually take 
as the path of steepest ascent the line through the center of the region of interest and normal 
to the fitted surface contours. Thus, the steps along the path are proportional to the regression 
coefficients {p i}. The experimenter determines the actual step size based on process knowl- 
edge or other practical considerations. 

Experiments are conducted along the path of steepest ascent until no further increase in 
response is observed. Then a new first-order model may be fit, a new direction of steepest as- 
cent determined, and further experiments conducted in that direction until the experimenter 
feels that the process is near the optimum. 


%2 


Path of 
steepest 
ascent 


Region of fitted 
first-order response 


Figure 14-44  First- 
order response surface 
and path of steepest 
ascent. 


30 


A 
y= 


EXAMPLE 14-11 Process Yield Steepest Ascent 

In Example 14-6 we described an experiment on a chemical 
process in which two factors, reaction time (x,) and reaction 
temperature (x), affect the percent conversion or yield (Y). 
Figure 14-27 shows the 2? design plus five center points used 
in this study. The engineer found that both factors were 
important, there was no interaction, and there was no curva- 
ture in the response surface. Therefore, the first-order model 


Y = Bo + Bix, + Bax. + € 


1 160.0 
eet 
158.3 
41.00 
Ze 156.7 
2g 
8 40.50 
20 155.0 
£ 
g 
& 153.3 40.00 
151.7 
39.50 
-1 150.0 
30.00 31.67 33.33 35.00 36.67 38.33 40.00 
| 0) +1 
x, (time) 


Contour plot 
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= 50 


x1 


should be appropriate. Now the effect estimate of time is 1.55 
hours and the effect estimate of temperature is 0.65°F, and 
since the regression coefficients 8, and 8, are one-half of the 
corresponding effect estimates, the fitted first-order model is 


$ = 40.44 + 0.775x, + 0.325x> 


Figure 14-45(a) and (b) show the contour plot and three- 
dimensional surface plot of this model. Figure 14-45 also 


45.00 S&S 
Sse 
eseses 
matateseces 
43.00 SB Ssosoc8 
Sess 
41.00 
Conversion 
$25 
39.00 SS 
KEES 3% 
160.0 
158.0 40.00 
156.0 a ee +1 
ee 34.00 g 
Xp (temperature) : 32.00 x1 (time) 


150.0 30.00 
-1 -1 


Three-dimensional surface plot 
(b) 


Figure 14-45 Response surface plots for the first-order model in Example 14-11. 
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Point A: 40 minutes, 157°F, y = 40.5 
Point B: 45 minutes, 159°F, y = 51.3 
Point C: 50 minutes, 161°F, y = 59.6 
Point D: 55 minutes, 163°F, y = 67.1 
Point H: 60 minutes, 165°F, y = 63.6 
Point F: 65 minutes, 167°F, y = 60.7 


70 


Figure 14-46 Steepest ascent experiment for Example 14-11. 


shows the relationship between the coded variables x, and x, 
(that defined the high and low levels of the factors) and the 
original variables, time (in minutes) and temperature (in °F). 

From examining these plots (or the fitted model), we see 
that to move away from the design center—the point (x, = 0, 
xX, = 0)—along the path of steepest ascent, we would move 
0.775 unit in the x, direction for every 0.325 unit in the x, 
direction. Thus, the path of steepest ascent passes through the 
point (x, = 0, x. = 0) and has a slope 0.325/0.775. The engi- 
neer decides to use 5 minutes of reaction time as the basic step 
size. Now, 5 minutes of reaction time is equivalent to a step in 
the coded variable x, of Ax, = 1. Therefore, the steps along 
the path of steepest ascent are Ax, = 1.0000 and Ax, = 


(0.325/0.775)Ax, = 0.42. A change of Ax, = 0.42 in the 
coded variable x, is equivalent to about 2°F in the original 
variable temperature. Therefore, the engineer will move along 
the path of steepest ascent by increasing reaction time by 
5 minutes and temperature by 2°F. An actual observation on 
yield will be determined at each point. 

Next Steps: Figure 14-46 shows several points along this 
path of steepest ascent and the yields actually observed from 
the process at those points. At points A—D the observed yield 
increases steadily, but beyond point D, the yield decreases. 
Therefore, steepest ascent would terminate in the vicinity of 
55 minutes of reaction time and 163°F with an observed per- 
cent conversion of 67%. 


Analysis of a Second-Order Response Surface 
When the experimenter is relatively close to the optimum, a second-order model is usually 
required to approximate the response because of curvature in the true response surface. The 


fitted second-order model is 


k k 
3 =Bot+ Dd Bix + Dy Biixz + DD Biyxi3; 
= i= 


i<j 


where B denotes the least squares estimate of B. In this section we show how to use this fitted 
model to find the optimum set of operating conditions for the x’s and to characterize the nature 


of the response surface. 


EXAMPLE 14-12 Process Yield Central Composite Design 


Continuation of Example 14-11 

Consider the chemical process from Example 14-11, where 
the method of steepest ascent terminated at a reaction time of 
55 minutes and a temperature of 163°F. The experimenter 
decides to fit a second-order model in this region. Table 14-34 
and Fig. 14-47 show the experimental design, which consists of 


a 2° design centered at 55 minutes and 165°F, five center points, 
and four runs along the coordinate axes called axial runs. This 
type of design is called a central composite design, and it is a 
very popular design for fitting second-order response surfaces. 

Two response variables were measured during this 
phase of the experiment: percentage conversion (yield) and 
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Table 14-34 Central Composite Design for Example 14-12 


Conversion Viscosity 
Observation Time Temperature Coded Variables (percent) (mPa-sec) 
Number (minutes) (°F) xX) Xp Response 1 Response 2 
1 50 160 = = 65.3 35 
2 60 160 1 =] 68.2 39 
3 50 170 all 1 66 36 
4 60 170 1 1 69.8 43 
5) 48 165 —1.414 0 64.5 30 
6 62 165 1.414 0 69 44 
7 55 158 0 —1.414 64 31 
8 55 172 0 1.414 68.5 45 
9 55 165 0 0 68.9 37 
10 Be) 165 0 0 69.7 34 
11 5p) 165 0 0 68.5 35 
12 55 165 0 0 69.4 36 
13 55 165 0 0 69 37 
viscosity. The least-squares quadratic model for the yield The viscosity response is adequately described by the 
response is first-order model 
J} = 69.1 + 1.633x, +1.083x, — 0.96927 — 1.219%3 3 = 37.08 + 3.85x, + 3.10x, 


+ 0.225x 1x2 
Table 14-36 summarizes the analysis of variance for this model. 


The analysis of variance for this model is shown in Table 14-35. The response surface is shown graphically in Fig. 14-49. Notice 


Figure 14-48 shows the response surface contour plot and that viscosity increases as both time and temperature increase. 
the three-dimensional surface plot for this model. From exam- Practical Interpretation: As in most response surface 
ination of these plots, the maximum yield is about 70%, problems, the experimenter in this example had conflicting 
obtained at approximately 60 minutes of reaction time and objectives regarding the two responses. The objective was to 
167°F. maximize yield, but the acceptable range for viscosity was 

7) 
+2 
@ (0, 1.414) 
(-1, 1) (1, 1) 
(-1.414, 0) (1.414, 0) 
e e e 
-2 (0, 0) +20 xy 
(-1, -1) (1, -1) 
@ (0, -1.414) 


Figure 14-47 Central 
composite design for 9 
Example 14-12. 
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Table 14-35 Analysis of Variance for the Quadratic Model, Yield Response 


Source of Sum of Degrees of Mean 
Variation Squares Freedom Square to P-Value 
Model 45.89 5 9.178 14.93 0.0013 
Residual 4.30 7 0.615 
Total 50.19 12 
Independent Coefficient Standard t for Hy 
Variable Estimate Error Coefficient = 0 P-Value 
Intercept 69.100 0.351 197.1 
xy 1.633 0.277 5.891 0.0006 
Xo 1.083 0.277 3.907 0.0058 
xt —0.969 0.297 —3,259 0.0139 
x —1.219 0.297 —4.100 0.0046 
X Xo 0.225 0.392 0.5740 0.5839 
1 <t 
70.12 SoeSSees 
67.23 


Xp (temperature) 
fo) 


MLL LL], 
77. 
64.34 LLL, 


Conversion LLL LIX OQQY 
61.45 
172.0 
169.2 62.00 
#1 166.4 59.20 +1 
-1 : 163.6 56.40 
48.00 50.33 52.67 55.00 57.33 59.67 62.00 0 160 8 53.60 
=1 0 real Xp (temperature) oe AO x (time) 
x1 (time) -1 -1 
Contour plot Surface plot 
(a) (b) 
Figure 14-48 Response surface plots for the yield response, Example 14-12. 
Table 14-36 Analysis of Variance for the First-Order Model, Viscosity Response 
Source of Sum of Degrees of Mean 
Variation Squares Freedom Square ho P-Value 
Model 195.4 2 ee 15.89 0.0008 
Residual 61.5 10 6.15 
Total 256.9 12 
Independent Coefficient Degrees of Standard t for Hy 
Variable Estimate Freedom Error Coefficient = 0 P-Value 
Intercept 37.08 1 0.69 53.91 
xy 3.85 1 0.88 4.391 0.0014 
Xy 3.10 1 0.88 3.536 0.0054 
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Figure 14-49 Response surface plots for the viscosity response, Example 14-12. 


38 = yy = 42. When there are only a few independent vari- conversion, y, = 38, and y, = 42 highlighted. The shaded ar- 


ables, an easy way to solve this problem is to overlay the re- eas on this plot identify unfeasible combinations of time and 
sponse surfaces to find the optimum. Figure 14-50 shows the temperature. This graph shows that several combinations of 
overlay plot of both responses, with the contours y,; = 69% time and temperature will be satisfactory. 


Example 14-12 illustrates the use of a central composite design (CCD) for fitting a 
second-order response surface model. These designs are widely used in practice because they 
are relatively efficient with respect to the number of runs required. In general, a CCD in k 
factors requires 2* factorial runs, 2k axial runs, and at least one center point (three to five cen- 
ter points are typically used). Designs for k = 2 and k = 3 factors are shown in Fig. 14-51. 

The central composite design may be made rotatable by proper choice of the axial 
spacing a in Fig. 14-51. If the design is rotatable, the standard deviation of predicted response 
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x2 x2 


@ (0, a) @ 


(-1, +1) (+1, +1) 


° ° ° 
(-a, 0) (0, 0) (a,0) 1 
(-1, -1) (+1, -1) 
Figure 14-51 Central 
composite designs for @ (0, -a) C) 
k = 2andk = 3. 
y is constant at all points that are the same distance from the center of the design. For rotata- 
bility, choose a = (F)'/*, where F is the number of points in the factorial part of the design 
(usually F = 2*). For the case of k = 2 factors, a = (2457 4 = 1.414, as was used in the 
design in Example 14-12. Figure 14-52 presents a contour plot and a surface plot of the stan- 
dard deviation of prediction for the quadratic model used for the yield response. Notice that 
the contours are concentric circles, implying that yield is predicted with equal precision for all 
points that are the same distance from the center of the design. Also, as one would expect, the 
precision decreases with increasing distance from the design center. 
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(a) (b) 
Figure 14-52 Plots of constant VV V(¥) for a rotatable central composite design. 


EXERCISES FOR SECTION 14-8 


14-53. Anarticle in Rubber Age (Vol. 89, 1961, pp.453—458) and temperature (°C). These factors are coded as x, = (time — Ss 
describes an experiment on the manufacture of a product in 12)/8 and x, = (temperature — 250)/30. The following data I 
which two factors were varied. The factors are reaction time (hr) were observed where y is the yield (in percent): 


Run 
Number x) Xp y 
1 =|| 0 83.8 
2 1 0 81.7 
3} 0 0 82.4 
4 0 0 82.9 
5 0 =|| 84.7 
6 0 1 75.9 
7 0 0 81.2 
8 —1.414 —1.414 81.3 
9 —1.414 1.414 83.1 
10 1.414 —1.414 85.3 
11 1.414 1.414 IPE! 
12 0 0 82.0 


(a) Plot the points at which the experimental runs were made. 

(b) Fit a second-order model to the data. Is the second-order 
model adequate? 

(c) Plot the yield response surface. What recommendations 
would you make about the operating conditions for this 
process? 

14-54. Anarticle in Quality Engineering (“Mean and Variance 

Modeling with Qualitative Responses: A Case Study” (1998— 

1999, Vol. 11, pp. 141-148)] studied how three active ingredi- 

ents of a particular food affect the overall taste of the product. 

The measure of the overall taste is the overall mean liking score 

(MLS). The three ingredients are identified by the variables x,, 

Xp, and x3. The data are shown in the following table. 


@ 


Run xy xX x3 MLS 
1 1 1 =| 6.3261 
2 1 1 1 6.2444 
3 0 0 0 6.5909 
4 0 —l 0 6.3409 
5 1 =| 1 5.907 
6 1 —l —l 6.488 
7 0 0 =| 5.9773 
8 0 1 0 6.8605 
9 = =| 1 6.0455 

10 0 0 1 6.3478 
11 1 0 0 6.7609 
12 1 1 1 5.7727 
13 = 1 = 6.1805 
14 —l 1 1 6.4894 
15 = 0 0 6.8182 


(a) Fit a second-order response surface model to the data. 

(b) Construct contour plots and response surface plots for 
MLS. What are your conclusions? 

(c) Analyze the residuals from this experiment. Does your 
analysis indicate any potential problems? 

(d) This design has only a single center point. Is this a good 
design in your opinion? 
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Consider the first-order model 


3 = 504+ 1.5x, — 0.8x 


14-55. 


where —1 = x; = 1. Find the direction of steepest ascent. 


14-56. A manufacturer of cutting tools has developed two 
empirical equations for tool life (y,) and tool cost (y). Both 
models are functions of tool hardness (x,) and manufacturing 
time (x,). The equations are 


y, = 10 
3 = 23 


Sx, 


3x1 


2x9 


4x 2 


and both equations are valid over the range —1.5 = x; = 1.5. 
Suppose that tool life must exceed 12 hours and cost must be 
below $27.50. 

(a) Is there a feasible set of operating conditions? 

(b) Where would you run this process? 

14-57. An article in Tappi (1960, Vol. 43, pp. 38-44) de- 
scribes an experiment that investigated the ash value of paper 
pulp (a measure of inorganic impurities). Two variables, temper- 
ature T in degrees Celsius and time f in hours, were studied, and 
some of the results are shown in the following table. The coded 
predictor variables shown are 


(T — 775) 
W5 ° 


(t — 3) 
15 


= x2 = 


and the response y is (dry ash value in %) X 10°. 


x yy a y 
= =) 0 -1.5 168 
1 -1 92 0 15. 179 
=) 1 216 ee) 
1 1 99 0 0 175 
=15 (me) a ake, 
1.5 0 48 0 0 146 


(a) What type of design has been used in this study? Is the 
design rotatable? 

(b) Fit a quadratic model to the data. Is this model satisfactory? 

(c) If it is important to minimize the ash value, where would 
you run the process? 


14-58. In their book Empirical Model Building and 
Response Surfaces (John Wiley, 1987), Box and Draper 
described an experiment with three factors. The data shown in 
the following table are a variation of the original experiment 
on page 247 of their book. Suppose that these data were col- 
lected in a semiconductor manufacturing process. 
(a) The response y, is the average of three readings on resistiv- 
ity for a single wafer. Fit a quadratic model to this response. 
(b) The response y, is the standard deviation of the three resis- 
tivity measurements. Fit a linear model to this response. 
(c) Where would you recommend that we set x,, x,, and x; if 
the objective is to hold mean resistivity at 500 and mini- 
mize the standard deviation? 


=~ 


(_ 


i) 


628 CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


x x2 x3 Ji Y2 ma x2 x3 oat M2 
1 1 1 24.00 12.49 1 0 0 501.67 92.50 
0 =] | 120.33 8.39 -1 1 0 264.00 63.50 
1 =1 il 213.67 42.83 0 1 0 427.00 88.61 
—-1 0 =1 86.00 3.46 1 1 0 730.67 21.08 
0 0 il 136.63 80.41 =I il 1 220.67 133.82 
1 0 = 340.67 16.17 -1 1 239.67 23.46 
=i 1 el 112.33 27.57 1 = 1 422.00 18.52 
0 1 =1 256.33 4.62 -1 0 1 199.00 29.44 
1 1 il 271.67 23.63 0 0 1 485.33 44.67 
=1 =1 0 81.00 0.00 1 0 1 673.67 158.21 
0 =I 0 101.67 17.67 il 1 1 176.67 55.51 
1 =1 0 357.00 32.91 0 1 1 501.00 138.94 
= 0 0 171.33 15.01 1 1 1 1010.00 142.45 
0 0 0 372.00 0.00 
14-59. Consider the first-order model 14-60. An article in the Journal of Materials Processing 
y = 12 + 12x, — 21x + 1.6x3 — 0.6x4 Technology (1997, Vol. 67, pp. 55-61) used response surface 


where —1 = x, =1. 


(a) Find the direction of steepest ascent. 


(b) Assume the current design is centered at the point (0, 0, 0, 0). 
Determine the point that is three units from the current center 


point in the direction of steepest ascent. 


methodology to generate surface roughness prediction models 
are shown in 


for turning EN 24T steel (290 BHN). The data 
the following table. 


Speed, V Feed, Depth of cut, Coding Surface roughness, 

Trial (mmin'') f(mmrev ') d (mm) xy Xp X3 R, (wm) 

1 36 0.15 0.50 1 1 1 1.8 

2 117 0.15 0.50 1 -1 -1 1.233 

3 36 0.40 0.50 -1 1 —1 53 

4 117 0.40 0.50 1 1 —1 5.067 

5 36 0.15 1.125 =| =] 1 2.133 

6 117 0.15 1.125 1 -1 1 1.45 

7 36 0.40 1.125 -1 1 1 6.233 

8 117 0.40 1.125 1 1 1 5.167 

9 65 0.25 0.75 0 0 0 2.433 
10 65 0.25 0.75 0 0 0 2.3 
11 65 0.25 0.75 0 0 0 2.367 
12 65 0.25 0.75 0 0 0 2.467 
13 28 0.25 0.75 -vV2 0 0 3.633 
14 150 0.25 0.75 V2 0 0 2.767 
15 65 0.12 0.75 0 -Vv2 0 1.153 
16 65 0.50 0.75 0 V2 0 6.333 
17 65 0.25 0.42 0 0 -VvV2 2.533 
18 65 0.25 1.33 0 0 V2 3.20 
19 28 0.25 0.75 -vV2 0 0 3.233 
20 150 0.25 0.75 V2 0 0 2.967 
21 65 0.12 0.75 0 -Vv2 0 1.21 
22 65 0.50 0.75 0 V2 0 6.733 
23 65 0.25 0.42 0 0 -Vv2 2.833 
24 65 0.25 1.33 0 0 V2 3.267 


The factors and levels for the experiment are shown in the fol- 
lowing table. 
(a) Plot the points at which the experimental runs were made. 


14- 
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(b) Fit both first-and second-order models to the data. Comment 
on the adequacies of these models. 

(c) Plot the roughness response surface for the second-order 
model and comment. 


Levels Lowest Low Center High Highest 
Coding =Va —1 0 1 V2 
Speed, V (m min“ ') 28 36 65 117 150 
Feed, f(mm rev ') 0.12 0.15 0.25 0.40 0.50 
Depth of cut, d (mm) 0.42 0.50 0.75 1.125 1.33 
Supplemental Exercises Timeaninntesy 
am) 14-61. An article in Process Engineering (1992, No. 71, Gear Type 90 120 
= pp. 46-47) presents a two-factor factorial experiment used to 20-tooth 0.0265 0.0560 
investigate the effect of pH and catalyst concentration on prod- 0.0340 0.0650 
uct viscosity (cSt). The data are as follows: : , 
24-tooth 0.0430 0.0720 
: 0.0510 0.0880 
Catalyst Concentration 
an 25 7 28-tooth 0.0405 0.0620 
5.6 192, 199, 189, 198 178, 186, 179, 188 pl ka 
5.9 185, 193, 185, 192 197, 196, 204, 204 


(a) Test for main effects and interactions using a = 0.05. 
What are your conclusions? 
(b) Graph the interaction and discuss the information provided 
by this plot. 
(c) Analyze the residuals from this experiment. 
mm) 14-62. Heat treating of metal parts is a widely used manu- 
= facturing process. An article in the Journal of Metals (Vol. 41, 
1989) describes an experiment to investigate flatness distortion 
from heat treating for three types of gears and two heat-treating 
times. The data are as follows: 
(a) Is there any evidence that flatness distortion is different 
for the different gear types? Is there any indication that 


heat treating time affects the flatness distortion? Do these 
factors interact? Use a = 0.05. 

(b) Construct graphs of the factor effects that aid in drawing 
conclusions from this experiment. 

(c) Analyze the residuals from this experiment. Comment on 
the validity of the underlying assumptions. 


14-63. An article in the Textile Research Institute Journal 
(1984, Vol. 54, pp. 171-179) reported the results of an experi- 
ment that studied the effects of treating fabric with selected (qq 
inorganic salts on the flammability of the material. Two appli- 
cation levels of each salt were used, and a vertical burn test 
was used on each sample. (This finds the temperature at which 
each sample ignites.) The burn test data follow. 


SS 


Salt 
Level Untreated MgCl, NaCl CaCO, CaCl, Na,CO; 
1 812 a2, 739 733 725 751 
827 728 731 728 727 761 
876 764 726 720 719 i553) 
2 945 7194 TAI 786 756 910 
881 760 744 771 781 854 
919 757 727 779 814 848 


(a) Test for differences between salts, application levels, and 
interactions. Use a = 0.01. 

(b) Draw a graph of the interaction between salt and application 
level. What conclusions can you draw from this graph? 


(c) Analyze the residuals from this experiment. 


14-64. An article in the JEEE Transactions on Components, 
Hybrids, and Manufacturing Technology (1992, Vol. 15) de- 
scribes an experiment for investigating a method for aligning 
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optical chips onto circuit boards. The method involves placing 
solder bumps onto the bottom of the chip. The experiment 
used three solder bump sizes and three alignment methods. 
The response variable is alignment accuracy (in micrometers). 
The data are as follows: 


Solder Bump Size Alignment Method 
(diameter in wm) 1 2 3 
4.60 1.55 1.05 
75 4.53 1.45 1.00 
233 2; 0.82 
130 2.44 1.76 0.95 
4.95 Dell 2.36 
260 4.55 2.60 2.46 


(a) Is there any indication that either solder bump size or 
alignment method affects the alignment accuracy? Is 
there any evidence of interaction between these factors? 
Use a = 0.05. 

(b) What recommendations would you make about this process? 

(c) Analyze the residuals from this experiment. Comment on 
model adequacy. 


14-65. Anarticle in Solid State Technology (Vol. 29, 1984, 
pp. 281-284) describes the use of factorial experiments in 
photolithography, an important step in the process of manu- 
facturing integrated circuits. The variables in this experiment 
(all at two levels) are prebake temperature (A), prebake 
time (B), and exposure energy (C), and the response variable 
is delta line width, the difference between the line on the 
mask and the printed line on the device. The data are as fol- 
lows: (1) = —2.30, a = —9.87, b = —18.20, ab = —30.20, 
c = —23.80, ac = —4.30, bc = —3.80, and abc = —14.70. 
(a) Estimate the factor effects. 

(b) Use a normal probability plot of the effect estimates to 
identity factors that may be important. 

(c) What model would you recommend for predicting the 
delta line width response, based on the results of this exper- 
iment? 

(d) Analyze the residuals from this experiment, and comment 
on model adequacy. 


14-66. An article in the Journal of Coatings Technology 

(Vol. 60, 1988, pp. 27-32) describes a 2* factorial design used 

for studying a silver automobile basecoat. The response vari- 

able is distinctness of image (DOI). The variables used in the 

experiment are 

A = Percentage of polyester by weight of polyester/melamine 
(low value = 50%, high value = 70%) 

B = Percentage of cellulose acetate butyrate carboxylate (low 
value = 15%, high value = 30%) 
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C = Percentage of aluminum stearate (low value = 1%, high 
value = 3%) 

D = Percentage of acid catalyst (low value = 0.25%, high 
value = 0.50%) 

The responses are (1) = 63.8, a = 77.6, b = 68.8, ab = 

76.5, c = 72.5, ac = 77.2, bc = 77.7, abc = 84.5, d = 60.6, 

ad = 64.9, bd = 72.7, abd = 73.3, cd = 68.0, acd = 76.3, 

bcd = 76.0, and abcd = 75.9. 

(a) Estimate the factor effects. 

(b) From a normal probability plot of the effects, identify a 
tentative model for the data from this experiment. 

(c) Using the apparently negligible factors as an estimate of 
error, test for significance of the factors identified in part 
(b). Use a = 0.05. 

(d) What model would you use to describe the process, based 
on this experiment? Interpret the model. 

(e) Analyze the residuals from the model in part (d) and com- 
ment on your findings. 


14-67. An article in the Journal of Manufacturing Systems 

(Vol. 10, 1991, pp. 32—40) describes an experiment to investigate 

the effect of four factors, P = waterjet pressure, F = abrasive 

flow rate, G = abrasive grain size, and V = jet traverse speed, on 

the surface roughness of a waterjet cutter. A 2* design follows. 

(a) Estimate the factor effects. 

(b) Form a tentative model by examining a normal probabil- 
ity plot of the effects. 

(c) Is the model in part (b) a reasonable description of the 
process? Is lack of fit significant? Use a = 0.05. 

(d) Interpret the results of this experiment. 

(e) Analyze the residuals from this experiment. 


Factors Surface 
V F P G Roughness 

Run (in/min) (lb/min) (kpsi) (Mesh No.) (wm ) 
1 6 2.0 38 80 104 
2 2 2.0 38 80 98 
3 6 2.0 30 80 103 
4 2 2.0 30 80 96 
5 6 1.0 38 80 137 
6 2 1.0 38 80 112 
ll 6 1.0 30 80 143 
8 2 1.0 30 80 129 
9 6 2.0 38 170 88 
10 2 2.0 38 170 70 
11 6 2.0 30 170 110 
12 2 2.0 30 170 110 
13 6 1.0 38 170 102 
14 2 1.0 38 170 76 
15 6 1.0 30 170 98 
16 2 1.0 30 170 68 


@o@ 


= 


a) 


14-68. Construct a a ' design for the problem in Exercise 
14-66. Select the data for the eight runs that would have been re- 
quired for this design. Analyze these runs and compare your con- 
clusions to those obtained in Exercise 14-66 for the full factorial. 
14-69. Construct a 2h : design for the problem in Exercise 
14-67. Select the data for the eight runs that would have been re- 
quired for this design. Analyze these data and compare your con- 
clusions to those obtained in Exercise 14-67 for the full factorial. 
14-70. Construct a ao design in 16 runs. What are the 
alias relationships in this design? 

14-71. Construct a oe : design in eight runs. What are the 
alias relationships in this design? 

14-72. Anarticle in the Journal of Quality Technology (Vol. 17, 
1985, pp. 198-206) describes the use of a replicated fractional 
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factorial to investigate the effect of five factors on the free height 

of leaf springs used in an automotive application. The factors are 

A = furnace temperature, B = heating time, C = transfer time, 

D = hold down time, and E = quench oil temperature. The data 

are shown in the following table. 

(a) What is the generator for this fraction? Write out the alias 
structure. 

(b) Analyze the data. What factors influence mean free 
height? 

(c) Calculate the range of free height for each run. Is there any 
indication that any of these factors affect variability in free 
height? 

(d) Analyze the residuals from this experiment and comment 
on your findings. 


A B C D E Free Height 
= = = = = 7.78 7.78 7.81 
+ = = oF = 8.15 8.18 7.88 
= ar = ste = 7.50 7.56 7.50 
E ai = = = 7.59 7.56 7.75 
a = =P ste = 7.54 8.00 7.88 
+ = + = = 7.69 8.09 8.06 
os at +P = = 7.56 7.52 744 
+ + + + = 7.56 7.81 7.69 
oe = i = oF 7.50 7.56 7.50 
se = = + + 7.88 7.88 744 
= ap = ar aP 7.50 7.56 7.50 
B + = = of 7.63 TAS 7.56 
= = sp ar =F 7.32 744 744 
fe = of = ab 7.56 7.69 7.62 
= ar ap = aP 7.18 7.18 V5 
+ + + + + 7.81 7.50 7.59 
14-73. Anarticle in Rubber Chemistry and Technology (Vol. (Wodedilevcls 
47, 1974, pp. 825-836) describes an experiment that studies the ae 
Mooney viscosity of rubber to several variables, including silica a a Z 
filler (parts per hundred) and oil filler (parts per hundred). Data = = nok 
typical of that reported in this experiment follow, where ! =I 1é-15 
=I 1 12.87 
a3 ‘ 1 1 13.53 
oe “_ a. i= “— =F =y 13.90 
1 =1 14.88 
; : =1 1 12.25 
(a) What type of experimental design has been used? 1 1 13.35 


(b) Analyze the data and draw appropriate conclusions. 
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14-74. Anarticle in Tropical Science [“Proximate Composition 
of the Seeds of Acacia Nilotica var Adansonti (Bagaruwa) 
and Extraction of Its Protein” (1992, Vol. 32, No. 3, pp. 
263-—268)] reported on research extracting and concentrating 
the proteins of the bagaruwa seeds in livestock feeding in 
Nigeria to eliminate the toxic substances from the seeds. The 
following are the effects of extraction time and flour to 
solvent ratio on protein extractability of the bagaruwa seeds 
in distilled water: 


Percentage of Protein 


Alone Saloni Extracted at Time (min) 


Ratio (w/v) (%) 30 60 90 120 
3 30.5 45.7 30.5 31.0 

36.9 44.3 29.5 22.1 

7 32.9 42.4 28.2 Uh) 

37.5 40.9 27.3 34.1 

11 29.0 39.5 29.0 29.0 

32.7 43.6 30.5 28.4 


All values are means of three determinations. 

(a) Test the appropriate hypotheses and draw conclusions 
using the analysis of variance with a = 0.5. 

(b) Graphically analyze the interaction. 

(c) Analyze the residuals from this experiment. 


14-75. An article in Plant Disease (“Effect of Nitrogen and 
Potassium Fertilizer Rates on Severity of Xanthomonas Blight 
of Syngonium Podophyllum” (1989, Vol. 73, No. 12, pp. 
972-975)] showed the effect of the variable nitrogen and 
potassium rates on the growth of “White Butterfly” and the 
mean percentage of disease. Data representative of that col- 
lected in this experiment is provided in the following table. 


Nitrogen Potassium (mg/pot/wk) 
(mg/pot/wk) 30 90 120 
50 61.0 61.3 45.5 42.5 59.5 58.2 
150 54.5 55.9 53.5 51.9 340 35.9 
250 42.7 404 365 37.4 32.5 33.8 


(a) State the appropriate hypotheses. 

(b) Use the analysis of variance to test these hypotheses with 
a = 0.05. 

(c) Graphically analyze the residuals from this experiment. 

(d) Estimate the appropriate variance components. 


14-76. An article in Biotechnology Progress (2001, Vol. 17, 
pp. 366-368) reported on an experiment to investigate and 
optimize the operating conditions of the nisin extraction in 
aqueous two-phase systems (ATPS). A 2? full factorial design 
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with center points was used to verify the most significant 
factors affecting the nisin recovery. The factor x, was the con- 
centration (% w/w) of PEG 4000 and x, was the concentration 
(% wiw) of Na,SO,. The range and levels of the variables 
investigated in this study are presented below. Nisin extraction 
is a ratio representing the concentration of nisin and this was 
the response y. 


Trial xy X y 
1 13 11 62.874 
2 15 11 76.133 
3 13 13 87.467 
4 15 13 102.324 
5) 14 12) 76.187 
6 14 12 77.523 
W 14 12 76.782 
8 14 12 77.438 
9 14 12 78.742 


(a) Compute an ANOVA table for the effects and test for 
curvature with a = 0.05. Is curvature important in this re- 
gion of the factors? 

(b) Calculate residuals from the linear model and test for 
adequacy of your model. 

(c) Ina new region of factor space a central composite design 
(CCD) was used to perform second order optimization. 
The results are shown in the following table. Fit a second 
order model to this data and make conclusions. 


Coded Uncoded 
Trial xX, X2 xy Xo y 
1 =1 = 15 14 102.015 
2 1 -1 16 14 106.868 
3 =1 1 15 16 108.13 
4 1 1 16 16 110.176 
5) —1.414 0 14.793 15 105.236 
6 1.414 0 16.207. 15 110.289 
7 0 —1.414 15.5 13.586 103.999 
8 0 1.414 15.5 16.414 110.171 
9 0 0 15.5 15 108.044 
10 0 0 15.5 15 109.098 
11 0 0 IS)9) 15 107.824 
12 0 0 15.5 15 108.978 
13 0 0 15.5 15 109.169 
14-77. Anarticle in the Journal of Applied Electrochemistry 


(May 2008, Vol. 38, No. 5, pp. 583-590) presented a 2’~? frac- 
tional factorial design to perform optimization of polybenzim- 
idazole-based membrane electrode assemblies for H, / O, fuel 
cells. The design and data are shown in the following table. 
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Current Density 
Runs A B e D E F G (CD mA cm?) 
1 1 1 1 1 1 1 sl 160 
2 1 1 1 1 1 1 +1 20 
3 1 1 1 1 1 1 =I 80 
4 1 1 1 1 1 1 +1 317 
5 1 1 1 1 1 1 +1 19 
6 1 1 1 1 1 1 —1 4 
7 1 1 1 1 1 1 +1 20 
8 1 1 1 1 1 1 =1 87.7 
9 1 1 1 1 1 1 +1 1100 
10 1 1 1 1 1 1 —1 12 
11 1 1 1 1 +1 = +P il ap) 
12 1 1 1 1 1 1 -1 880 
13 1 1 1 1 aril aril =I 16 
14 1 1 1 1 1 1 +1 20 
15 1 1 1 1 1 1 ul 8 
16 1 1 1 1 +1 +1 +1 15 
The factors and levels are shown in the following table. (a) Write down the alias relationships. 
(b) Estimate the main effects. 
Factor = 4] (c) Prepare a normal probability plot for the effects and inter- 


pret the results. 
(d) Calculate the sum of squares for the alias set that con- 
tains the ABG interaction from the corresponding effect 


A Amount of binder inthe 0.2mgcem? 1 mgcm? 
catalyst layer 


B_ Electrocatalyst loading 0.lmgcm? 1mgcem? estimate. 
Amount of carbon inthe 2 mg cm? 4.5 mg cm? 14-78. An article in Biotechnology Progress (December 
gas diffusion layer 2002, Vol. 18, No. 6, pp. 1170-1175) presented a 27~? frac- 
D_ Hot compaction time 1 min 10 min tional factorial to evaluate factors promoting astaxanthin pro- 
E Compaction temperature 100°C 150°C duction. The data are shown in the following table. 
F Hot compaction load 0.04 toncm* 0.2 ton cm? 
G Amount of PTFE in the 0.lmgem? 1mgem? 
gas diffusion layer 


Weight Content Cellular Content 


Runs A B Cc D E ie G (mg/g) (pg/cell) 
1 1 1 1 1 1 1 = 4.2 10.8 
2 1 1 1 1 1 1 1 4.4 24.9 
3 =I 1 = =I 1 =I 1 7.8 DiS 
4 1 1 —1 1 1 1 1 14.9 36.3 
5) =I =il 1 1 =I =I 1 255 112.6 
6 1 =1 1 —1 1 —1 =1 26.7 159.3 
7 il 1 1 =I il 1 = Al 23.9 145.2 
8 1 1 1 1 1 1 1 21.9 243.2 
9 1 1 1 1 1 1 1 24.3 Well 

10 = 1 1 1 1 —1 =] 20.5 112.2 
11 =il 1 1 = 1 = 10.8 Mk) 
12 - —1 1 =1 1 1 1 20.8 149.7 
13 1 1 =Il al 1 1 Sal 13.5 140.1 
14 - 1 =1 1 = 1 1 10.3 47.3 
15 1 =I sl 1 1 = 1 23 153.2 
16 1 1 1 1 1 1 1 12.1 35.2 
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The factors and levels are shown in the following table. 
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from the design. 
(b) Estimate the main effects. 


interpret the results. 


Factor al aril 
A Nitrogen concentration (mM) 4.06 0 
B Phosphorus concentration (mM) 0.21 0 
C Photon flux density (uE ms *) 100 500 
D Magnesium concentration (mM) 1 0 
E Acetate concentration (mM) 0 15 
F Ferrous concentration (mM) 0 0.45 
G NaCl concentration (mM) Optimal haematococcus medium OES) 
(a) Write down the complete defining relation and the aliases A B vos Se 
1 1 21.33333 6.027714 
(c) Plot the effect estimates on normal probability paper and l 2 20 7.549834 
ce rh - ; : 1 3 32.66667 3.511885 
ieee > rans 
the lubricant used while working (B), and the hardness of 2 2 33 6.557439 
the metal used in the machine (C). Two levels of each factor 2 3 23 10 


are chosen and a single replicate of a 2? experiment is run. 
The rework time data (in hours) are shown in the following 
table. 


Treatment Time 
Combination (in hours) 

(1) Di 

a 34 

b 38 

ab 59 

c 44 

ac 40 

bc 63 
abc 37 


(a) These treatments cannot all be run under the same condi- 
tions. Set up a design to run these observations in two 
blocks of four observations each, with ABC confounded 
with blocks. 

(b) Analyze the data. 

14-80 Consider the following results from a two-factor ex- 


periment with two levels for factor A and three levels for fac- 
tor B. Each treatment has three replicates. 


(a) Calculate the sum of squares for each factor and the inter- 
action. 

(b) Calculate the sum of squares total and error. 

(c) Complete an ANOVA table with F-statistics. 

14-81. Consider the following ANOVA table from a two- 

factor factorial experiment. 


Two-way ANOVA: y versus A, B 


Source DF SS MS F P 
A 3 1213770 404590 2? 0.341 
B 2 ? 17335441 58.30 0.000 
Error t 1784195 ? 

Total 11 37668847 


(a) How many levels of each factor were used in the experiment? 

(b) How many replicates were used? 

(c) What assumption is made in order to obtain an estimate of 
error? 

(d) Calculate the missing entries (denoted with “?’’) in the 
ANOVA table. 


14-82. An article in Process Biochemistry (Dec. 1996, Vol. 
31, No. 8, pp. 773-785) presented a 27-3 fractional factorial to 


perform optimization of manganese dioxide bioleaching me- C | 


dia. The data are shown in the following table. 


i) 
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Manganese 
Extraction 
Runs A B G D E F G Yield (%) 
1 1 1 1 1 1 1 1 99.0 
2 1 1 1 1 1 =i] 1 97.4 
3 al 1 =I =I 1 1 1 OT 
4 1 1 1 1 1 1 =] 90.0 
5 =I =I 1 =I 1 1 =i 100.0 
6 1 =1 1 = =1 1 1 98.0 
v7 =i 1 1 1 1 1 1 90.0 
8 1 1 1 —1 1 =i] -1 93.5 
g) 1 1 1 1 al 1 1 100.0 
10 1 —1 —1 1 1 1 =1 98.6 
11 =il 1 =il 1 1 =l =I 97.1 
12 1 1 —1 1 =1 =1 1 92.4 
13 =i =Il 1 1 1 Sl 1 93.0 
14 1 —1 1 1 1 1 1 95.0 
15 =i 1 1 1 =I 1 =i 97.0 
16 1 1 1 1 1 1 1 98.0 
The factors and levels are shown in the following table. (c) Plot the effect estimates on normal probability paper and 


interpret the results 
(d) Calculate the missing entries (denoted with “?”’) in the fol- 


Factor = +1 : so : ‘ 
lowing ANOVA table from Minitab with two-way interac- 
A Mineral concentration (%) 10 20 tions. 
B Molasses (g/liter) 100 200 
C NH,NO, pie 25) 2.50 aue DE 5S MS F P 
D KH,PO, (g/liter) 0.75 1.50 - 5 5 
E MgSO, (g/liter) 05 1.00 Main Effects 7 95.934 oie ? ? 
F Yeast extract (g/liter) 0.20 0.50 2-Way : 7 67.884 : 53.69 0.105 
: Interactions 
G NaHCO, (g/liter) 2.00 4.00 RAE 9 9 
Total 15 163.999 


(a) Write down the complete defining relation and the aliases 
from the design. 
(b) Estimate the main effects. 


MIND-EXPANDING EXERCISES 


14-83. Consider an unreplicated 2* factorial, and sup- 14-84. What blocking scheme would you recommend 
pose that one of the treatment combinations is missing. _ if it were necessary to run a 2* design in four blocks of 
One logical approach to this problem is to estimate the four runs each? 

missing value with a number that makes the highest or- 14-85. Consider a 27 design in two blocks with 
der interaction estimate zero. Apply this technique tothe 4B confounded with blocks. Prove algebraically that 
data in Example 14-5, assuming that ab is missing. § $45 = SSpiccrs 

Compare the results of the analysis of these data with the 

results in Example 14-5. 
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MIND-EXPANDING EXERCISES 


14-86. Consider a 2? design. Suppose that the largest 

number of runs that can be made in one block is four, but 

we can afford to perform a total of 32 observations. 

(a) Suggest a blocking scheme that will provide some 
information on all interactions. 

(b) Show an outline (source of variability, degrees of 
freedom only) for the analysis of variance for this 
design. 


14-87. Construct a 2°"! design. Suppose that it is 
necessary to run this design in two blocks of eight runs 
each. Show how this can be done by confounding a two- 
factor interaction (and its aliased three-factor interac- 
tion) with blocks. 

14-88. Construct a 2/y 7 design. Show how this design 
may be confounded in four blocks of eight runs each. Are 
any two-factor interactions confounded with blocks? 
14-89. Construct a 2})y >? design. Show how this de- 
sign can be confounded in two blocks of eight runs each 


IMPORTANT TERMS AND CONCEPTS 


without losing information on any of the two-factor 

interactions. 

14-90. Set up a 2); + design using D = AB, E = AC, 

F = BC, and G = ABC as the design generators. Ignore 

all interaction above the two factors. 

(a) Verify that each main effect is aliased with three 
two-factor interactions. 

(b) Suppose that a second 2); + design with generators 
D = —AB, E = —AC, F = —BC, and G = ABC is 
run. What are the aliases of the main effects in this 
design? 

(c) What factors may be estimated if the two sets of 
factor effect estimates above are combined? 


14-91. Consider the square root of the sum of squares 
for curvature and divide by the square root of mean 
squared error. Explain why the statistic that results has a 
t distribution and why it can be used to conduct a f test for 
curvature that is equivalent to the F test in the ANOVA. 


Analysis of variance Contrast 

(ANOVA) Defining relation 
Blocking and nuisance Design matrix 

factors Factorial experiment 
Center points Fractional factorial design 
Central composite Generator 

design Interaction 
Confounding Main effect 


Normal probability Resolution 
plot of factor Response surface 
effects Screening experiment 
Optimization Steepest ascent 
experiment (or descent) 
Orthogonal design 2" factorial design 


Two-level factorial 
design 


Regression model 
Residual analysis 
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Statistical Quality Control 


BOWL OF BEADS 


The quality guru Edward Deming conducted a simple experiment in his seminars with a 
bowl of beads. Many were colored white but a percentage of red beads were randomly 
mixed in the bowl. A participant from the seminar was provided with a paddle made with 
indentations so that 50 beads at a time could be scooped from the bowl. The participant 
was only allowed to use the paddle and instructed to only scoop white beads (repeated 
multiple times with beads replaced). The red beads were considered to be defectives. Of 
course, this was difficult to do, and each scoop resulted in a count of red beads. Deming 
plotted the fraction of red beads from each scoop and used the results to make several 
points. As was clear from the scenario, this process is beyond the participant’s ability to 
make simple improvements. It is the process that needs to be changed (reduce the number 
of red beads) and that is the responsibility of management. Furthermore, many business 
processes have this type of characteristic and it is important to learn from the data whether 
the variability is common, intrinsic to the process or whether some special cause has oc- 
curred. This distinction is important for the type of process control or improvements to be 
applied. Refer to the example of control adjustments in Chapter 1. Control charts are pri- 
mary tools to understand process variability and that is main topic of this chapter. 


CHAPTER OUTLINE 

15-1 QUALITY IMPROVEMENT 15-2.3 Rational Subgroups 
AND STATISTICS 15-2.4 Analysis of Patterns on 
15-1.1 Statistical Quality Control Control Charts 
15-1.2 Statistical Process Control 15-3. X AND RORS CONTROL 

15-2 INTRODUCTION TO CHARTS 
CONTROL CHARTS 15-4 CONTROL CHARTS FOR 
15-2.1 Basic Principles INDIVIDUAL MEASUREMENTS 
15-2.2 Design of a Control Chart 15-5 PROCESS CAPABILITY 


637 


638 CHAPTER 15 STATISTICAL QUALITY CONTROL 


15-6 ATTRIBUTE CONTROL 15-8 TIME-WEIGHTED CHARTS 
CHARTS 15-8.1 Cumulative Sum Control 
15-6.1 P Chart (Control Chart for Chart 
Proportions) 15-8.2 Exponentially Weighted Moving 
15-6.2 U Chart (Control Chart for Average Control Chart 
Defects per Unit) 15-9 OTHER SPC PROBLEM-SOLVING 
15-7 CONTROL CHART TOOLS 
PERFORMANCE 15-10 IMPLEMENTING SPC 
LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 
1. Understand the role of statistical tools in quality improvement 


2. Understand the different types of variability, rational subgroups, and how a control chart is used 
to detect assignable causes 


3. Understand the general form of a Shewhart control chart and how to apply zone rules (such as 
the Western Electric rules) and pattern analysis to detect assignable causes 


. Construct and interpret control charts for variables such as X, R, S, and individuals charts 
. Construct and interpret control charts for attributes such as P and U charts 

. Calculate and interpret process capability ratios 

. Calculate the ARL performance for a Shewhart control chart 


. Construct and interpret a cumulative sum and exponentially weighted moving average control chart 
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. Use other statistical process control problem-solving tools 


15-1 QUALITY IMPROVEMENT AND STATISTICS 


The quality of products and services has become a major decision factor in most businesses 
today. Regardless of whether the consumer is an individual, a corporation, a military defense 
program, or a retail store, when the consumer is making purchase decisions, he or she is likely 
to consider quality of equal importance to cost and schedule. Consequently, quality improve- 
ment has become a major concern to many U.S. corporations. This chapter is about statistical 
quality control, a collection of tools that are essential in quality-improvement activities. 

Quality means fitness for use. For example, you or I may purchase automobiles that we ex- 
pect to be free of manufacturing defects and that should provide reliable and economical trans- 
portation, a retailer buys finished goods with the expectation that they are properly packaged 
and arranged for easy storage and display, or a manufacturer buys raw material and expects to 
process it with no rework or scrap. In other words, all consumers expect that the products and 
services they buy will meet their requirements. Those requirements define fitness for use. 

Quality or fitness for use is determined through the interaction of quality of design and 
quality of conformance. By quality of design we mean the different grades or levels of 
performance, reliability, serviceability, and function that are the result of deliberate engi- 
neering and management decisions. By quality of conformance, we mean the systematic 
reduction of variability and elimination of defects until every unit produced is identical 
and defect-free. 

Some confusion exists in our society about quality improvement; some people still think 
that it means gold-plating a product or spending more money to develop a product or process. 
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This thinking is wrong. Quality improvement means the systematic elimination of waste. 
Examples of waste include scrap and rework in manufacturing, inspection and testing, errors on 
documents (such as engineering drawings, checks, purchase orders, and plans), customer com- 
plaint hotlines, warranty costs, and the time required to do things over again that could have 
been done right the first time. A successful quality-improvement effort can eliminate much of 
this waste and lead to lower costs, higher productivity, increased customer satisfaction, in- 
creased business reputation, higher market share, and ultimately higher profits for the company. 

Statistical methods play a vital role in quality improvement. Some applications are out- 
lined below: 


1. In product design and development, statistical methods, including designed exper- 
iments, can be used to compare different materials, components, or ingredients, and 
to help determine both system and component tolerances. This application can sig- 
nificantly lower development costs and reduce development time. 


2. Statistical methods can be used to determine the capability of a manufacturing 
process. Statistical process control can be used to systematically improve a process 
by reducing variability. 

3. Experimental design methods can be used to investigate improvements in the process. 
These improvements can lead to higher yields and lower manufacturing costs. 


4. Life testing provides reliability and other performance data about the product. This 
can lead to new and improved designs and products that have longer useful lives and 
lower operating and maintenance costs. 


Some of these applications have been illustrated in earlier chapters of this book. It is 
essential that engineers, scientists, and managers have an in-depth understanding of these sta- 
tistical tools in any industry or business that wants to be a high-quality, low-cost producer. In 
this chapter we provide an introduction to the basic methods of statistical quality control that, 
along with experimental design, form the basis of a successful quality-improvement effort. 


15-1.1 Statistical Quality Control 


The field of statistical quality control can be broadly defined as those statistical and engineer- 
ing methods that are used in measuring, monitoring, controlling, and improving quality. 
Statistical quality control is a field that dates back to the 1920s. Dr. Walter A. Shewhart of the 
Bell Telephone Laboratories was one of the early pioneers of the field. In 1924 he wrote a 
memorandum showing a modern control chart, one of the basic tools of statistical process 
control. Harold EF Dodge and Harry G. Romig, two other Bell System employees, provided 
much of the leadership in the development of statistically based sampling and inspection 
methods. The work of these three men forms much of the basis of the modern field of statis- 
tical quality control. World War II saw the widespread introduction of these methods to U.S. 
industry. Dr. W. Edwards Deming and Dr. Joseph M. Juran have been instrumental in spread- 
ing statistical quality-control methods since World War II. 

The Japanese have been particularly successful in deploying statistical quality-control 
methods and have used statistical methods to gain significant advantage over their 
competitors. In the 1970s American industry suffered extensively from Japanese (and other 
foreign) competition; that has led, in turn, to renewed interest in statistical quality-control 
methods in the United States. Much of this interest focuses on statistical process control and 
experimental design. Many U.S. companies have implemented these methods in their manu- 
facturing, engineering, and other business organizations. 
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15-1.2 Statistical Process Control 


It is impractical to inspect quality into a product; the product must be built right the first 
time. The manufacturing process must therefore be stable or repeatable and capable of op- 
erating with little variability around the target or nominal dimension. Online statistical 
process control is a powerful tool for achieving process stability and improving capability 
through the reduction of variability. 

It is customary to think of statistical process control (SPC) as a set of problem-solving 
tools that may be applied to any process. The major tools of SPC* are 


1. Histogram 
2. Pareto chart 
3. Cause-and-effect diagram 
4. Defect-concentration diagram 
5. Control chart 
6. Scatter diagram 
7. Check sheet 
Although these tools are an important part of SPC, they comprise only the technical aspect of 
the subject. An equally important element of SPC is attitude—a desire of all individuals in the 


organization for continuous improvement in quality and productivity through the systematic 
reduction of variability. The control chart is the most powerful of the SPC tools. 


15-2 INTRODUCTION TO CONTROL CHARTS 


15-2.1 Basic Principles 


In any production process, regardless of how well-designed or carefully maintained it is, a 
certain amount of inherent or natural variability will always exist. This natural variability or 
“background noise” is the cumulative effect of many small, essentially unavoidable causes. 
When the background noise in a process is relatively small, we usually consider it an accept- 
able level of process performance. In the framework of statistical quality control, this natural 
variability is often called a “stable system of chance causes.” A process that is operating with 
only chance causes of variation present is said to be in statistical control. In other words, the 
chance causes are an inherent part of the process. 

Other kinds of variability may occasionally be present in the output of a process. This 
variability in key quality characteristics usually arises from three sources: improperly ad- 
justed machines, operator errors, or defective raw materials. Such variability is generally large 
when compared to the background noise, and it usually represents an unacceptable level of 
process performance. We refer to these sources of variability that are not part of the chance 
cause pattern as assignable causes. A process that is operating in the presence of assignable 
causes is said to be out of control.’ 


* Some prefer to include the experimental design methods discussed previously as part of the SPC toolkit. We did not 
do so, because we think of SPC as an online approach to quality improvement using techniques founded on passive 
observation of the process, while design of experiments is an active approach in which deliberate changes are made 
to the process variables. As such, designed experiments are often referred to as offline quality control. 

‘The terminology chance and assignable causes was developed by Dr. Walter A. Shewhart. Today, some writers use 
common cause instead of chance cause and special cause instead of assignable cause. 


Figure 15-1 A typi- 
cal control chart. 
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Production processes will often operate in the in-control state, producing acceptable 
product for relatively long periods of time. Occasionally, however, assignable causes will 
occur, seemingly at random, resulting in a “shift” to an out-of-control state where a large pro- 
portion of the process output does not conform to requirements. A major objective of statisti- 
cal process control is to quickly detect the occurrence of assignable causes or process shifts 
so that investigation of the process and corrective action may be undertaken before many 
nonconforming units are manufactured. The control chart is an online process-monitoring 
technique widely used for this purpose. 

Recall the following from Chapter 1. Figure 1-11 illustrates that adjustments to common 
causes of variation increase the variation of a process whereas Fig. 1-12 illustrates that actions 
should be taken in response to assignable causes of variation. Control charts may also be used to 
estimate the parameters of a production process and, through this information, to determine the 
capability of a process to meet specifications. The control chart can also provide information 
that is useful in improving the process. Finally, remember that the eventual goal of statistical 
process control is the elimination of variability in the process. Although it may not be possible 
to eliminate variability completely, the control chart helps reduce it as much as possible. 

A typical control chart is shown in Fig. 15-1, which is a graphical display of a quality 
characteristic that has been measured or computed from a sample versus the sample number 
or time. Often, the samples are selected at periodic intervals such as every hour. The chart 
contains a center line (CL) that represents the average value of the quality characteristic 
corresponding to the in-control state. (That is, only chance causes are present.) Two other hor- 
izontal lines, called the upper control limit (UCL) and the lower control limit (LCL), are also 
shown on the chart. These control limits are chosen so that if the process is in control, nearly 
all of the sample points will fall between them. In general, as long as the points plot within the 
control limits, the process is assumed to be in control, and no action is necessary. However, a 
point that plots outside of the control limits is interpreted as evidence that the process is out of 
control, and investigation and corrective action are required to find and eliminate the assigna- 
ble cause or causes responsible for this behavior. The sample points on the control chart are 
usually connected with straight-line segments so that it is easier to visualize how the sequence 
of points has evolved over time. 

Even if all the points plot inside the control limits, if they behave in a systematic or 
nonrandom manner, this is an indication that the process is out of control. For example, if 
18 of the last 20 points plotted above the center line but below the upper control limit, and 
only two of these points plotted below the center line but above the lower control limit, we 
would be very suspicious that something was wrong. If the process is in control, all the 
plotted points should have an essentially random pattern. Methods designed to find 
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Control Chart 
Model 


sequences or nonrandom patterns can be applied to control charts as an aid in detecting 
out-of-control conditions. A particular nonrandom pattern usually appears on a control 
chart for a reason, and if that reason can be found and eliminated, process performance can 
be improved. 

There is a close connection between control charts and hypothesis testing. Essentially, the 
control chart is a test of the hypothesis that the process is in a state of statistical control. A 
point plotting within the control limits is equivalent to failing to reject the hypothesis of 
statistical control, and a point plotting outside the control limits is equivalent to rejecting the 
hypothesis of statistical control. 

We give a general model for a control chart. Let W be a sample statistic that measures 
some quality characteristic of interest, and suppose that the mean of W is jw and the standard 
deviation of W is ow.* Then the center line, the upper control limit, and the lower control limit 
become 


CL = pw 
LCL. = ay — how (15-1) 


where k is the “distance” of the control limits from the center line, expressed in standard 
deviation units. A common choice is k = 3. This general theory of control charts was first pro- 
posed by Dr. Walter A. Shewhart, and control charts developed according to these principles 
are often called Shewhart control charts. 

The control chart is a device for describing exactly what is meant by statistical control; 
as such, it may be used in a variety of ways. In many applications, it is used for online 
process monitoring. That is, sample data are collected and used to construct the control 
chart, and if the sample values of x (say) fall within the control limits and do not exhibit any 
systematic pattern, we say the process is in control at the level indicated by the chart. Note 
that we may be interested here in determining both whether the past data came from a 
process that was in control and whether future samples from this process indicate statistical 
control. 

The most important use of a control chart is to improve the process. We have found that, 
generally 


1. Most processes do not operate in a state of statistical control. 


2. Consequently, the routine and attentive use of control charts will identify assignable 
causes. If these causes can be eliminated from the process, variability will be reduced 
and the process will be improved. 


This process-improvement activity using the control chart is illustrated in Fig. 15-2. Notice 
that: 


3. The control chart will only detect assignable causes. Management, operator, and 
engineering action will usually be necessary to eliminate the assignable cause. An 
action plan for responding to control chart signals is vital. 


*Note that “sigma” refers to the standard deviation of the statistic plotted on the chart (i.e., oy), not the standard 
deviation of the quality characteristic. 


Figure 15-2 Process 
improvement using 
the control chart. 
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In identifying and eliminating assignable causes, it is important to find the underlying root 
cause of the problem and to attack it. A cosmetic solution will not result in any real, long-term 
process improvement. Developing an effective system for corrective action is an essential 
component of an effective SPC implementation. 

We may also use the control chart as an estimating device. That is, from a control chart 
that exhibits statistical control, we may estimate certain process parameters, such as the mean, 
standard deviation, and fraction nonconforming or fallout. These estimates may then be used 
to determine the capability of the process to produce acceptable products. Such process 
capability studies have considerable impact on many management decision problems that oc- 
cur over the product cycle, including make-or-buy decisions, plant and process improvements 
that reduce process variability, and contractual agreements with customers or suppliers re- 
garding product quality. 

Control charts may be classified into two general types. Many quality characteristics can 
be measured and expressed as numbers on some continuous scale of measurement. In such 
cases, it is convenient to describe the quality characteristic with a measure of central tendency 
and a measure of variability. Control charts for central tendency and variability are collectively 
called variables control charts. The X chart is the most widely used chart for monitoring cen- 
tral tendency, whereas charts based on either the sample range or the sample standard deviation 
are used to control process variability. Many quality characteristics are not measured on a con- 
tinuous scale or even a quantitative scale. In these cases, we may judge each unit of product as 
either conforming or nonconforming on the basis of whether or not it possesses certain attrib- 
utes, or we may count the number of nonconformities (defects) appearing on a unit of product. 
Control charts for such quality characteristics are called attributes control charts. 

Control charts have had a long history of use in industry. There are at least five reasons 
for their popularity: 


1. Control charts are a proven technique for improving productivity. A successful 
control chart program will reduce scrap and rework, which are the primary produc- 
tivity killers in any operation. If you reduce scrap and rework, productivity increases, 
cost decreases, and production capacity (measured in the number of good parts per 
hour) increases. 


2. Control charts are effective in defect prevention. The control chart helps keep the 
process in control, which is consistent with the “do it right the first time” philosophy. 
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It is never cheaper to sort out the “good” units from the “bad” later on than it is to 
build them correctly initially. If you do not have effective process control, you are 
paying someone to make a nonconforming product. 


3. Control charts prevent unnecessary process adjustments. A control chart can dis- 
tinguish between background noise and abnormal variation; no other device, including 
a human operator, is as effective in making this distinction. If process operators adjust 
the process based on periodic tests unrelated to a control chart program, they will often 
overreact to the background noise and make unneeded adjustments. These unnecessary 
adjustments can result in a deterioration of process performance. In other words, the 
control chart is consistent with the “if it isn’t broken, don’t fix it” philosophy. 


4. Control charts provide diagnostic information. Frequently, the pattern of points 
on the control chart will contain information that is of diagnostic value to an 
experienced operator or engineer. This information allows the operator to implement 
a change in the process that will improve its performance. 


5. Control charts provide information about process capability. The control chart 
provides information about the value of important process parameters and their sta- 
bility over time. This allows an estimate of process capability to be made. This in- 
formation is of tremendous use to product and process designers. 


Control charts are among the most effective management control tools, and they are as 
important as cost controls and material controls. Modern computer technology has made it 
easy to implement control charts in any type of process, because data collection and analysis 
can be performed on a microcomputer or a local area network terminal in real time, online at 
the work center. 


15-2.2 Design of a Control Chart 


Figure 15-3 X con- 
trol chart for piston 
ring diameter. 


To illustrate these ideas, we give a simplified example of a control chart. In manufacturing au- 
tomobile engine piston rings, the inside diameter of the rings is a critical quality characteris- 
tic. The process mean inside ring diameter is 74 millimeters, and it is known that the standard 
deviation of ring diameter is 0.01 millimeters. A control chart for average ring diameter is 
shown in Fig. 15-3. Every hour a random sample of five rings is taken, the average ring di- 
ameter of the sample (say x) is computed, and x is plotted on the chart. Because this control 
chart utilizes the sample mean X to monitor the process mean, it is usually called an X con- 
trol chart. Note that all the points fall within the control limits, so the chart indicates that the 
process is in statistical control. 
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Consider how the control limits were determined. The process average is 74 millimeters, 
and the process standard deviation is o = 0.01 millimeters. Now if samples of size n = 5 are 
taken, the standard deviation of the sample average X is 


Oo 0.01 
oe = 
Vn V5 


= 0.0045 


Therefore, if the process is in control with a mean diameter of 74 millimeters, by using 
the central limit theorem to assume that X is approximately normally distributed, we 
would expect approximately 100(1 — «)% of the sample mean diameters X to fall between 
74 + Zy/2(0.0045) and 74 — z,/.(0.0045). As discussed above, we customarily choose the 
constant Z,/ to be 3, so the upper and lower control limits become 


UCL = 74 + 3(0.0045) = 74.0135 
and 
LCL = 74 — 3(0.0045) = 73.9865 


as shown on the control chart. These are the 3-sigma control limits referred to above. Note that 
the use of 3-sigma limits implies that « = 0.0027; that is, the probability that the point plots 
outside the control limits when the process is in control is 0.0027. The width of the control 
limits is inversely related to the sample size n for a given multiple of sigma. Choosing the con- 
trol limits is equivalent to setting up the critical region for testing the hypotheses 


Ho: w = 74 
Ay: wp #74 


where o = 0.01 is known. Essentially, the control chart tests this hypothesis repeatedly at dif- 
ferent points in time. 

In designing a control chart, we must specify both the sample size to use and the fre- 
quency of sampling. In general, larger samples will make it easier to detect small shifts in the 
process. When choosing the sample size, we must keep in mind the size of the shift that we are 
trying to detect. If we are interested in detecting a relatively large process shift, we use smaller 
sample sizes than those that would be employed if the shift of interest were relatively small. 

We must also determine the frequency of sampling. The most desirable situation from the 
point of view of detecting shifts would be to take large samples very frequently; however, this 
is usually not economically feasible. The general problem is one of allocating sampling effort. 
That is, either we take small samples at short intervals or larger samples at longer intervals. 
Current industry practice tends to favor smaller, more frequent samples, particularly in high- 
volume manufacturing processes or where a great many types of assignable causes can occur. 
Furthermore, as automatic sensing and measurement technology develops, it is becoming 
possible to greatly increase frequencies. Ultimately, every unit can be tested as it is manufac- 
tured. This capability will not eliminate the need for control charts because the test system will 
not prevent defects. The increased data will increase the effectiveness of process control and 
improve quality. 

When preliminary samples are used to construct limits for control charts, these limits are 
customarily treated as trial values. Therefore, the sample statistics should be plotted on the 
appropriate charts, and any points that exceed the control limits should be investigated. If 
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assignable causes for these points are discovered, they should be eliminated and new limits for 
the control charts determined. In this way, the process may be eventually brought into statistical 
control and its inherent capabilities assessed. Other changes in process centering and dispersion 
may then be contemplated. 


15-2.3 Rational Subgroups 


A fundamental idea in the use of control charts is to collect sample data according to what 
Shewhart called the rational subgroup concept. Generally, this means that subgroups or sam- 
ples should be selected so that to the extent possible, the variability of the observations within 
a subgroup should include all the chance or natural variability and exclude the assignable 
variability. Then, the control limits will represent bounds for all the chance variability and not 
the assignable variability. Consequently, assignable causes will tend to generate points that are 
outside of the control limits, while chance variability will tend to generate points that are 
within the control limits. 

When control charts are applied to production processes, the time order of production is a 
logical basis for rational subgrouping. Even though time order is preserved, it is still possible 
to form subgroups erroneously. If some of the observations in the subgroup are taken at the end 
of one eight-hour shift and the remaining observations are taken at the start of the next eight- 
hour shift, any differences between shifts might not be detected. Time order is frequently a good 
basis for forming subgroups because it allows us to detect assignable causes that occur over time. 

Two general approaches to constructing rational subgroups are used. In the first ap- 
proach, each subgroup consists of units that were produced at the same time (or as closely to- 
gether as possible). This approach is used when the primary purpose of the control chart is to 
detect process shifts. It minimizes variability due to assignable causes within a sample, and it 
maximizes variability between samples if assignable causes are present. It also provides bet- 
ter estimates of the standard deviation of the process in the case of variables control charts. 
This approach to rational subgrouping essentially gives a “snapshot” of the process at each 
point in time where a sample is collected. 

In the second approach, each sample consists of units of product that are representative of 
all units that have been produced since the last sample was taken. Essentially, each subgroup 
is a random sample of all process output over the sampling interval. This method of rational 
subgrouping is often used when the control chart is employed to make decisions about the ac- 
ceptance of all units of product that have been produced since the last sample. In fact, if the 
process shifts to an out-of-control state and then back in control again between samples, it is 
sometimes argued that the first method of rational subgrouping defined above will be ineffec- 
tive against these types of shifts, and so the second method must be used. 

When the rational subgroup is a random sample of all units produced over the sampling 
interval, considerable care must be taken in interpreting the control charts. If the process mean 
drifts between several levels during the interval between samples, the range of observations 
within the sample may consequently be relatively large. It is the within-sample variability that 
determines the width of the control limits on an X chart, so this practice will result in wider 
limits on the X chart. This makes it harder to detect shifts in the mean. In fact, we can often 
make any process appear to be in statistical control just by stretching out the interval between 
observations in the sample. It is also possible for shifts in the process average to cause points 
on a control chart for the range or standard deviation to plot out of control, even though no 
shift in process variability has taken place. 

There are other bases for forming rational subgroups. For example, suppose a process con- 
sists of several machines that pool their output into a common stream. If we sample from this 


Figure 15-4 AnX 
control chart. 
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common stream of output, it will be very difficult to detect whether or not some of the machines 
are out of control. A logical approach to rational subgrouping here is to apply control chart tech- 
niques to the output for each individual machine. Sometimes this concept needs to be applied to 
different heads on the same machine, different workstations, different operators, and so forth. 

The rational subgroup concept is very important. The proper selection of samples re- 
quires careful consideration of the process, with the objective of obtaining as much useful in- 
formation as possible from the control chart analysis. 


15-2.4 Analysis of Patterns on Control Charts 


A control chart may indicate an out-of-control condition either when one or more points fall be- 
yond the control limits, or when the plotted points exhibit some nonrandom pattern of behavior. 
For example, consider the X chart shown in Fig. 15-4. Although all 25 points fall within the con- 
trol limits, the points do not indicate statistical control because their pattern is very nonrandom 
in appearance. Specifically, we note that 19 of the 25 points plot below the center line, while 
only 6 of them plot above. If the points are truly random, we should expect a more even distri- 
bution of them above and below the center line. We also observe that following the fourth point, 
five points in a row increase in magnitude. This arrangement of points is called a run. Since the 
observations are increasing, we could call it a run up; similarly, a sequence of decreasing points 
is called a run down. This control chart has an unusually long run up (beginning with the fourth 
point) and an unusually long run down (beginning with the eighteenth point). 

In general, we define a run as a sequence of observations of the same type. In addition to 
runs up and runs down, we could define the types of observations as those above and below the 
center line, respectively, so two points in a row above the center line would be a run of length 2. 

A run of length 8 or more points has a very low probability of occurrence in a random 
sample of points. Consequently, any type of run of length 8 or more is often taken as a signal 
of an out-of-control condition. For example, eight consecutive points on one side of the cen- 
ter line will indicate that the process is out of control. 

Although runs are an important measure of nonrandom behavior on a control chart, other 
types of patterns may also indicate an out-of-control condition. For example, consider the X 
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Figure 15-5 An X 
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chart in Fig. 15-5. Note that the plotted sample averages exhibit a cyclic behavior, yet they all 
fall within the control limits. Such a pattern may indicate a problem with the process, such as 
operator fatigue, raw material deliveries, and heat or stress buildup. The yield may be im- 
proved by eliminating or reducing the sources of variability causing this cyclic behavior 
(see Fig. 15-6). In Fig. 15-6, LSL and USL denote the lower and upper specification limits of 
the process. These limits represent bounds within which acceptable product must fall and they 
are often based on customer requirements. 

The problem is one of pattern recognition, that is, recognizing systematic or nonrandom 
patterns on the control chart and identifying the reason for this behavior. The ability to interpret 
a particular pattern in terms of assignable causes requires experience and knowledge of the 
process. That is, we must not only know the statistical principles of control charts, but we 
must also have a good understanding of the process. 

The Western Electric Handbook (1956) suggests a set of decision rules for detecting non- 
random patterns on control charts. Specifically, the Western Electric rules would conclude 
that the process is out of control if either 


1. One point plots outside 3-sigma control limits. 


2. Two out of three consecutive points plot beyond a 2-sigma limit. 


3. Four out of five consecutive points plot at a distance of 1-sigma or beyond from the 
center line. 


4. Eight consecutive points plot on one side of the center line. 


LSL u USL 


Figure 15-7 The 
Western Electric zone 
rules. 
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We have found these rules very effective in practice for enhancing the sensitivity of control 
charts. Rules 2 and 3 apply to one side of the center line at a time. That is, a point above the 
upper 2-sigma limit followed immediately by a point below the /Jower 2-sigma limit would not 
signal an out-of-control alarm. 

Figure 15-7 shows an X control chart for the piston ring process with the 1-sigma, 
2-sigma, and 3-sigma limits used in the Western Electric procedure. Notice that these inner 
limits (sometimes called warning limits) partition the control chart into three zones A, B, and 
C on each side of the center line. Consequently, the Western Electric rules are sometimes 
called the run rules for control charts. Notice that the last four points fall in zone B or beyond. 
Thus, since four of five consecutive points exceed the |-sigma limit, the Western Electric 
procedure will conclude that the pattern is nonrandom and the process is out of control. 


15-3 X AND R OR S CONTROL CHARTS 


When dealing with a quality characteristic that can be expressed as a measurement, it is cus- 
tomary to monitor both the mean value of the quality characteristic and its variability. Control 
over the average quality is exercised by the control chart for averages, usually called the X 
chart. Process variability can be controlled by either a range chart (R chart) or a standard de- 
viation chart (S chart), depending on how the population standard deviation is estimated. 

Suppose that the process mean and standard deviation js and o are known and that we can 
assume that the quality characteristic has a normal distribution. Consider the X chart. As dis- 
cussed previously, we can use p as the center line for the control chart, and we can place the 
upper and lower 3-sigma limits at 


UCL = p + 30/Vn 


LCL = p — 30/Vn 
Cr (15-2) 


When the parameters and o are unknown, we usually estimate them on the basis of 
preliminary samples, taken when the process is thought to be in control. We recommend the 


650 


CHAPTER 15 STATISTICAL QUALITY CONTROL 


Estimator of o 
from R Chart 


use of at least 20 to 25 preliminary samples. Suppose m preliminary samples are available, 
each of size n. Typically, n will be 4, 5, or 6; these relatively small sample sizes are widely 
used and often arise from the construction of rational subgroups. Let the sample mean for the 
ith sample be X;. Then we estimate the mean of the population, w, by the grand mean 


~ Se Lz. 
p=x=— > (15-3) 


Thus, we may take X as the center line on the X control chart. 

We may estimate o from either the standard deviation or the range of the observations 
within each sample. The sample size is relatively small, so there is little loss in efficiency in 
estimating o from the sample ranges. 

The relationship between the range R of a sample from a normal population with known 
parameters and the standard deviation of that population is needed. Since R is a random 
variable, the quantity W = R/o, called the relative range, is also a random variable. The 
parameters of the distribution of W have been determined for any sample size n. The mean and 
standard deviation of the distribution of W are called d, and d; respectively. Because R = oW, 


WrR= do Op = dso (15-4) 
Let R; be the range of the ith sample, and let 


_ 1 m 
R=—DR, (15-5) 
i=1 


m 


be the average range. Then R is an estimator of wz and from Equation 15-4 an unbiased 
estimator of o is 


(15-6) 


where the constant d, is tabulated for various sample sizes in Appendix Table XI. 


Therefore, we may use as our upper and lower control limits for the X chart 


UCL =X + R LCL = X- R 15-7 
dy>Vn dyVn oo) 
Define the constant 
A> = 2 (15-8) 
2 doVn 


Now, once we have computed the sample values x and 7, the X control chart may be defined 
as follows. 


X Control Chart 
(from R) 


R Chart 
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The center line and upper and lower control limits for an X control chart are 
OO aN OS) IO (15-9) 


where the constant A, is tabulated for various sample sizes in Appendix Table XI. 


The parameters of the R chart may also be easily determined. The center line is R. To de- 
termine the control limits, we need an estimate of op, the standard deviation of R. Once again, 
assuming the process is in control, the distribution of the relative range, W, is useful. We may 
estimate oO from Equation 15-4 as 

- : R 
Or = da36 _ d;— (15-10) 
dy 


and the upper and lower control limits on the R chart are 


—  3d5— ads 
UCL=R+—R=(1+ R 


dy dy 
a 3d3\ — 
LCL =R-—R=(1- R (15-11) 
d, dy 


Setting D, = 1 — 3d;/d, and D, = 1 + 3d;/d, leads to the following definition. 


The center line and upper and lower control limits for an R chart are 
UCL = D4r CL=r LCL = D3r (15-12) 


where 7 is the sample average range, and the constants D3 and Dy, are tabulated for 
various sample sizes in Appendix Table XI. 


The LCL for an R chart can be a negative number. In that case, it is customary to set LCL 
to zero. Because the points plotted on an R chart are nonnegative, no points can fall below an 
LCL of zero. Also, we often study the R chart first because if the process variability is not con- 
stant over time the control limits calculated for the X chart can be misleading. 

Rather than base control charts on ranges, a more modern approach is to calculate the standard 
deviation of each subgroup and plot these standard deviations to monitor the process standard de- 
viation o. This is called an S chart. When an S chart is used, it is common to use these standard 
deviations to develop control limits for the X chart. Typically, the sample size used for subgroups 
is small (fewer than 10) and in that case there is usually little difference in the X chart generated 
from ranges or standard deviations. However, because computer software is often used to imple- 
ment control charts, $ charts are quite common. Details to construct these charts follow. 

In Section 7-3, it was stated that S is a biased estimator of o. That is, E(S) = cso where c,4 
is a constant that is near, but not equal to, 1. Furthermore, a calculation similar to the one used 
for E(S) can derive the standard deviation of the statistic S with the result oV1 — ie. 
Therefore, the center line and three-sigma control limits for S are 


LCL = co —30V1—ci CL =co UCL=c0 + 30V1 - cj (15-13) 
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Assume that there are m preliminary samples available, each of size n, and let S; denote the 
standard deviation of the ith sample. Define 


S; (15-14) 


Because E(S) = c4o, an unbiased estimator of o is S/c4 That is, 


Estimator of o 
from S Chart oe 
6 = S/c4 (15-15) 
where the constant c, is tabulated for various sample sizes in Appendix Table XI. 
A control chart for standard deviations follows. 
S Chart 


UCL=3+35V1-& CL=3 LCL=3-32Vi-d (15-16) 


The LCL for an S chart can be a negative number, in that case, it is customary to set LCL to zero. 
When an S chart is used, the estimate for o in Equation 15-15 is commonly used to calculate 
the control limits for an X chart. This produces the following control limits for an X chart. 


(from S) 


= s = 
UCL =x +3——} CL=x 
. caVn is 


S 
LEE = 5s = 3 
2 caVn 


(iscin) 


EXAMPLE 15-1 Vane Opening 
A component part for a jet aircraft engine is manufactured 
by an investment casting process. The vane opening on this 
casting is an important functional parameter of the part. We 
will illustrate the use of X and R control charts to assess the 
statistical stability of this process. Table 15-1 presents 20 
samples of five parts each. The values given in the table have 
been coded by using the last three digits of the dimension; 
that is, 31.6 should be 0.50316 inch. 

The quantities x = 33.3 and 7 = 5.8 are shown at the 
foot of Table 15-1. The value of A, for samples of size 5 is 
A, = 0.577. Then the trial control limits for the X chart are 


¥ + AjF = 33.32 + (0.577)(5.8) = 33.32 + 3.35 
or 
UCL = 36.67. LCL = 29.97 


For the R chart, the trial control limits are 


UCL = Dy? = (2.115)(5.8) = 12.27 
LCL = DF = (0)(5.8) = 0 


The X and R control charts with these trial control limits 
are shown in Fig. 15-8. Notice that samples 6, 8, 11, and 19 are 
out of control on the X chart and that sample 9 is out of 
control on the R chart. (These points are labeled with a “1” 
because they violate the first Western Electric rule.) 

For the S chart, the value of c, = 0.94. Therefore, 


3(2.345 
Via e = (2345) 7 om = 2.553 


C4 0.94 


and the trial control limits are 


UCL = 2.345 + 2.553 = 4.898 
LCL = 2.345 — 2.553 = —0.208 


Table 15-1 Vane-Opening Measurements 
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Sample 
Number x, Xy x3 x4 Xs oe r s 
1 33 29 31 32 33 31.6 4 1.67332 
2 33 31 35 37 31 33.4 6 2.60768 
3 35 37 33 34 36 35.0 4 1.58114 
4 30 31 33 34 33 32:2 4 1.64317 
5 33 34 35 33 34 33.8 2 0.83666 
6 38 37 39 40 38 38.4 3 1.14018 
7 30 31 32 34 31 31.6 4 1.51658 
8 29 39 38 39 39 36.8 10 4.38178 
9 28 33 35 36 43 35.0 15 5.43139 
10 38 33 32 35 32 34.0 6 2.54951 
11 28 30 28 32 31 29.8 4 1.78885 
12 31 35 35 35 34 34.0 1.73205 
13 2 32 34 35 37 33.0 10 3.80789 
14 33 33 35 37 36 34.8 4 1.78885 
15 35 37 32 35 39 35.6 7 2.60768 
16 33 33 27 31 30 30.8 6 2.48998 
17 35 34 34 30 3 33.0 5 2.00000 
18 32 33 30 30 33 31.6 3 1.51658 
19 5) Di 34 DY 28 28.2 9 3.42053 
20 35 35 36 33 30 33.8 6 2.38747 
xX = 33.32 7=58 y = 2.345 
42 
40 
1 
38 UCL = 36.67 \ 1 
2 38 z= 2, / *\ "1 
334 en Aaa ™“. A . : 
SLY VW 
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1 e 
28 i 
26 
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Figure 15-8 The X andR 
control charts for vane opening. 
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The LCL is set to zero. If s is used to determine the control 
limits for the X chart, 


_ 35 3(2.345) 
~ Vn 0.94-V5 


and this result is nearly the same as from 7. The S chart is shown 
in Fig. 15-9. Because the control limits for the X chart calculated 
from 5 are nearly the same as from 7, the chart is not shown. 

Suppose that all of these assignable causes can be traced 
to a defective tool in the wax-molding area. We should discard 
these five samples and recompute the limits for the X andR 
charts. These new revised limits are, for the X chart, 


=/33:52-= = 33.32 3:35 


sll 


UCL = x 


+ AyF = 33.21 + (0.577)(5.0) = 36.10 


LCL = x 


AoF = 33.21 — (0.577)(5.0) = 30.33 


R] 


[ WY 


123 4 5 6 7 8 9 101112 13 14 15 16 17 1819 20 


and for the R chart, 


UCL = DyF = (2.115)(5.0) = 10.57 


LCL = D3 = (0)(5.0) = 0 


The revised control charts are shown in Fig. 15-10. 

Practical Interpretation: Notice that we have treated the 
first 20 preliminary samples as estimation data with which to 
establish control limits. These limits can now be used to judge 
the statistical control of future production. As each new 
sample becomes available, the values of x and r should be 
computed and plotted on the control charts. It may be desir- 
able to revise the limits periodically, even if the process 
remains stable. The limits should always be revised when 
process improvements are made. 


@= not used in computing 
control limits 


Revised UCL = 36.10 
CL = 33.21 


Revised LCL = 30.33 
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Figure 15-10 The X 
and R control charts for 
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Table 15-2 Summary Report from Minitab for the Vane-Opening Data 


Test Results for Xbar Chart 
TEST 1. One point more than 3.00 sigmas from center line. 
Test Failed at points: 6 8 11 19 


Test Results for R Chart 


TEST 1. One point more than 3.00 sigmas from center line. 


Test Failed at points: 9 


Computer Construction of X and R Control Charts 

Many computer programs construct X and R control charts. Figures 15-8 and 15-10 show 
charts similar to those produced by Minitab for the vane-opening data. This program will 
allow the user to select any multiple of sigma as the width of the control limits and use the 
Western Electric rules to detect out-of-control points. The program will also prepare a sum- 
mary report as in Table 15-2 and exclude subgroups from the calculation of the control limits. 


EXERCISES FOR SECTION 15-3 


15-1. Control charts for X and R are to be set up for an im- 
portant quality characteristic. The sample size is n = 5, and 
x and r are computed for each of 35 preliminary samples. 
The summary data are 


35 

d\n = 1200 

i=l i=1 

(a) Calculate trial control limits for X and R charts. 

(b) Assuming that the process is in control, estimate the 
process mean and standard deviation. 

15-2. Twenty-five samples of size 5 are drawn from a process 

at one-hour intervals, and the following data are obtained: 


25 25 25 
Ds = 362.75 Dry =860 Dds; = 3.64 
i=1 i=1 i=1 


(a) Calculate trial control limits for X and R charts. 

(b) Repeat part (a) for X and S charts. 

15-3. Control charts are to be constructed for samples of size 
n = 4, and x and s are computed for each of 20 preliminary 
samples as follows: 


iS) 
i=) 


20 
x= 4460 js, = 271.6 


i=1 i=1 


I 


(a) Calculate trial control limits for X and S charts. 

(b) Assuming the process is in control, estimate the process 
mean and standard deviation. 

15-4. Samples of size n = 6 are collected from a process 

every hour. After 20 samples have been collected, we calcu- 

late x = 20.0 and 7/d, = 1.4. 

(a) Calculate trial control limits for X and R charts. 


(b) If s/c, = 1.5, determine trial control limits for X and S 
charts. 

15-5. The level of cholesterol (in mg/dL) is an important index 

for human health. The sample size is n = 5. The following sum- 

mary statistics are obtained from cholesterol measurements: 


3 


o 


t 


30 30 
a = 14003, 3 A= 1503, 3 y= 5.10 
i=1 i=l 


Il 


(a) Find trial control limits for X and R charts. 
(b) Repeat part (a) for X and S charts. 


15-6. An X control chart with three-sigma control limits has 
UCL = 48.75 and LCL = 42.71. Suppose the process standard 
deviation is o = 2.25. What subgroup size was used for the chart? 


15-7. An extrusion die is used to produce aluminum rods. 
The diameter of the rods is a critical quality characteristic. The 
following table shows x and r values for 20 samples of five 
rods each. Specifications on the rods are 0.5035 + 0.0010 inch. 
The values given are the last three digits of the measurement; 
that is, 34.2 is read as 0.50342. 


Sample x 


COIDNHWRWNH 
WwW WwW 
pag 
or 
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Sample G r 
10 38.6 4 
11 35.4 8 
12 34.0 6 
13 36.0 4 
14 37.2 7 
15 35.2 3 
16 33.4 10 
17 35.0 4 
18 34.4 7 
19 33.9 8 
20 34.0 4 


(a) Using all the data, find trial control limits for X and R 
charts, construct the chart, and plot the data. 

(b) Use the trial control limits from part (a) to identify 
out-of-control points. If necessary, revise your control 
limits, assuming that any samples that plot outside the 
control limits can be eliminated. Estimate o. 

15-8. The copper content of a plating bath is measured three 

times per day, and the results are reported in ppm. The x and r 

values for 25 days are shown in the following table: 


Day a r Day as r 
1 5.45 1.21 14 7.01 1.45 
2, 5.39 0.95 15 5.83 1.37 
3 6.85 1.43 16 6.35 1.04 
4 6.74 1.29 17 6.05 0.83 
D 5.83 1.35 18 TAL 1.35 
6 122 0.88 19 7.32 1.09 
7 6.39 0.92 20 5.90 1.22 
8 6.50 1.13 21 5.50 0.98 
9 WMS 1.25 22 6.32 1.21 

10 5.92 1.05 23 6.55 0.76 
11 6.45 0.98 24 5.90 1.20 
12 5.38 1.36 25 5.95 1.19 
13 6.03 0.83 


(a) Using all the data, find trial control limits for X and R charts, 
construct the chart, and plot the data. Is the process in 
statistical control? 

(b) If necessary, revise the control limits computed in part (a), 
assuming that any samples that plot outside the control 
limits can be eliminated. 

15-9. The pull strength of a wire-bonded lead for an inte- 

grated circuit is monitored. The following table provides data 

for 20 samples each of size three. 


Sample Number xy X2 x3 
1 15.4 15.6 15.3 
2 15.4 17.1 15.2 
3 16.1 16.1 13.5 
4 13.5 12.5 10.2 
5 18.3 16.1 17.0 
6 19.2 17.2 19.4 
7 14.1 12.4 11.7 
8 15.6 13.3 13.6 
9 13.9 14.9 15.5 

10 18.7 21.2 20.1 
11 15.3 13.1 13.7 
12 16.6 18.0 18.0 
13 17.0 15.2 18.1 
14 16.3 16.5 17.7 
15 8.4 Well 8.4 
16 11.1 13.8 11.9 
17 16.5 17.1 18.5 
18 18.0 14.1 15.9 
19 17.8 17.3 12.0 
20 11.5 10.8 11.2 


(a) Use all the data to determine trial control limits for X and 
R charts, construct the control limits, and plot the data. 

(b) Use the control limits from part (a) to identify out-of-control 
points. If necessary, revise your control limits assuming that 
any samples that plot outside of the control limits can be 
eliminated. 

(c) Repeat parts (a) and (b) for X and S charts. 

15-10. The following data were considered in Quality 

Engineering [“An SPC Case Study on Stabilizing Syringe 

Lengths” (1999-2000, Vol. 12(1))]. The syringe length is 

measured during a pharmaceutical manufacturing process. 

The following table provides data (in inches) for 20 samples 

each of size five. 


Sample xy Xo x3 x4 Xs 
1 4.960 4.946 4.950 4.956 4.958 
2 4.958 4.927 4.935 4.940 4.950 
3 4.971 4.929 4.965 4.952 4.938 
4 4.940 4.982 4.970 4.953 4.960 
5 4.964 4.950 4.953 4.962 4.956 
6 4.969 4.951 4.955 4.966 4.954 
7 4.960 4.944 4.957 4.948 4.951 
8 4.969 4.949 4.963 4.952 4.962 
9 4.984 4.928 4.960 4.943 4.955 
10 4.970 4.934 4.961 4.940 4.965 
11 4.975 4.959 4.962 4.971 4.968 
12 4.945 4.977 4.950 4.969 4.954 


13 4.976 4964 4970 4.968 4.972 
14 4.970 4.954 4.964 4959 4.968 
15 4.982 4.962 4968 4975 4.963 
16 4.961 4.943 4.950 4.949 4.957 
17 4.980 4.970 4.975 4.978 4.977 
18 4.975 4.968 4.971 4.969 4.972 
19 4.977 4.966 4969 4.973 4.970 
20 4.975 4.967 4969 4972 4.972 


Using all the data, find trial control limits for X and R 
charts, construct the chart, and plot the data. Is this 
process in statistical control? 

Use the trial control limits from part (a) to identify out- 
of-control points. If necessary, revise your control limits 
assuming that any samples that plot outside of the control 
limits can be eliminated. 

(c) Repeat parts (a) and (b) for X and S charts. 

15-11. The thickness of a metal part is an important qual- 
ity parameter. Data on thickness (in inches) are given in the 
following table, for 25 samples of five parts each. 


(a 


YS 


(b 


YS 


Sample 
Number xy Xp x3 x4 Xs 

(_ 1 0.0629 0.0636 0.0640 0.0635 0.0640 
2 0.0630 0.0631 0.0622 0.0625 0.0627 

3 0.0628 0.0631 0.0633 0.0633 0.0630 

4 0.0634 0.0630 0.0631 0.0632 0.0633 

5 0.0619 0.0628 0.0630 0.0619 0.0625 

6 0.0613 0.0629 0.0634 0.0625 0.0628 

7 0.0630 0.0639 0.0625 0.0629 0.0627 

8 0.0628 0.0627 0.0622 0.0625 0.0627 

9 0.0623 0.0626 0.0633 0.0630 0.0624 

10 0.0631 0.0631 0.0633 0.0631 0.0630 

11 0.0635 0.0630 0.0638 0.0635 0.0633 

12 0.0623 0.0630 0.0630 0.0627 0.0629 

13 0.0635 0.0631 0.0630 0.0630 0.0630 

14 0.0645 0.0640 0.0631 0.0640 0.0642 

15 0.0619 0.0644 0.0632 0.0622 0.0635 

16 0.0631 0.0627 0.0630 0.0628 0.0629 

17 0.0616 0.0623 0.0631 0.0620 0.0625 

18 0.0630 0.0630 0.0626 0.0629 0.0628 

19 0.0636 0.0631 0.0629 0.0635 0.0634 

20 0.0640 0.0635 0.0629 0.0635 0.0634 

21 0.0628 0.0625 0.0616 0.0620 0.0623 
22 0.0615 0.0625 0.0619 0.0619 0.0622 

23 0.0630 0.0632 0.0630 0.0631 0.0630 
24 0.0635 0.0629 0.0635 0.0631 0.0633 

25 0.0623 0.0629 0.0630 0.0626 0.0628 
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(a) Using all the data, find trial control limits for X and R 
charts, construct the chart, and plot the data. Is the process 
in statistical control? 

(b) Use the trial control limits from part (a) to identify out-of- 
control points. If necessary, revise your control limits 
assuming that any samples that plot outside of the control 
limits can be eliminated. 

(c) Repeat parts (a) and (b) for X and S charts. 

15-12. Apply the Western Electric Rules to the following x 

control chart. The warning limits are shown as dotted lines. 

Describe any rule violations. 


UCL=16 


2 4 6 8 10 12 14 16 18 20 
Observation 


15-13. An X control chart with three-sigma control limits 

and subgroup size n = 4 has control limits UCL = 48.75 and 

LCL = 40.55. 

(a) Estimate the process standard deviation. 

(b) Does the answer to part (a) depend on whether 7 or s was 
used to construct the X control chart? 


15-14. Web traffic can be measured to help highlight secu- 
rity problems or indicate a potential lack of bandwidth. Data 
on Web traffic (in thousand hits) from http://en.wikipedia. 
org/wiki/Web_traffic are given in the following table for 25 
samples each of size four. 


Sample x Xy X3 X4 
1 163.95 164.54 163.87 165.10 
2 163.30 162.85 163.18 165.10 
3 163.13 165.14 162.80 163.81 
4 164.08 163.43 164.03 163.77 
5) 165.44 163.63 163.95 164.78 
6 163.83 164.14 165.22 164.91 
7 162.94 163.64 162.30 163.78 
8 164.97 163.68 164.73 162.32 
9 165.04 164.06 164.40 163.69 
10 164.74 163.74 165.10 164.32 
11 164.72 165.75 163.07 163.84 


continued 
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(a) Use all the data to determine trial control limits for X and 


Sample a 2 & 2 R charts, construct the chart, and plot the data. 

12 164.25 162.72 163.25 164.14 (b) Use the trial control limits from part (a) to identify out- 

13 164.71 162.63 165.07 162.59 of-control points. If necessary, revise your control limits, 

14 166.61 167.07 167.41 166.10 assuming that any samples that plot outside the control 

15 165.23 16340 164.94 163.74 limits can be eliminated. 

16 164.27 163.42 164.73 164.88 15-15. Consider the data in Exercise 15-9. Calculate the 

17 163.59 164.84 164.45 164.12 sample standard deviation of all 60 measurements and com- 

18 164.90 164.20 164.32 163.98 pare this result to the estimate of o obtained from your revised 

19 163.98 163.53 163.34 163.82 X and R charts. Explain any differences. 

20 164.08 164.33 162.38 164.08 15-16. Consider the data in Exercise 15-10. Calculate the 

21 165.71 162.63 164.42 165.27 sample standard deviation of all 100 measurements and com- 

2 164.03 163.36 164.55 165.77 pare this result to the estimate of o obtained from your revised 
X and R charts. Explain any differences. 

3} 160.52 161.68 161.18 161.33 

24 164.22 164.27 164.35 165.12 

a) 163.93 163.96 165.05 164.52 


15-4 CONTROL CHARTS FOR INDIVIDUAL 
MEASUREMENTS 


In many situations, the sample size used for process control is n = 1; that is, the sample con- 
sists of an individual unit. Some examples of these situations are as follows: 


1. Automated inspection and measurement technology is used, and every unit manu- 
factured is analyzed. 


2. The production rate is very slow, and it is inconvenient to allow sample sizes of n > 1 
to accumulate before being analyzed. 


3. Repeat measurements on the process differ only because of laboratory or analysis 
error, as in many chemical processes. 


4. In process plants, such as papermaking, measurements on some parameters such 
as coating thickness across the roll will differ very little and produce a standard 
deviation that is much too small if the objective is to control coating thickness 
along the roll. 


In such situations, the individuals control chart (also called an X chart) is useful. The 
control chart for individuals uses the moving range of two successive observations to estimate 
the process variability. The moving range is defined as MR; = |X; — X,_,| and for m observations 
the average moving range is m 


MR = XxX X;_ 
m-1 >> i i 1 
An estimate of o is 
c= Mk = ME: (15-18) 
od «(12.128 


because each moving range is the range between two consecutive observations. Note that there 
are only m — 1 moving ranges. It is also possible to establish a control chart on the moving range 
using D; and D, for n = 2. The parameters for these charts are defined as follows. 


@ 
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Individuals 
Control Chart The center line and upper and lower control limits for a control chart for individuals are 
= mr _ mr 
Cee nr ae =* 2 08 
CL=x (15-19) 
= mr _ mr 
eae 2 = Saas 


and for a control chart for moving ranges 


UCL = Dymr = 3.267mr 
CL = mr 
LCL = Dymr = 0 


Note that LCL for this moving range chart is always zero because D; = 0 forn = 2. The pro- 
cedure is illustrated in the following example. 


EXAMPLE 15-2 Chemical Process Concentration 


Table 15-3 shows 20 observations on concentration for the same time, the observed concentration reading will differ only 
output of a chemical process. The observations are taken at because of measurement error. Since the measurement error is 
one-hour intervals. If several observations are taken at the small, only one observation is taken each hour. 


Table 15-3 Chemical Process Concentration Measurements 


Concentration Moving Range 
Observation x ane 
1 102.0 
2 94.8 72 
3 98.3 315) 
4 98.4 0.1 
5 102.0 3.6 
6 98.5 3.5 
Wi 99.0 0.5 
8 97.7 1.3 
9 100.0 23) 
10 98.1 1.9 
11 101.3 3.2 
12 98.7 2.6 
13 101.1 2.4 
14 98.4 21 
15 97.0 1.4 
16 96.7 0.3 
17 100.3 3.6 
18 101.4 1.1 
19 7/2 4.2 
20 101.0 3.8 


x — 9901 mr = 2.59 
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Figure 15-11 
process concentration data. 


To set up the control chart for individuals, note that the 
sample average of the 20 concentration readings is x = 99.1 
and that the average of the moving ranges of two observations 
shown in the last column of Table 16-3 is mr = 2.59. To set up 
the moving-range chart, we note that D,; = 0 and D, = 3.267 
for n = 2. Therefore, the moving-range chart has center line 
mr = 2.59, LCL = 0, and UCL = Dymr = (3.267)(2.59) 
= 8.46. The control chart is shown as the lower control chart 
in Fig. 15-11. This control chart was constructed by Minitab. 
Because no points exceed the upper control limit, we may now 
set up the control chart for individual concentration measure- 
ments. If a moving range of n = 2 observations is used, 
d, = 1.128. For the data in Table 15-3 we have 
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Control charts for individuals and the moving range (from Minitab) for the chemical 


UCL = 3437 = 99,1 + 32>” = 108.99 
xb "7128 
Ci =s = 001 
LCL = % — 3" = 991 — 32>" = 92,21 
x = . = A 
d, 1,128 


The control chart for individual concentration measure- 
ments is shown as the upper control chart in Fig. 15-11. There 
is no indication of an out-of-control condition. 

Practical Interpretation: These calculated control limits 
are used to monitor future production. 


The chart for individuals can be interpreted much like an ordinary X control chart. A shift 
in the process average will result in either a point (or points) outside the control limits, or a 
pattern consisting of a run on one side of the center line. 

Some care should be exercised in interpreting patterns on the moving-range chart. 
The moving ranges are correlated, and this correlation may often induce a pattern of runs 
or cycles on the chart. The individual measurements are assumed to be uncorrelated, 
however, and any apparent pattern on the individuals’ control chart should be carefully 


investigated. 


The control chart for individuals is not very sensitive to small shifts in the process mean. 
For example, if the size of the shift in the mean is one standard deviation, the average number 
of points to detect this shift is 43.9. This result is shown later in the chapter. While the per- 
formance of the control chart for individuals is much better for large shifts, in many situations 
the shift of interest is not large and more rapid shift detection is desirable. In these cases, we 
recommend time-weighted charts such as the cumulative sum control chart or an exponen- 
tially weighted moving-average chart (discussed in Section 15-8). 

Some individuals have suggested that limits narrower than 3-sigma be used on the 
chart for individuals to enhance its ability to detect small process shifts. This is a danger- 
ous suggestion, for narrower limits dramatically increase false alarms and the charts may 
be ignored and become useless. If you are interested in detecting small shifts, consider the 


time-weighted charts. 


15-4 CONTROL CHARTS FOR INDIVIDUAL MEASUREMENTS 


EXERCISES FOR SECTION 15-4 
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15-17. Twenty successive hardness measurements are made 
on a metal alloy, and the data are shown in the following table. 


Observation Hardness Observation Hardness 
1 51 11 51 
2 52 12 57 
3 54 13 58 
4 55 14 50 
5 55 15 53 
6 51 16 52 
7 Sy 17 54 
8 50 18 50 
9 51 19 56 

10 56 20 53 


(a) Using all the data, compute trial control limits for individual 
observations and moving-range charts. Construct the chart 
and plot the data. Determine whether the process is in statis- 
tical control. If not, assume assignable causes can be found 
to eliminate these samples and revise the control limits. 

(b) Estimate the process mean and standard deviation for the 
in-control process. 


15-18. In a semiconductor manufacturing process CVD 

metal thickness was measured on 30 wafers obtained over ap- 

proximately two weeks. Data are shown in the following table. 

(a) Using all the data, compute trial control limits for individual 
observations and moving-range charts. Construct the chart 
and plot the data. Determine whether the process is in statis- 
tical control. If not, assume assignable causes can be found 
to eliminate these samples and revise the control limits. 

(b) Estimate the process mean and standard deviation for the 
in-control process. 


Wafer eg Wafer Xe 
1 16.8 16 15.4 
2 14.9 17 14.3 
3 18.3 18 16.1 
4 16.5 19 15.8 
5 17.1 20 15.9 
6 17.4 21 15.2 
7 15.9 a) 16.7 
8 14.4 23 15.2 
9 15.0 24 14.7 

10 15.7 25 17.9 
11 Il 26 14.8 
12 15.9 27 17.0 
13 16.4 28 16.2 
14 15.8 29 15.6 
15 15.4 30 16.3 


15-19. The diameter of holes is measured in consecutive 
order by an automatic sensor. The results of measuring 25 
holes are in the following table. 


Sample Diameter Sample Diameter 

1 9.94 14 VOY) 
2 9.93 15 10.12 
3 10.09 16 9.81 
4 9.98 17 9.73 
5 10.11 18 10.14 
6 9.99 19 9.96 
7 10.11 20 10.06 
8 9.84 21 10.11 
9 9.82 DD, 9.95 

10 10.38 23 9.92 

11 9.99 24 10.09 

12 10.41 25 9.85 

13 10.36 

(a) Using all the data, compute trial control limits for individ- 


ual observations and moving-range charts. Construct the 
control chart and plot the data. Determine whether the 
process is in statistical control. If not, assume assignable 
causes can be found to eliminate these samples and revise 
the control limits. 

(b) Estimate the process mean and standard deviation for the 
in-control process. 

15-20. The viscosity of a chemical intermediate is mea- 

sured every hour. Twenty samples each of size n = 1, are in the 

following table. 


Sample Viscosity Sample Viscosity 
1 495 11 493 
2 491 12 507 
3 501 13 503 
4 501 14 475 
5 512 15 497 
6 540 16 499 
7 492 17 468 
8 504 18 486 
9 542 19 S11 

10 508 20 487 


(a) Using all the data, compute trial control limits for individ- 
ual observations and moving-range charts. Determine 
whether the process is in statistical control. If not, assume 
assignable causes can be found to eliminate these samples 
and revise the control limits. 


| 


(_ Observation 
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(b) Estimate the process mean and standard deviation for the 
in-control process. 

15-21. The following table of data was analyzed in Quality 

Engineering (1991-1992, Vol. 4(1)). The average particle size 

of raw material was obtained from 25 successive samples. 


Size Observation Size 
1 96.1 14 100.5 
94.4 15 103.1 
3 116.2 16 93.1 
4 98.8 17 93.7 
5 95.0 18 72.4 
6 120.3 19 87.4 
7 104.8 20 96.1 
8 88.4 21 97.1 
9 106.8 pD 95.7 
10 96.8 23 94.2 
11 100.9 24 102.4 
12 117.7 25 131.9 
13 115.6 


(a) Using all the data, compute trial control limits for individual 
observations and moving-range charts. Construct the chart 
and plot the data. Determine whether the process is in statis- 
tical control. If not, assume assignable causes can be found 
to eliminate these samples and revise the control limits. 

(b) Estimate the process mean and standard deviation for the 
in-control process. 


15-22. Pulsed laser deposition technique is a thin film dep- 
osition technique with a high-powered laser beam. Twenty-five 
films were deposited through this technique. The thicknesses 
of the films obtained are shown in the following table. 


Film Thickness (in nm) Film Thickness (in nm) 


1 28 8 51 
2 45 9 23 
3 34 10 35 
4 29 11 47 
3) 37 12 50 
6 52 13 32 
7 29 14 40 
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15 46 21 21 
16 59 UD, 62 
17 20 23 34 
18 33 24 31 
19 56 25 98 
20 49 


(a) Using all the data, compute trial control limits for individ- 
ual observations and moving-range charts. Determine 
whether the process is in statistical control. If not, assume 
assignable causes can be found to eliminate these samples, 
and revise the control limits. 

(b) Estimate the process mean and standard deviation for the 
in-control process. 


15-23. The production manager of a soap manufacturing 
company wants to monitor the weights of the bars produced 


=< 


a) 


on the line. Twenty bars are taken during a stable period of 


the process. The weights of the bars are shown in the follow- 
ing table. 


Bar Weight (in g) Bar Weight (in g) 
1 74 11 99 
2 82 12 75 
3 97 13 77 
4 86 14 82 
5 71 15 93 
6 68 16 70 
di 83 17 87 
8 90 18 76 
9 88 19 84 

10 64 20 94 


(a) Using all the data, compute trial control limits for individ- 
ual observations and moving-range charts. Determine 
whether the process is in statistical control. If not, assume 
assignable causes can be found to eliminate these samples, 
and revise the control limits. 

(b) Estimate the process mean and standard deviation for the 
in-control process. 


It is usually necessary to obtain some information about the process capability, that is, the 
performance of the process when it is operating in control. Two graphical tools, the tolerance 
chart (or tier chart) and the histogram, are helpful in assessing process capability. The toler- 
ance chart for all 20 samples from the vane-manufacturing process is shown in Fig. 15-12. The 
specifications on vane opening are 0.5030 + 0.0010 in. In terms of the coded data, the upper 
specification limit is USL = 40 and the lower specification limit is LSL = 20, and these limits 


Figure 15-12 
Tolerance diagram of 
vane openings. 


Process 
Capability Ratio 
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are shown on the chart in Fig. 15-12. Each measurement is plotted on the tolerance chart. 
Measurements from the same subgroup are connected with lines. The tolerance chart is useful 
in revealing patterns over time in the individual measurements, or it may show that a particu- 
lar value of X or r was produced by one or two unusual observations in the sample. For exam- 
ple, note the two unusual observations in sample 9 and the single unusual observation in 
sample 8. Note also that it is appropriate to plot the specification limits on the tolerance chart, 
since it is a chart of individual measurements. It is never appropriate to plot specification 
limits on a control chart or to use the specifications in determining the control limits. 
Specification limits and control limits are unrelated. Finally, note from Fig. 15-12 that the 
process is running off-center from the nominal dimension of 30 (or 0.5030 inch). 

The histogram for the vane-opening measurements is shown in Fig. 15-13. The observations 
from samples 6, 8, 9, 11, and 19 (corresponding to out of-control points on either the X or R 
chart) have been deleted from this histogram. The general impression from examining this his- 
togram is that the process is capable of meeting the specification but that it is running off-center. 

Another way to express process capability is in terms of an index that is defined as follows. 


The process capability ratio (PCR) is 


pcr = USE = ESL or 
60 ( a ) 
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Figure 15-13 
Histogram for vane 
opening. 
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The numerator of PCR is the width of the specifications. The limits 30 on either side of the 
process mean are sometimes called natural tolerance limits, for these represent limits that an 
in-control process should meet with most of the units produced. Consequently, 60 is often 
referred to as the width of the process. For the vane opening, where our sample size is 5, we 
could estimate o as 


Therefore, the PCR is estimated to be 


USL—LSL 40 — 20 
PCR = ——= = = 1.55 
66 6(2.15) 


The PCR has a natural interpretation: (1/PCR)100% is just the percentage of the specifi- 
cations’ width used by the process. Thus, the vane-opening process uses approximately 
(1/1.55)100% = 64.5% of the specifications’ width. 

Figure 15-14(a) shows a process for which the PCR exceeds unity. Since the process 
natural tolerance limits lie inside the specifications, very few defective or nonconforming 
units will be produced. If PCR = 1, as shown in Fig.15-14(b), more nonconforming units 
result. In fact, for a normally distributed process, if PCR = 1, the fraction nonconforming 
is 0.27%, or 2700 parts per million. Finally, when the PCR is less than unity, as in Fig. 
15-14(c), the process is very yield-sensitive and a large number of nonconforming units will 
be produced. 

The definition of the PCR given in Equation 15-20 implicitly assumes that the 
process is centered at the nominal dimension. If the process is running off-center, its 
actual capability will be less than indicated by the PCR. It is convenient to think of PCR 
as a measure of potential capability, that is, capability with a centered process. If the 


Figure 15-14 

Process fallout and the 
process capability ratio 
(PCR). 


PCR, 
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process is not centered, a measure of actual capability is often used. This ratio, called 
PCR,, is defined below. 


USE tp Se 
PCR = min} eagle | 


15-21 
30 30 ( ) 


In effect, PCR, is a one-sided process capability ratio that is calculated relative to the specifi- 
cation limit nearest to the process mean. For the vane-opening process, we find that the 
estimate of the process capability ratio PCR, (after deleting the samples corresponding to out- 
of-control points) is 


USL — x x — LSL 
PCR, = min 36.” ~«3S 
oO oO 
40 — 33.21 33.21 — 20 
= 1.06, = 2.04] = 1.05 
_ 3(2.15) 3(2.15) 


_~ Note that if PCR = PCR,, the process is centered at the nominal dimension. Since 
PCR, = 1.05 for the vane-opening process and PCR = 1.55, the process is obviously run- 
ning off-center, as was first noted in Figs. 15-10 and 15-13. This off-center operation was 
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ultimately traced to an oversized wax tool. Changing the tooling resulted in a substantial 
improvement in the process. 

The fractions of nonconforming output (or fallout) below the lower specification limit and 
above the upper specification limit are often of interest. Suppose that the output from a normally 


distributed process in statistical control is denoted as X. The fractions are determined from 


P(X < LSL) = P(Z < (LSL — p)/c) 


EXAMPLE 15-3 Electrical Current 

For an electronic manufacturing process a current has specifi- 
cations of 100 + 10 milliamperes. The process mean p and 
standard deviation o are 107.0 and 1.5, respectively. The 
process mean is nearer to the USL. Consequently, 


6:5 
and 
110 — 107 
PCR, = ————— = 0.67 
CR: 3-15 m6 


The small PCR, indicates that the process is likely to produce 
currents outside of the specification limits. From the normal 


P(X > USL) = P(Z > (USL — )/o) 


distribution in Appendix Table II, 


P(X < LSL) = P(Z < (90 — 107)/1.5) 
= P(Z < —11.33) =0 

P(X > USL) = P(Z > (110 — 107)/1.5) 
= P(Z > 2) = 0.023 


Practical Interpretation: The probability a current is less 
than the LSL is nearly zero. Consequently, the nonconform- 
ing output exceeds the USL. The PCR, would improve if the 
process mean were centered in the specifications at 100 mil- 
liamperes. 


Figure 15-15 Mean 
of a six-sigma process 
shifts by 1.5 standard 
deviations. 


For this example, the relatively large probability of exceeding the USL is a warning of poten- 
tial problems with this criterion even if none of the measured observations in a preliminary sam- 
ple exceed this limit. We emphasize that the fraction-nonconforming calculation assumes that the 
observations are normally distributed and the process is in control. Departures from normality can 
seriously affect the results. The calculation should be interpreted as an approximate guideline for 
process performance. To make matters worse, 1 and o need to be estimated from the data avail- 
able and a small sample size can result in poor estimates that further degrade the calculation. 

Montgomery (2009) provides guidelines on appropriate values of the PCR and a table re- 
lating fallout for a normally distributed process in statistical control to the value of PCR. 
Many U.S. companies use PCR = 1.33 as a minimum acceptable target and PCR = 1.66 as a 
minimum target for strength, safety, or critical characteristics. Some companies require that in- 
ternal processes and those at suppliers achieve a PCR, = 2.0. Figure 15-15 illustrates a process 
with PCR = PCR, = 2.0. Assuming a normal distribution, the calculated fallout for this process 
is 0.0018 parts per million. A process with PCR, = 2.0 is referred to as a six-sigma process be- 
cause the distance from the process mean to the nearest specification is six standard deviations. 
The reason that such a large process capability is often required is that it is difficult to maintain 
a process mean at the center of the specifications for long periods of time. A common model that 


|x1.56>| 


LSL mn USL 
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is used to justify the importance of a six-sigma process is illustrated by referring to Fig. 15-15. 
If the process mean shifts off-center by 1.5 standard deviations, the PCR, decreases to 


USL-~w_ 60 — 1.50 | 


4.50 


PCR, = 


=15 
30 30 


Assuming a normally distributed process, the fallout of the shifted process is 3.4 parts per 
million. Consequently, the mean of a 6-sigma process can shift 1.5 standard deviations from 
the center of the specifications and still maintain a fallout of 3.4 parts per million. 

In addition, some U.S. companies, particularly the automobile industry, have adopted the 
terminology C, = PCR and C,, = PCR,. Because C,, has another meaning in statistics (in 
multiple regression) we prefer the traditional notation PCR and PCR,. 

We repeat that process capability calculations are meaningful only for stable processes; 
that is, processes that are in control. A process capability ratio indicates whether or not the nat- 
ural or chance variability in a process is acceptable relative to the specifications. 


EXERCISES FOR SECTION 15-5 


15-24. Suppose that a quality characteristic is normally 
distributed with specifications at 100 + 20. The process stan- 
dard deviation is 6. 

(a) Suppose that the process mean is 100. What are the natural 
tolerance limits? What is the fraction defective? Calculate 
PCR and PCR, and interpret these ratios. 

(b) Suppose that the process mean is 106. What are the natu- 
ral tolerance limits? What is the fraction defective? 
Calculate PCR and PCR, and interpret these ratios. 


15-25. Suppose that a quality characteristic is normally 

distributed with specifications from 20 to 32 units. 

(a) What value is needed for o to achieve a PCR of 1.5? 

(b) What value for the process mean minimizes the fraction 
defective? Does this choice for the mean depend on the 
value of o? 


15-26. Suppose that a quality characteristic is normally 

distributed with specifications from 10 to 30 units. The 

process standard deviation is 2 units. 

(a) Calculate the natural tolerance limits, fraction defective, 
PCR, and PCR, when the process mean is 20. 

(b) Suppose the process mean shifts higher by 1.5 standard 
deviations. Recalculate the quantities in part (b). 

(c) Compare the results in parts (a) and (b) and comment on 
any differences. 

15-27. A normally distributed process uses 66.7% of the 

specification band. It is centered at the nominal dimension, lo- 

cated halfway between the upper and lower specification limits. 

(a) Estimate PCR and PCR,. Interpret these ratios. 

(b) What fallout level (fraction defective) is produced? 


15-28. A normally distributed process uses 85% of the 
specification band. It is centered at the nominal dimension, 
located halfway between the upper and lower specification 
limits. 

(a) Estimate PCR and PCR,. Interpret these ratios. 

(b) What fallout level (fraction defective) is produced? 


15-29. Reconsider Exercise 15-1. Suppose that the quality 
characteristic is normally distributed with specification at 
220 + 40. What is the fallout level? Estimate PCR and PCR, 
and interpret these ratios. 


15-30. Reconsider Exercise 15-2, where the specification 

limits are 14.50 + 0.50. 

(a) What conclusions can you draw about the ability of the 
process to operate within these limits? Estimate the per- 
centage of defective items that will be produced. 

(b) Estimate PCR and PCR,. Interpret these ratios. 


15-31. Reconsider Exercise 15-3. Suppose that the variable 
is normally distributed with specifications at 220 + 50. What 
is the proportion out of specifications? Estimate and interpret 
PCR and PCR,. 


15-32. Reconsider Exercise 15-4(a). Assuming that both 
charts exhibit statistical control and that the process specifications 
are at 20 + 5, estimate PCR and PCR, and interpret these ratios. 


15-33. Reconsider Exercise 15-7. Use the revised control 

limits and process estimates. 

(a) Estimate PCR and PCR,. Interpret these ratios. 

(b) What percentage of defectives is being produced by this 
process? 


15-34. Reconsider Exercise 15-8. Given that the specifica- 
tions are at 6.0 + 1.0, estimate PCR and PCR, and interpret 
these ratios. 


15-35. Reconsider Exercise 15-9. Using the process estimates, 
what is the fallout level if the specifications are 16 + 5? Estimate 
PCR and interpret this ratio. 


15-36. Reconsider 15-20. The viscosity specifications are at 
500 + 25. Calculate estimates of the process capability ratios 
PCR and PCR, for this process and provide an interpretation. 


15-37. Suppose that a quality characteristic is normally dis- 
tributed with specifications at 120 + 20. The process standard 
deviation is 6.5. 


@ ©@ @ 


@ 
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(a) Suppose that the process mean is 120. What are the natu- 
ral tolerance limits? What is the fraction defective? 
Calculate PCR and PCR, and interpret these ratios. 

(b) Suppose the process mean shifts off-center by 1.5 stan- 
dard deviations toward the upper specification limit. 
Recalculate the quantities in part (a). 

(c) Compare the results in parts (a) and (b) and comment on 
any differences. 

15-38. Suppose that a quality characteristic is normally dis- 

tributed with specifications at 150 + 20. Natural tolerance 

limits for the process are 150 + 18. 


(c) What fallout level (fraction defective) is produced? 


15-39. An X control chart with three-sigma control limits 
and subgroup size n = 4 has control limits UCL = 28.8 and 
LCL = 24.6. The process specification limits are (24, 32). 

(a) Estimate the process standard deviation. 

(b) Calculate PCR and PCR, for the process. 


15-40. A control chart for individual observations has 
three-sigma control limits UCL = 1.80 and LCL = 1.62. The 
process specification limits are (1.64, 1.84). 

(a) Estimate the process standard deviation. 

(b) Calculate PCR and PCR, for the process. 


(a) Calculate the process standard deviation. 
(b) Calculate PCR and PCR, of the process. Calculate the per- 
centage of the specification width used by the process. 


15-6 ATTRIBUTE CONTROL CHARTS 
15-6.1  P Chart (Control Chart for Proportions) 


Often it is desirable to classify a product as either defective or nondefective on the basis of 
comparison with a standard. This classification is usually done to achieve economy and sim- 
plicity in the inspection operation. For example, the diameter of a ball bearing may be 
checked by determining whether it will pass through a gauge consisting of circular holes cut 
in a template. This kind of measurement would be much simpler than directly measuring the 
diameter with a device such as a micrometer. Control charts for attributes are used in these 
situations. Attribute control charts often require a considerably larger sample size than do 
their variable measurements counterparts. In this section, we discuss the fraction-defective 
control chart, or P chart. Sometimes the P chart is called the control chart for fraction 
nonconforming. 

Suppose D is the number of defective units in a random sample of size n. We assume that 
D is a binomial random variable with unknown parameter p. The fraction defective 


of each sample is plotted on the chart. Furthermore, the variance of the statistic P is 


> _ PU -p) 


Therefore, a P chart for fraction defective could be constructed using p as the center line and 
control limits at 


P(l — p) 


P(1 — p) 


UCL =p +3 LCL = p — 3 (15-22) 


However, the true process fraction defective is almost always unknown and must be estimated 
using the data from preliminary samples. 
Suppose that m preliminary samples each of size n are available, and let D; be the number 


of defectives in the ith sample. Then P; = D,/n is the sample fraction defective in the ith 
sample. The average fraction defective is 
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=> 1 m 1] m 
p= 7a oy ~ inn 2 D; (15-23) 
Now P may be used as an estimator of p in the center line and control limit formulas. 
P Chart 
The center line and upper and lower control limits for the P chart are 
1 -s a 
UCL =p +3 ee) CL=p LCL=p—3 Ae) (15-24) 


where p is the observed value of the average fraction defective. 


These control limits are based on the normal approximation to the binomial distribution. 
When p is small, the normal approximation may not always be adequate. In such cases, we 
may use control limits obtained directly from a table of binomial probabilities. If p is small, 
the lower control limit obtained from the normal approximation may be a negative number. If 
this should occur, it is customary to consider zero as the lower control limit. 


EXAMPLE 15-4 Ceramic Substrate 

Suppose we wish to construct a fraction-defective control 
chart for a ceramic substrate production line. We have 20 pre- 
liminary samples, each of size 100; the number of defectives 
in each sample is shown in Table 15-4. Assume that the sam- 
ples are numbered in the sequence of production. Note 
that p = (800/2000) = 0.40; therefore, the trial parameters 
for the control chart are 


(0.40)(0.60) 
UCL = 0.40 + 3 =0.55 CL=0.40 
100 
(0.40)(0.60) 
LCL = 0.40 — 3 ag Oe 


The control chart is shown in Fig. 15-16. All samples are 
in control. If they were not, we would search for assignable 
causes of variation and revise the limits accordingly. This 
chart can be used for controlling future production. 

Practical Interpretation: Although this process exhibits 
statistical control, its defective rate (p = 0.40) is very poor. 
We should take appropriate steps to investigate the process 
to determine why such a large number of defective units is be- 
ing produced. Defective units should be analyzed to determine 
the specific types of defects present. Once the defect types 
are known, process changes should be investigated to determine 
their impact on defect levels. Designed experiments may be use- 
ful in this regard. 


Table 15-4 Number of Defectives in Samples of 100 


Ceramic Substrates 


Sample No. of Defectives Sample No. of Defectives 
1 44 11 36 
2 48 12 52 
3 32 13 35 
4 50 14 41 
5 29 15 42 
6 31 16 30 
Wf 46 17 46 
8 52 18 38 
9 44 19 26 

10 48 20 30 
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Figure 15-16 P chart 
for a ceramic substrate. 
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Computer software also produces an NP chart. This is just a control chart of nP = D, the 
number of defectives in a sample. The points, center line, and control limits for this chart are 
simply multiples (times n) of the corresponding elements of a P chart. The use of an NP chart 
avoids the fractions in a P chart but it is otherwise equivalent. 


15-6.2. U Chart (Control Chart for Defects per Unit) 


It is sometimes necessary to monitor the number of defects in a unit of product rather than 

the fraction defective. Suppose that in the production of cloth it is necessary to control the 

number of defects per yard or that in assembling an aircraft wing the number of missing 

rivets must be controlled. In these situations we may use the control chart for defects per unit, 

or the U chart. Many defects-per-unit situations can be modeled by the Poisson distribution. 
If each sample consists of n units and there are C total defects in the sample, 


BIO 


is the average number of defects per unit. A U chart may be constructed for such data. 

If the number of defects in a unit is a Poisson random variable with parameter A, the mean 
and variance of this distribution are both \. Each point on the chart is an observed value of U, 
the average number of defects per unit from a sample of 1 units. The mean of U is \ and the 
variance of U is \/n. Therefore, the control limits for the U chart with known } are: 


x 
uct =»+3,/* 


i 
tL) = 34 - (15-25) 


n 


If there are m preliminary samples, and the number of defects per unit in these samples are U,, 


U>,..., U,,, the estimator of the average number of defects per unit is 
U= 2S U, 15-26 
= > (15-26) 


Now JU is used as an estimator of \ in the centerline and control limit formulas. 
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U Chart 


The center line and upper and lower control limits on the U chart are 


UCL =u+3 


where wu is the average number of defects per unit. 


CL=u JCI, = fi = 3} 


(15-27) 


These control limits are based on the normal approximation to the Poisson distribution. 
When X is small, the normal approximation may not always be adequate. In such cases, we 
may use control limits obtained directly from a table of Poisson probabilities. If uw is small, 
the lower control limit obtained from the normal approximation may be a negative number. 
If this should occur, it is customary to use zero as the lower control limit. 


EXAMPLE 15-5 Printed Circuit Boards 

Printed circuit boards are assembled by a combination of 
manual assembly and automation. Surface Mount Technol- 
ogy (SMT) is used to make the mechanical and electrical 
connections of the components to the board. Every hour, 
five boards are selected and inspected for process-control 
purposes. The number of defects in each sample of five 
boards is noted. Results for 20 samples are shown in Table 
15-5. 

The center line for the U chart is 


20 = 20 _ 


Pa 16 


and the upper and lower control limits are 


UCL =u 4 


= 1 = 
64 3,/= 33 
LCL =u vies 3/2 <o 


The control chart is plotted in Fig. 15-17. Because LCL is neg- 
ative, it is set to 0. From the control chart in Fig. 15-17, we see 
that the process is in control. 

Practical Interpretation: Eight defects per group of five 
circuit boards are too many (about 8/5 = 1.6 defects/board), 
and the process needs improvement. An investigation needs to 
be made of the specific types of defects found on the printed 
circuit boards. This will usually suggest potential avenues for 
process improvement. 


=|! 


Computer software also produces a C chart. This is just a control chart of C, the total of 
defects in a sample. The points, center line, and control limits for this chart are simply multi- 
ples (times n) of the corresponding elements of a U chart. The use of a C chart avoids the frac- 
tions that can occur in a U chart but it is otherwise equivalent. 


Table 15-5 Number of Defects in Samples of Five Printed Circuit Boards 


Number of Defects per Number of Defects per 
Sample Defects Unit u; Sample Defects Unit u; 
1 6 1.2 11 9 1.8 
2 4 0.8 12 15 3.0 
3 8 1.6 13 8 1.6 
4 10 2.0 14 10 2.0 
5 g) 1.8 15 8 1.6 
6 12 2.4 16 2 0.4 
7 16 3.2 17 7 1.4 
8 2 0.4 18 1 0.2 
9 3 0.6 19 7 1.4 
10 10 2.0 20 13 2.6 


—* 


i) 
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UCL = 3.3 f 


Defects per unit, wu 
2s 
<< 


Figure 15-17 Uchart 
of defects per unit on 0 
printed circuit boards. 


EXERCISES FOR SECTION 15-6 


12 3 4 5 6 7 8 9 1011 1213 14 15 16 17 18 19 20 
Sample number 


15-41. Anearly example of SPC was described in Jndustrial 
Quality Control [“The Introduction of Quality Control at 
Colonial Radio Corporation” (1944, Vol. 1(1), pp. 4—9)]. The 
following are the fractions defective of shaft and washer assem- 
blies during the month of April in samples of n = 1500 each: 


Fraction Fraction 
Sample Defective Sample Defective 
1 0.11 11 0.03 
2 0.06 12 0.03 
3 0.1 13 0.04 
4 0.11 14 0.07 
5 0.14 15 0.04 
6 0.11 16 0.04 
i 0.14 17 0.04 
8 0.03 18 0.03 
9 0.02 19 0.06 
10 0.03 20 0.06 


(a) Set up a P chart for this process. Is this process in statisti- 
cal control? 

(b) Suppose that instead of n = 1500, n = 100. Use the data 
given to set up a P chart for this process. Revise the con- 
trol limits if necessary. 

(c) Compare your control limits for the P charts in parts (a) and 
(b). Explain why they differ. Also, explain why your assess- 
ment about statistical control differs for the two sizes of n. 

15-42. Suppose the following fraction defective has been 

found in successive samples of size 100 (read down): 

(a) Using all the data, compute trial control limits for a fraction- 
defective control chart, construct the chart, and plot the data. 


0.09 0.03 0.12 
0.10 0.05 0.14 
0.13 0.13 0.06 


0.08 0.10 0.05 
0.14 0.14 0.14 
0.09 0.07 0.11 
0.10 0.06 0.09 
0.15 0.09 0.13 
0.13 0.08 0.12 
0.06 0.11 0.09 


(b) Determine whether the process is in statistical control. If 
not, assume assignable causes can be found and out-of- 
control points eliminated. Revise the control limits. 

15-43. The following are the numbers of defective solder 

joints found during successive samples of 500 solder joints: 


Day No. of Defectives Day No. of Defectives 

1 106 12 37 
2 116 13 25 
3 164 14 88 
4 89 15 101 
5 99 16 64 
6 40 17 51 
vi 112 18 74 
8 36 19 71 
9 69 20 43 

10 74 21 80 

11 42 


(a) Using all the data, compute trial control limits for a 
fraction-defective control chart, construct the chart, and 
plot the data. 

(b) Determine whether the process is in statistical control. If 
not, assume assignable causes can be found and out-of- 
control points eliminated. Revise the control limits. 

15-44. The following represent the number of defects 

per 1000 feet in rubber-covered wire: 1, 1, 3, 7, 8, 10, 5, 13, 0, 


SS 


19, 24, 6, 9, 11, 15, 8, 3, 6, 7, 4, 9, 20, 11, 7, 18, 10, 6, 4, 0, 9, 
7, 3, 1, 8, 12. Do the data come from a controlled process? 
15-45. The following represent the number of solder defects 
observed on 24 samples of five printed circuit boards: 7, 6, 8, 
10, 24, 6, 5,4, 8, 11, 15, 8, 4, 16, 11, 12, 8, 6, 5, 9, 7, 14, 8, 21. 
(a) Using all the data, compute trial control limits for a U con- 
trol chart, construct the chart, and plot the data. 
(b) Can we conclude that the process is in control using a U 
chart? If not, assume assignable causes can be found, list 
points and revise the control limits. 


15-46. Consider the data on the number of earthquakes of 

magnitude 7.0 or greater by year in Exercise 6-71. 

(a) Construct a U chart for this data with a sample size of n = 1. 

(b) Do the data appear to be generated by an in-control 
process? Explain. 


15-47. Ina semiconductor manufacturing company, samples 
of 200 wafers are tested for defectives in the lot. The number 
of defectives in 20 such samples is shown in the following 
table. 


No. of No. of 
Sample Defectives Sample Defectives 
1 44 11 52 
2 63 12 74 
3 40 13 43 
4 35 14 50 
5 29 15 60 
6 56 16 38 
7 40 17 36 
8 38 18 65 
9 74 19 41 
10 66 20 95 
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(a) Set up a P chart for this process. Is the process in statisti- 
cal control? 

(b) Suppose that instead of samples of size 200, we have sam- 
ples of size 100. Use the data to set up a P chart for this 
process. Revise the control limits if necessary. 

(c) Compare the control limits in parts (a) and (b). Explain 
why they differ. 

15-48. The following data are the number of spelling errors 

detected for every 1000 words on a news Web site over 20 

weeks. 


No. of No. of 
Week Spelling Errors Week Spelling Errors 
1 3 11 1 
2 6 12 6 
3 0 13 9 
4 5 14 8 
5 9 15 6 
6 5 16 4 
7 2 17 13 
8 2 18 3 
9 3 19 0 
10 2 20 7 


(a) What control chart is most appropriate for these data? 

(b) Using all the data, compute trial control limits for the 
chart in part (a), construct the chart, and plot the data. 

(c) Determine whether the process is in statistical control. If 
not, assume assignable causes can be found and out-of- 
control points eliminated. Revise the control limits. 


Specifying the control limits is one of the critical decisions that must be made in designing a 
control chart. By moving the control limits further from the center line, we decrease the risk 
of a type I error—that is, the risk of a point falling beyond the control limits, indicating an 
out-of-control condition when no assignable cause is present. However, widening the control 
limits will also increase the risk of a type II error—that is, the risk of a point falling between 
the control limits when the process is really out of control. If we move the control limits closer 
to the center line, the opposite effect is obtained: The risk of type I error is increased, while the 
risk of type II error is decreased. 

The control limits on a Shewhart control chart are customarily located a distance of plus 
or minus three standard deviations of the variable plotted on the chart from the center line. 
That is, the constant k in Equation 15-1 should be set equal to 3. These limits are called 
3-sigma control limits. 
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A way to evaluate decisions regarding sample size and sampling frequency is through the 
average run length (ARL) of the control chart. Essentially, the ARL is the average number of 
points that must be plotted before a point indicates an out-of-control condition. For any Shewhart 
control chart, the ARL can be calculated from the mean of a geometric random variable. Suppose 
that p is the probability that any point exceeds the control limits. Then 


(15-28) 


Thus, for an X chart with 3-sigma limits, p = 0.0027 is the probability that a normally dis- 
tributed point falls outside the limits when the process is in control, so 


1 1 
AR g00eT 


is the average run length of the X chart when the process is in control. That is, even if the process 
remains in control, an out-of-control signal will be generated every 370 points, on the average. 

Consider the piston ring process discussed in Section 15-2.2, and suppose we are sam- 
pling every hour. Thus, we will have a false alarm about every 370 hours on the average. 
Suppose we are using a sample size of n = 5 and that when the process goes out of control the 
mean shifts to 74.0135 millimeters. Then, the probability that X falls between the control lim- 
its of Fig. 15-3 is equal to 


P[73.9865 = X < 74.0135 when pw = 74.0135] 
73.9865 — 74.0135 74.0135 — 74.0135 
= =Zs 


0.0045 0.0045 
= P[-6=Z<0]=05 


Therefore, p in Equation 15-28 is 0.50, and the out-of-control ARL is 


ARL = ae 2 
“PO 05 

That is, the control chart will require two samples to detect the process shift, on the average, 
so two hours will elapse between the shift and its detection (again, on the average). Suppose 
this approach is unacceptable, because production of piston rings with a mean diameter of 
74.0135 millimeters results in excessive scrap costs and delays final engine assembly. How 
can we reduce the time needed to detect the out-of-control condition? One method is to 
sample more frequently. For example, if we sample every half hour, only one hour will 
elapse (on the average) between the shift and its detection. The second possibility is to in- 
crease the sample size. For example, if we use n = 10, the control limits in Fig. 15-3 nar- 
row to 73.9905 and 74.0095. The probability of X falling between the control limits when 
the process mean is 74.0135 millimeters is approximately 0.1, so p = 0.9, and the out-of- 
control ARL is 


1 
ARL => =— =1.11 
0.9 
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Table 15-6 Average Run Length (ARL) for an X Chart 
with 3-Sigma Control Limits 


Magnitude of ARL ARL 
Process Shift n=1 = al 
0 370.4 370.4 

0.50 155.2 43.9 
1.0c 43.9 6.3 
1.50 15.0 2.0 
2.00 6.3 iL 
3.00 2.0 1.0 


Thus, the larger sample size would allow the shift to be detected about twice as quickly as 
the old one. If it became important to detect the shift in approximately the first hour after it oc- 
curred, two control chart designs would work: 


Design 1 


Design 2 


Sample size: n = 5 


Sampling frequency: every half hour 


Sample size: n = 10 
Sampling frequency: every hour 


Table 15-6 provides average run lengths for an X chart with 3-sigma control limits. The aver- 
age run lengths are calculated for shifts in the process mean from 0 to 3.0o and for sample 
sizes of n = 1 andn = 4 by using 1/p, where p is the probability that a point plots outside of 
the control limits. Figure 15-18 illustrates a shift in the process mean of 20. 


EXERCISES FOR SECTION 15-7 


15-49. An X chart uses samples of size 1. The center line is 

at 100 and the upper and lower 3-sigma limits are at 112 and 

88, respectively. 

(a) What is the process 0? 

(b) Suppose the process mean shifts to 96. Find the probability 
that this shift will be detected on the next sample. 

(c) Find the ARL to detect the shift in part (b). 


15-50. An X chart uses samples of size 4. The center line is 

at 100, and the upper and lower 3-sigma control limits are at 

106 and 94, respectively. 

(a) What is the process 0? 

(b) Suppose the process mean shifts to 96. Find the probability 
that this shift will be detected on the next sample. 

(c) Find the ARL to detect the shift in part (b). 


15-51. Consider the X control chart in Fig. 15-3. Suppose 
that the mean shifts to 74.010 millimeters. 


Figure 15-18 Process 


mean shift of 2c. H——> [L+ 20 


(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

15-52. Consider an X control chart with = 0.344, UCL = 

14.708, LCL = 14.312, and n = 5. Suppose that the mean 

shifts to 14.6. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

15-53. Consider an X control chart with ¥ = 34.286, UCL = 

242.780, LCL = 203.220, and n = 5. Suppose that the mean 

shifts to 210. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 
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15-54. Consider an X control chart with ¢ = 1.40, UCL = 

21.71, LCL = 18.29, and n = 6. Suppose that the mean shifts 

to 17. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

15-55. Consider an X control chart with 6 = 2.466, UCL = 

37.404, LCL = 30.780, and n = 5. Suppose that the mean 

shifts to 36. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

15-56. Consider an X control chart with F = 2.25, UCL = 

17.40, LCL = 12.79, and n = 3. Suppose that the mean shifts 

to 13. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

15-57. Consider an X control chart with 7 = 0.000924, 

UCL = 0.0635, LCL = 0.0624, and n = 5. Suppose that the 

mean shifts to 0.0625. 
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15-8.1. Cumulative Sum Control Chart 


(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

15-58. Consider the revised X control chart in Exercise 15-8 

with ¢ = 0.669, UCL = 7.443, LCL = 5.125, and n = 3. 

Suppose that the mean shifts to 5.5. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


15-59. An X chart uses a subgroup of size three. The center 

line is at 200, and the upper and lower three-sigma control lim- 

its are at 212 and 188, respectively. 

(a) Estimate the process o. 

(b) Suppose the process mean shifts to 195. Determine the prob- 
ability that this shift will be detected on the next sample. 

(c) Find the ARL to detect the shift in part (b). 

15-60. Consider an X control chart with UCL = 24.802, 

LCL = 23.792, and n = 3. Suppose the mean shifts to 24.2. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL to detect the shift? 


In Sections 15-3 and 15-4 we have presented basic types of Shewhart control charts. A major 
disadvantage of any Shewhart control chart is that the chart is relatively insensitive to 
small shifts in the process, say, on the order of about 1.50 or less. One reason for this relatively 
poor performance in detecting small process shifts is that the Shewhart chart makes use of 
only the information in the last plotted point, and it ignores the information in the sequence of 
points. This problem can be addressed, to some extent by adding criteria such as the Western 
Electric rules to a Shewhart chart, but the use of these rules reduces the simplicity and ease 
of interpretation of the chart. These rules would also cause the in-control average run length 
of a Shewhart chart to drop below 370. This increase in the false alarm rate can have serious 
practical consequences. 

A very effective alternative to the Shewhart control chart is the cumulative sum control 
chart (or CUSUM). This chart has much better performance (in terms of ARL) for detecting small 
shifts than the Shewhart chart, but it does not cause the in-control ARL to drop significantly. This 
section will illustrate the use of the CUSUM for sample averages and individual measurements. 

The CUSUM chart plots the cumulative sums of the deviations of the sample values from 
a target value. For example, suppose that samples of size n = 1 are collected, and xX; is the 
average of the jth sample. Then if 1p is the target for the process mean, the cumulative sum 
control chart is formed by plotting the quantity 


i 


Si = > &- Bo) 


jal 


(15-29) 


against the sample number i. Now, S; is called the cumulative sum up to and including the ith 
sample. Because they combine information from several samples, cumulative sum charts are 


15-8 TIME-WEIGHTED CHARTS 677 


more effective than Shewhart charts for detecting small process shifts. Furthermore, they are 
particularly effective with samples of n = 1. This makes the cumulative sum control chart a 
good candidate for use in the chemical and process industries where rational subgroups are 
frequently of size 1, as well as in discrete parts manufacturing with automatic measurement of 
each part and online control using a computer directly at the work center. 

If the process remains in control at the target value |1, the cumulative sum defined in Equation 
15-29 should fluctuate around zero. However, if the mean shifts upward to some value pL, > Wo, 
say, an upward or positive drift develops in the cumulative sum S,. Conversely, if the mean shifts 
downward to some (1 < fo, a downward or negative drift in S; develops. Therefore, if a trend de- 
velops in the plotted points either upward or downward, we should consider this as evidence that 
the process mean has shifted, and a search for the assignable cause should be performed. 

This theory can easily be demonstrated by applying the CUSUM to the chemical process 
concentration data in Table 15-3. Since the concentration readings are individual measure- 
ments, we would take X; = X;in computing the CUSUM. Suppose that the target value for the 
concentration is ft) = 99. Then the CUSUM is 


= (X,— 99) + >) (X; — 99) 
j=l 
= (8) a Sey 


Table 15-7 shows the computation of this CUSUM, where the starting value of the 


CUSUM, So, is taken to be zero. Figure 15-19 plots the CUSUM from the last column of Table 
15-7. Notice that the CUSUM fluctuates) around the value of 0. 


Table 15-7 CUSUM Computations for the Chemical Process Concentration Data in Table 15-3 


1 102.0 3.0 3.0 
2 94.8 —4.2 —1.2 
3 98.3 —0.7 FIL) 
4 98.4 —0.6 =2:5 
3) 102.0 3.0 0.5 
6 98.5, —0.5 0.0 
7 99.0 0.0 0.0 
8 97.7 =1,3 =13 
g 100.0 1.0 —0.3 
10 98.1 —0.9 =12 
11 101.3 3) 1.1 
12 98.7 —0.3 0.8 
13 101.1 2.1 2S) 
14 98.4 —0.6 23 
15 97.0 0 0.3 
16 96.7 2.3 —2.0 
17 100.3 1.3 OW 
18 101.4 2.4 1.7 
19 97.2 —1.8 —0.1 


N 
i=) 
— 
j=) 
= 
j=) 


2.0 1.9 
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Figure 15-19 Plot of 


the cumulative sum for z 
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Table 15-7. Observation, i 


The graph in Fig. 15-19 is not a control chart because it lacks control limits. There are 
two general approaches to devising control limits for CUSUMs. The older of these two 
methods is the V-mask procedure. A typical V mask is shown in Fig. 15-20(a). It is a 
V-shaped notch in a plane that can be placed at different locations on the CUSUM chart. The 
decision procedure consists of placing the V mask on the cumulative sum control chart with 
the point O on the last value of s; and the line OP parallel to the horizontal axis. If all the pre- 
vious cumulative sums, 5), 53, ..., 5;-1, lie within the two arms of the V mask, the process is 
in control. The arms are the lines that make angles 8 with segment OP in Figure 15-20(a) and 
they are assumed to extend infinitely in length. However, if any s; lies outside the arms of the 
mask, the process is considered to be out of control. In actual use, the V mask would be 


+6 


+4 
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-4 
1 2 3 4 i 1 5 10 15 20 25 30 
Observation, i 
(a) (b) 


Figure 15-20 The cumulative sum control chart. (a) The V-mask and scaling. (b) The cumulative 
sum control chart in operation. 
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applied to each new point on the CUSUM chart as soon as it was plotted. In the example 
shown in Fig. 15-20(b), an upward shift in the mean is indicated, since at least one of the 
points that have occurred earlier than sample 22 now lies below the lower arm of the mask, 
when the V mask is centered on sample 30. If the point lies above the upper arm, a downward 
shift in the mean is indicated. Thus, the V mask forms a visual frame of reference similar to 
the control limits on an ordinary Shewhart control chart. For the technical details of design- 
ing the V mask, see Montgomery (2009). 

While some computer programs plot CUSUMs with the V-mask control scheme, 
we feel that the other approach to CUSUM control, the tabular CUSUM, is superior. 
The tabular procedure is particularly attractive when the CUSUM is implemented on a 
computer. 

Let S;,(i) be an upper one-sided CUSUM for period i and S,(i) be a lower one-sided 
CUSUM for period 7. These quantities are calculated from 


CUSUM 
Control Chart sui) = max[0, ¥ — (wo + K) + sufi - 1) (15-30) 
and 
where the starting values s,,(0) = s,(0) = 0. 


In Equations 15-30 and 15-31 K is called the reference value, which is usually chosen 
about halfway between the target 1 and the value of the mean corresponding to the 
out-of-control state, 4; = fo + A. That is, K is about one-half the magnitude of the shift we 
are interested in, or 


= 
2 


Notice that $;,(i) and S;(i) accumulate deviations from the target value that are greater than K, 
with both quantities reset to zero upon becoming negative. If either S;,(i) or S;(@) exceeds a 
constant H, the process is out of control. This constant H is usually called the decision interval. 


EXAMPLE 15-6 Chemical Process Concentration CUSUM 
A Tabular CUSUM si(i) = max[0, (uo — K) — x; + sri — 1)] 
We will illustrate the tabular CUSUM by applying it to the = : 

chemical process concentration data in Table 15-7. The = max[0, (99 — 1) — x + si — 1)] 


process target is ty = 99, and we will use K = 1 as the ref- = max[0, 98 — x; + s,(i — 1)] 
erence value and H = 10 as the decision interval. The rea- 
sons for these choices will be explained later. Therefore, for observation 1 the CUSUMs are 

Table 15-8 shows the tabular CUSUM scheme for the 
chemical process concentration data. To illustrate the calcula- Sy(1) = max[0, x; — 100 + s,(0)] 
tions, note that = max[0, 102.0 — 100 + 0] = 2.0 

sp(i) = max[0, x; — (Mo + K) + sx(i — 1)] and 
= max([0, x; — (99 + 1) + sy(i — 1)] s_(1) = max[0, 98 — x, + s,(0)] 


= max[0, x; — 100 + sy(i — 1)] = max[0, 98 — 102.0 + 0] = 0 
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Table 15-8 The Tabular CUSUM for the Chemical Process Concentration Data 


Observation Upper CUSUM Lower CUSUM 
i x; x; — 100 Sy) Ny 98 — x; 5, (i) Ny, 
1 102.0 2.0 2.0 1 —4.0 0.0 0 
2 94.8 =5.2 0.0 0 3.2 3.2 1 
3 98.3 =jlo7/ 0.0 0 —0.3 2.9 2 
4 98.4 —1.6 0.0 0 —0.4 2.5 3 
5 102.0 2.0 2.0 1 —4.0 0.0 0 
6 98.5 —1.5 0.5 2 —0.5 0.0 0 
7 99.0 —1.0 0.0 0 —1.0 0.0 0 
8 97.7 —2.3 0.0 0 0.3 0.3 1 
9 100.0 0.0 0.0 0 2.0) 0.0 0 
10 98.1 —1.9 0.0 0 —0.1 0.0 0 
11 101.3 13} 1.3 1 Bho) 0.0 0 
12 98.7 —1.3 0.0 0 —0.7 0.0 0 
13 101.1 1.1 1.1 1 —3.1 0.0 0 
14 98.4 —1.6 0.0 0 —0.4 0.0 0 
15 97.0 —=3)40) 0.0 0 1.0 1.0 1 
16 96.7 —3.3 0.0 0 1.3 2.3 2 
17 100.3 0.3 0.3 1 8) 0.0 0 
18 101.4 1.4 1.7 2 —3.4 0.0 0 
19 97.2 —2.8 0.0 0 0.8 0.8 1 
20 101.0 1.0 1.0 0 —3.0 0.0 0 


as shown in Table 15-8. The quantities n, and n, in Table 15-8 Next Steps: The limits for the CUSUM charts may be used 
indicate the number of periods that the CUSUM s,,(i) or s;(i) to continue to operate the chart in order to monitor future 
have been nonzero. Notice that the CUSUMs in this example productions. 

never exceed the decision interval H = 10. We would there- 

fore conclude that the process is in control. 


When the tabular CUSUM indicates that the process is out of control, we should search 
for the assignable cause, take any corrective actions indicated, and restart the CUSUMs at 
zero. It may be helpful to have an estimate of the new process mean following the shift. This 
can be computed from 


Sp(Z) 


Joo + K + Tn? if sy(i) > H 
(15-32) 


=> 
II 
iz) 
nt 
on 
wa 


Bo — K - ny? if s,(i) > H 


It is also useful to present a graphical display of the tabular CUSUMs, which are 
sometimes called CUSUM status charts. They are constructed by plotting s,,(i) and s,(i) ver- 
sus the sample number. Figure 15-21 shows the CUSUM status chart for the data in Example 
15-6. Each vertical bar represents the value of s,,(i) and s,(i) in period i. With the decision in- 
terval plotted on the chart, the CUSUM status chart resembles a Shewhart control chart. We 
have also plotted the sample statistics x; for each period on the CUSUM status chart as the 


Figure 15-21 The 
CUSUM status chart 
for Example 15-6. 
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solid dots. This frequently helps the user of the control chart to visualize the actual process 
performance that has led to a particular value of the CUSUM. 

The tabular CUSUM is designed by choosing values for the reference value K and the 
decision interval H. We recommend that these parameters be selected to provide good average 
run-length values. There have been many analytical studies of CUSUM ARL performance. 
Based on these studies, we may give some general recommendations for selecting H and K. 
Define H = hoy and K = kox, where oy is the standard deviation of the sample variable used 
in forming the CUSUM (if n = 1, og = oy). Using h = 4 or h = 5 andk = 1/2 will gener- 
ally provide a CUSUM that has good ARL properties against a shift of about lox (or lox) in 
the process mean. If much larger or smaller shifts are of interest, set k = 5/2, where 6 is the size 
of the shift in standard deviation units. 

To illustrate how well the recommendations of h = 4 or h = 5 with k = 1/2 work, con- 
sider these average run lengths in Table 15-9. Notice that a shift of lox would be detected in 


Table 15-9 Average Run Lengths for a CUSUM Control Chart 


with k = 1/2 

Shift in Mean 

(multiple of ox) h=4 h=5 
0 168 465 
0.25 74.2 139 
0.50 26.6 38.0 
0.75 13.3 17.0 
1.00 8.38 10.4 
1.50 4.75 5.75 
2.00 3.34 4.01 
2.50 2.62 3.11 
3.00 2.19 2.57 


4.00 1.71 2.01 
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either 8.38 samples (with k = 1/2 and h = 4) or 10.4 samples (with k = 1/2 and h = 5). By 
comparison, Table 15-6 shows that an X chart would require approximately 43.9 samples, on 
the average, to detect this shift. 

These design rules were used for the CUSUM in Example 15-6. We assumed that the 
process standard deviation o = 2. (This is a reasonable value; see Example 15-2.) Then with 
k = 1/2 and h = 5, we would use 


K=ko ='A(2)=1 and H=ho =5(2)= 10 


in the tabular CUSUM procedure. 

Finally, we should note that supplemental procedures such as the Western Electric rules 
cannot be safely applied to the CUSUM, because successive values of S;,(i) and S;(i) are not 
independent. In fact, the CUSUM can be thought of as a weighted average, where the weights 
are stochastic or random. In effect, all the CUSUM values are highly correlated, thereby 
causing the Western Electric rules to give too many false alarms. 


15-8.2 Exponentially Weighted Moving Average Control Chart 


Data collected in time order is often averaged over several time periods. For example, eco- 
nomic data is often presented as an average over the last four quarters. That is, at time f the 
average of the last four measurements can be written as 


= 1 1 1 
x(4) = 4° + Ae + 4*t-2 + 4*t-3 


This average places weight of 1/4 on each of the most recent observations, and zero weight 
on older observations. It is called a moving average and in this case a window of size 4 is used. 
An average of the recent data is used to smooth the noise in the data to generate a better estimate 
of the process mean than only the most recent observation. However, in a dynamic environment 
where the process mean may change, the number of observations used to construct the average 
is kept to a modest size so that the estimate can adjust to any change in the process mean. 
Therefore, the window size is a compromise between a better statistical estimate from an average 
and a response to a mean change. If a window of size 10 were used in a moving average, the sta- 
tistic x,(10) would have lower variability, but it would not adjust as well to a change in mean. 

For statistical process control, rather than use a fixed window size it is useful to place the 
greatest weight on the most recent observation or subgroup average, and then gradually 
decrease the weight on older observations. One average of this type can be constructed by a 
multiplicative decrease in the weights. Let \ = | denote a constant and jy denote the process 
target or historical mean. Suppose that samples of size n = 1 are collected and x, is the average 
of the sample at time t. The exponentially weighted moving average (EWMA) is 


Zp = AX, + ACL — A), + ACL — A)?H,_2 ++ FAC — A) IX, + (1 — A} 
t 
= SAL — Ale + (1 — Ayo 
k=0 


Each older observation has its weight decreased by the factor (1 — A). The weight on the 
starting value jo is selected so that the weights sum to one. Here z, is also sometimes called a 
geometric average. 


Figure 15-22 
EWMAs with \ = 0.8 
and \ = 0.2 show a 
compromise between a 
smooth curve and a 
response to a shift. 


EWMA Update 
Equation 
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The value of \ determines the compromise between noise reduction and response to a 
mean change. For example, the series of weights when A = 0.8 are 


0.8, 0.16, 0.032, 0.0064, 0.00128, ... 
and when A = 0.2 the weights are 
0.2, 0.16, 0.128, 0.1024, 0.0819, ... 


When X = 0.8 the weights decrease rapidly. Most of the weight is placed on the most recent 
observation, with modest contributions to the EWMA from older measurements. In this case, 
the EWMA does not average noise much, but it responds quickly to a mean change. However, 
when A = 0.2 the weights decrease much more slowly and the EWMA has substantial contri- 
butions from the more recent observations. In this case, the EWMA averages noise more, but 
it responds more slowly to a change in the mean. Fig. 15-22 displays a series of observations 
with a mean shift in the middle on the series. Notice that the EWMA with \ = 0.2 smooths 
the data more, but that the EWMA with \ = 0.8 adjusts the estimate to the mean shift more 
quickly. 

It appears that it is difficult to calculate an EWMA because at every time ¢ a new weighted 
average of all previous data is required. However, there is an easy method to calculate an 
EWMA z, based on a simple recursive equation. Let z) = po. Then it can be shown that 


z= de, + (1 — Nyy (15-33) 


Consequently, only a brief computation is needed at each time t. 

To develop a control chart from an EWMA, control limits are needed for Z,. The control 
limits are defined in a straightforward manner. They are placed at three standard deviations 
around the mean of the plotted statistic Z,. This follows the general approach for a control 
chart in Equation 15-1. An EWMA control chart may be applied to individual measurements 
as an extension to an X chart or to subgroup averages. Formulas here are developed for the 
more general case with an average from a subgroup of size n. For individual measurements 
n= 1. 
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Because Z, is a linear function of the independent observations X,, X>,...,X;, (and j9), 
the results from Chapter 5 can be used to show that 


o 
E(Z,) = wo and V4) = SF 


re  - 0-04 


where n is the subgroup size. Therefore an EWMA control chart uses estimates of jg and o in 
the following formulas: 


EWMA 
Control Chart o r ot 
LCL = wo ~ 3Sf5* [1 = (1 =)" 
CL = po (15-34) 


Note that the control limits are not of equal width about the centerline. The control limits are 
calculated from the variance of Z, and that changes with time. However, for large ¢ the variance 
of Z, converges to 


im wz) = =(20) 
a : nm\2—X 


so that the control limits tend to be parallel lines about the centerline as ¢ increases. 
The parameters 19 and o are estimated by the same statistics used in X or X charts. That 
is, for subgroups 


and forn = 1 


EXAMPLE 15-7 Chemical Process Concentration EWMA 


Consider the concentration data shown in Table 15-3. Construct 


an EWMA control chart with X = 0.2 with n = 1. It was deter- d ! z 2 ss 2 
mined that x = 99.1 and mr = 2.59. Therefore, jig = 99.1 and x; 102.0 94.8 98.3 98.4 102.0 
G = 2.59/1.128 = 2.30. The control limits for z, are Zs 99.68 98.70 98.62 98.58 99.26 
LCL 97.72 97.33 97.12 97.00 96.93 
0.2 UCL 100.48 100.87 101.08 101.20 101.27 
LCL = 99.1 — 3(2.30) [1 — (1 — 0.2)"| = 98.19 
2 -— 0.2 
0.2 > 
LCL = 99.1 + 3(2.30) 2 Ti (1 — 0.2) = 100.01 The chart generated by Minitab is shown in Figure 15-23. 
Notice that the control limits widen as time increases but 
The first few values of z, along with the corresponding control quickly stabilize. Each point is within its set of corresponding 


limits are control limits so there are no signals from the chart. 


Figure 15-23 
EWMaA control chart 
for the chemical 
process concentration 
data from Minitab. 


15-8 TIME-WEIGHTED CHARTS 685 


EWMA Chart of X 


UCL = 101.390 
101 r— 


san ll ASSL? X= 99.095 


97 L., 
LCL = 96.800 


2 4 6 8 10 12 14 16 18 20 
Sample 


EWMA 
/ 
/ 
\ 
{ 
> 


Similar to a CUSUM chart, the points plotted on an EWMA control chart are not independent. 
Therefore, run rules should not be applied to an EWMA control chart. Information in the his- 
tory of the data that is considered by run rules is to a large extent incorporated into the EWMA 
that is calculated at each time ¢. 

The value of A is usually chosen from the range 0.1 < ’ < 0.5. A common choice is 
dX = 0.2. Smaller values for \ provide more sensitivity for small shifts and larger values bet- 
ter tune the chart for larger shifts. This performance can be seen in the average run lengths in 
Table 15-10. These calculations are more difficult than those used for Shewhart charts, and de- 
tails are omitted. Here, \ = 0.1 and 0.5 are compared. The multiplier of the standard devia- 
tion, denoted L in the table, is adjusted so that the average run length (ARL) equals 500 for 
both choices for \. That is, the control limits are placed at E(Z,) + LV V(Z,) and L is chosen 
so the ARL without a mean shift is 500 in both cases. 

The EWMA ARLs in the table indicate that the smaller value for \ is preferred when the 
magnitude of the shift is small. Also, the EWMA performance is in general much better than 
results for a Shewhart control chart (in Table 15-6) and the results are comparable to a 
CUSUM control chart (in Table 15-9). However, these are average results. At the time of an 
increase in the process mean, z, might be negative and there would be some performance 
penalty to first increase z, to near zero and then further increase it to a signal above the UCL. 
Such a penalty provides an advantage to CUSUM control charts that is not accounted for in 
these ARL tables. A more refined analysis can be used to quantify this penalty, but the 
conclusion is that the EWMA penalty is moderate to small in most applications. 


Table 15-10 Average Run Lengths for an EWMA 


Control Chart 
Shift in Mean X= 0.5 X= 0.1 
(multiple of ox) L= 3.07 L=2.81 
0 500 500 
0.25 255 106 
0.5 88.8 31.3 
0.75 35.9 15.9 
1 WS 10.3 
1.5 6.53 6.09 
2 3.63 4.36 


3 1.93 2.87 
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EXERCISES FOR SECTION 15-8 


15-61. The following data were considered in Quality 
Engineering [‘“Parabolic Control Limits for The Exponentially 
Weighted Moving Average Control Charts in Quality 
Engineering” (1992, Vol. 4(4))]. In a chemical plant, the data for 
one of the quality characteristics (viscosity) were obtained for 
each 12-hour batch at the completion of the batch. The results of 
15 consecutive measurements are shown in the table below. 


Batch Viscosity Batch Viscosity 
1 13.3 11 14.3 
2 14.5 12 16.1 
3 15.3 13 13.1 
4 15.3 14 15.5 
5 14.3 15 12.6 
6 14.8 
q 15.2 
8 14.9 
9 14.6 

10 14.1 


(a) Set up a CUSUM control chart for this process. Assume 
the desired process target 14.1. Does the process appear to 
be in control? 

(b) Suppose that the next five observations are 14.6, 15.3, 
15.7, 16.1, and 16.8. Apply the CUSUM in part (a) to 
these new observations. Is there any evidence that the 
process has shifted out of control? 

15-62. The purity of a chemical product is measured every two 

hours. The results of 20 consecutive measurements are as follows: 


Sample Purity Sample Purity 
1 89.11 11 88.55 
2 90.59 12 90.43 
3 91.03 13 91.04 
4 89.46 14 88.17 
5 89.78 15 91.23 
6 90.05 16 90.92 
7 90.63 17 88.86 
8 90.75 18 90.87 
9 89.65 19 90.73 

10 90.15 20 89.78 


(a) Set up a CUSUM control chart for this process. Use 
o = 0.8 in setting up the procedure, and assume that the 
desired process target is 90. Does the process appear to 
be in control? 

(b) Suppose that the next five observations are 90.75, 90.00, 
91.15, 90.95, and 90.86. Apply the CUSUM in part (a) to 
these new observations. Is there any evidence that the 
process has shifted out of control? 


Sample Diameter Sample Diameter 
1 9.94 14 9.99 
2 9.93 15 10.12 
3 10.09 16 9.81 
4 9.98 17 9.73 
5 10.11 18 10.14 
6 9.99 19 9.96 
W 10.11 20 10.06 
8 9.84 21 10.11 
9 9.82 2D, 9.95 

10 10.38 23 9.92 
11 9.99 24 10.09 
12 10.41 25 9.85 
13 10.36 


(a) Estimate the process standard deviation. 

(b) Set up a CUSUM control procedure, assuming that the 
target diameter is 10.0 millimeters. Does the process 
appear to be operating in a state of statistical control at the 
desired target level? 


15-64. The concentration of a chemical product is mea- 
sured by taking four samples from each batch of material. The 
average concentration of these measurements is shown for the 
last 20 batches in the following table: 


Batch Concentration Batch Concentration 
1 104.5 11 95.4 
2 99.9 12 94.5 
3 106.7 13 104.5 
4 105.2 14 99.7 
5 94.8 15 97.7 
6 94.6 16 97.0 
7 104.4 17 95.8 
8 99.4 18 97.4 
9 100.3 19 99.0 

10 100.3 20 102.6 


(a) Suppose that the process standard deviation is o = 8 and 
that the target value of concentration for this process is 
100. Design a CUSUM scheme for the process. Does the 
process appear to be in control at the target? 

(b) How many batches would you expect to be produced with 
off-target concentration before it would be detected by the 
CUSUM control chart if the concentration shifted to 104? 
Use Table 15-9. 


15-65. 


Consider a CUSUM with h = 5 andk = 1/2. Samples 
are taken every two hours from the process. The target value for 
the process is fg = 50 and o = 2. Use Table 15-9. 


‘jm 15-63. The diameter of holes is measured in consecutive order 
‘= by an automatic sensor. The results of measuring 25 holes follow. 


SS 


(a) If the sample size is n = 1, how many samples would be 
required to detect a shift in the process mean to p = 51 on 
average? 

(b) If the sample size is increased to n = 4, how does this 
affect the average run length to detect the shift to yw = 51 
that you determined in part (a)? 

15-66. Consider the purity data in Exercise 15-62. Use 

o = 0.8 and assume that the desired process target is 90. 

(a) Construct an EWMA control chart with A = 0.2. Does the 
process appear to be in control? 

(b) Construct an EWMA control chart with A = 0.5. Compare 
your results to part (a). 

(c) Suppose that the next five observations are 90.75, 90.00, 
91.15, 90.95, and 90.86. Apply the EWMAs in part (a) and 
(b) to these new observations. Is there any evidence that 
the process has shifted out of control? 

15-67. Consider the diameter data in Exercise 15-63. 

Assume that the desired process target is 10.0 millimeters. 

(a) Estimate the process standard deviation. 

(b) Construct an EWMA control chart with \ = 0.2. Does 
the process appear to be in control? 

(c) Construct an EWMA control chart with 
Compare your results to part (a). 

15-68. Consider the concentration data in Exercise 15-64. 

Use o = 8 and assume that the desired process target is 100. 

(a) Construct an EWMA control chart with A = 0.2. Does the 
process appear to be in control? 

(b) Construct an EWMA control chart with A = 0.5. 
Compare your results to part (a). 

(c) If the concentration shifted to 104, would you prefer the 
chart in part (a) or (b)? Explain. 

15-69. Consider an EMWA control chart. The target value 

for the process is tp = 50 and o = 2. Use Table 15-10. 

(a) If the sample size is n = 1, would you prefer an 
EWMA chart with X = 0.1 and L = 2.81 or A = 0.5 and 
L = 3.07 to detect a shift in the process mean to p = 52 
on average? Why? 

(b) If the sample size is increased to n = 4, which chart in 
part (a) do you prefer? Why? 

(c) If an EWMA chart with A = 0.1 and L = 2.81 is used, 
what sample size is needed to detect a shift to ~» = 52 in 
approximately 3 samples on average? 


X= 0.5. 


15-70. A process has a target of 4) = 100 and a standard 
deviation of o = 4. Samples of size n = 1 are taken every two 
hours. Use Table 15-9. 

(a) Suppose the process mean shifts to » = 102. How many 
hours of production will occur before the process shift is 
detected by a CUSUM with h = 5 and k = 1/2? 

(b) It is important to detect the shift defined in part (a) more 
quickly. A proposal is made to reduce the sampling 
frequency to 0.5 hour. How will this affect the CUSUM 
control procedure? How much more quickly will the shift 
be detected? 

(c) Suppose that the 0.5 hour sampling interval in part (b) is 
adopted. How often will false alarms occur with this new 
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sampling interval? How often did they occur with the old 
interval of two hours? 

(d) A proposal is made to increase the sample size ton = 4 and 
retain the two-hour sampling interval. How does this sug- 
gestion compare in terms of average detection time to the 
suggestion of decreasing the sampling interval to 0.5 hour? 


15-71. Heart rate (in counts/minute) is measured every 30 
minutes. The results of 20 consecutive measurements are as 
follows: 


Sample Heart Rate Sample Heart Rate 
1 68 11 719 
2 71 12 719 
3 67 13 78 
4 69 14 78 
5 71 15 78 
6 70 16 79 
7 69 17 719 
8 67 18 82 
9 70 19 82 

10 70 20 81 


Suppose that the standard deviation of the heart rate is o = 3 

and the target value is 70. 

(a) Design a CUSUM scheme for the heart rate process. Does 
the process appear to be in control at the target? 

(b) How many samples on average would be required to 
detect a shift of the mean heart rate to 80? 


15-72. The number of influenza patients (in thousands) vis- 
iting hospitals weekly are shown in the following table. 
Suppose that the standard deviation is o = 2 and the target 
value is 160. 


Number Number 
Sample of Patients Sample of Patients 

1 162.27 13 159.989 
2 157.47 14 159.09 

3 157.065 15 162.699 
4 160.45 16 163.89 

> 157.993 17 164.247 
6 162.27 18 162.70 

q 160.652 19 164.859 
8 159.09 20 163.65 
9 157.442 21 165.99 
10 160.78 22 163.22 

11 159.138 23 164.338 
12 161.08 24 164.83 


(a) Design a CUSUM scheme for the process. Does the 
process appear to be in control at the target? 

(b) How many samples on average would be required to de- 
tect a shift of the mean to 165? 


688 


CHAPTER 15 STATISTICAL QUALITY CONTROL 


15-9 OTHER SPC PROBLEM-SOLVING TOOLS 


Figure 15-24 Pareto 
diagram for printed 
circuit board defects. 


While the control chart is a very powerful tool for investigating the causes of variation in 
a process, it is most effective when used with other SPC problem-solving tools. In this sec- 
tion we illustrate some of these tools, using the printed circuit board defect data in 
Example 15-5. 

Figure 15-17 shows a U chart for the number of defects in samples of five printed circuit 
boards. The chart exhibits statistical control, but the number of defects must be reduced. The 
average number of defects per board is 8/5 = 1.6, and this level of defects would require 
extensive rework. 

The first step in solving this problem is to construct a Pareto diagram of the individual 
defect types. The Pareto diagram, shown in Fig. 15-24, indicates that insufficient solder and 
solder balls are the most frequently occurring defects, accounting for (109/160) 100 = 68% 
of the observed defects. Furthermore, the first five defect categories on the Pareto chart are all 
solder-related defects. This points to the flow solder process as a potential opportunity for 
improvement. 

To improve the surface mount process, a team consisting of the operator, the shop super- 
visor, the manufacturing engineer responsible for the process, and a quality engineer meets to 
study potential causes of solder defects. They conduct a brainstorming session and produce 
the cause-and-effect diagram shown in Fig. 15-25. The cause-and-effect diagram is widely 
used to display the various potential causes of defects in products and their interrelationships. 
They are useful in summarizing knowledge about the process. 

As a result of the brainstorming session, the team tentatively identifies the following vari- 
ables as potentially influential in creating solder defects: 


1. Flux specific gravity 
2. Reflow temperature 


3. Squeegee speed 
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Figure 15-26 Defect 
concentration diagram 
for a printed circuit 
board. 
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Figure 15-25 Cause-and-effect diagram for the printed circuit board flow solder process. 


4. Squeegee angle 
5. Paste height 
6. Reflow temperature 


7. Board loading method 


A statistically designed experiment could be used to investigate the effect of these seven vari- 
ables on solder defects. 

In addition, the team constructed a defect concentration diagram for the product. A defect 
concentration diagram is just a sketch or drawing of the product, with the most frequently oc- 
curring defects shown on the part. This diagram is used to determine whether defects occur in 
the same location on the part. The defect concentration diagram for the printed circuit board is 
shown in Fig. 15-26. This diagram indicates that most of the insufficient solder defects are near 
the front edge of the board. Further investigation showed that one of the pallets used to carry the 
boards was bent, causing the front edge of the board to make poor contact with the squeegee. 

When the defective pallet was replaced, a designed experiment was used to investigate 
the seven variables discussed earlier. The results of this experiment indicated that several of 
these factors were influential and could be adjusted to reduce solder defects. After the results 
of the experiment were implemented, the percentage of solder joints requiring rework was 
reduced from 1% to under 100 parts per million (0.01%). 


Front 


Region of insufficient solder 


Back 
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15-10 IMPLEMENTING SPC 


The methods of statistical process control can provide significant payback to those companies 
that can successfully implement them. While SPC seems to be a collection of statistically 
based problem-solving tools, there is more to the successful use of SPC than simply learning 
and using these tools. Management involvement and commitment to the quality-improvement 
process is the most vital component of SPC’s potential success. Management is a role model, 
and others in the organization will look to management for guidance and as an example. A 
team approach is also important, for it is usually difficult for one person alone to introduce 
process improvements. Many of the “magnificent seven” problem-solving tools are helpful in 
building an improvement team, including cause-and-effect diagrams, Pareto charts, and defect 
concentration diagrams. The basic SPC problem-solving tools must become widely known 
and widely used throughout the organization. Continuous training in SPC and quality im- 
provement is necessary to achieve this widespread knowledge of the tools. 

The objective of an SPC-based quality-improvement program is continuous improve- 
ment on a weekly, quarterly, and annual basis. SPC is not a one-time program to be applied 
when the business is in trouble and later abandoned. Quality improvement must become part 
of the culture of the organization. 

The control chart is an important tool for process improvement. Processes do not natu- 
rally operate in an in-control state, and the use of control charts is an important step that must 
be taken early in an SPC program to eliminate assignable causes, reduce process variability, 
and stabilize process performance. To improve quality and productivity, we must begin to 
manage with facts and data, and not just rely on judgment. Control charts are an important part 
of this change in management approach. 

In implementing a company-wide SPC program, we have found that the following ele- 
ments are usually present in all successful efforts: 


. Management leadership 


. A team approach 


1 

2 

3. Education of employees at all levels 
4. Emphasis on continuous improvement 
5 


. A mechanism for recognizing success 


We cannot overemphasize the importance of management leadership and the team approach. 
Successful quality improvement is a “top-down” management-driven activity. It is also im- 
portant to measure progress and success and to spread knowledge of this success throughout 
the organization. When successful improvements are communicated throughout the company, 
this can provide motivation and incentive to improve other processes and to make continuous 
improvement a normal part of the way of doing business. 

The philosophy of W. Edwards Deming provides an important framework for implement- 
ing quality and productivity improvement. Deming’s philosophy is summarized in his 14 
points for management. The adherence to these management principles has been an important 
factor in Japan’s industrial success and continues to be the catalyst in that nation’s quality- and 
productivity-improvement efforts. This philosophy has also now spread rapidly in the West. 
Deming’s 14 points are as follows. 


1. Create a constancy of purpose focused on the improvement of products and 
services. Constantly try to improve product design and performance. Investment 
in research, development, and innovation will have a long-term payback to the 
organization. 


2. 


10. 


11. 


12. 
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Adopt a new philosophy of rejecting poor workmanship, defective products, or 
bad service. It costs as much to produce a defective unit as it does to produce a 
good one (and sometimes more). The cost of dealing with scrap, rework, and other 
losses created by defectives is an enormous drain on company resources. 


Do not rely on mass inspection to “control’’ quality. All inspection can do is sort 
out defectives, and at this point it is too late because we have already paid to pro- 
duce these defectives. Inspection occurs too late in the process, it is expensive, and 
it is often ineffective. Quality results from the prevention of defectives through 
process improvement, not inspection. 


Do not award business to suppliers on the basis of price alone, but also consider 
quality. Price is a meaningful measure of a supplier’s product only if it is consid- 
ered in relation to a measure of quality. In other words, the total cost of the item 
must be considered, not just the purchase price. When quality is considered, the 
lowest bidder is frequently not the low-cost supplier. Preference should be given to 
suppliers who use modern methods of quality improvement in their business and 
who can demonstrate process control and capability. 


Focus on continuous improvement. Constantly try to improve the production and 
service system. Involve the workforce in these activities and make use of statisti- 
cal methods, particularly the SPC problem-solving tools discussed in the previous 
section. 


Practice modern training methods and invest in training for all employees. 
Everyone should be trained in the technical aspects of their job, as well as in mod- 
ern quality- and productivity-improvement methods. The training should encourage 
all employees to practice these methods every day. 


Practice modern supervision methods. Supervision should not consist merely of 
passive surveillance of workers, but should be focused on helping the employees 
improve the system in which they work. The first goal of supervision should be to 
improve the work system and the product. 


Drive out fear. Many workers are afraid to ask questions, report problems, or point 
out conditions that are barriers to quality and effective production. In many organi- 
zations the economic loss associated with fear is large; only management can elim- 
inate fear. 


Break down the barriers between functional areas of the business. Teamwork 
among different organizational units is essential for effective quality and productiv- 
ity improvement to take place. 


Eliminate targets, slogans, and numerical goals for the workforce. A target such 
as “zero defects” is useless without a plan as to how to achieve this objective. In 
fact, these slogans and “programs” are usually counterproductive. Work to improve 
the system and provide information on that. 


Eliminate numerical quotas and work standards. These standards have histori- 
cally been set without regard to quality. Work standards are often symptoms of 
management’s inability to understand the work process and to provide an effective 
management system focused on improving this process. 


Remove the barriers that discourage employees from doing their jobs. 
Management must listen to employee suggestions, comments, and complaints. The 


= 


| 
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13. 


14. 


person who is doing the job is the one who knows the most about it, and usually has 
valuable ideas about how to make the process work more effectively. The workforce 
is an important participant in the business, and not just an opponent in collective 
bargaining. 


Institute an ongoing program of training and education for all employees. 
Education in simple, powerful statistical techniques should be mandatory for all 
employees. Use of the basic SPC problem-solving tools, particularly the control 
chart, should become widespread in the business. As these charts become wide- 
spread, and as employees understand their uses, they will be more likely to look for 
the causes of poor quality and to identify process improvements. Education is a way 
of making everyone partners in the quality-improvement process. 


Create a structure in top management that will vigorously advocate the first 13 
points. 


As we read Deming’s 14 points, we notice two things. First, there is a strong emphasis on 
change. Second, the role of management in guiding this change process is of dominating impor- 
tance. But what should be changed, and how should this change process be started? For exam- 
ple, if we want to improve the yield of a semiconductor manufacturing process, what should we 
do? It is in this area that statistical methods most frequently come into play. To improve the semi- 
conductor process, we must determine which controllable factors in the process influence the 
number of defective units produced. To answer this question, we must collect data on the process 
and see how the system reacts to changes in the process variables. Statistical methods, including 
the SPC and experimental design techniques in this book, can contribute to this knowledge. 


SUPPLEMENTAL EXERCISES 


15-73. The diameter of fuse pins used in an aircraft engine 18 64.006 64.010 64.018 
application is an important quality characteristic. Twenty-five 19 63.984 64.002 64.003 
samples of three pins each are shown as follows: 20 64.000 64.010 64.013 
21 63.988 64.001 64.009 
22 64.004 63.999 63.990 
Sample 
sie Din a a 
: eeoeg cane anh 2S) 63.982 63.984 63.995 
2 63.995 63.992 64.001 
3 63.988 64.024 64.021 
4 64.002 63.996 63.993 
5 63.992 64.007 64.015 = ; : 
6 64.009 63.994 63.997 (a) Set up X and R charts for this process. If necessary, revise 
7 63.995 64.006 63.994 limits so that no observations are out-of-control. 
8 63.985 64.003 63.993 (b) Estimate the process mean and standard deviation. 
(c) Suppose the process specifications are at 64 + 0.02. 
z 64.008 63.995 64.009 Calculate an estimate of PCR. Does the process meet a 
10 63.998 74.000 63.990 minimum capability level of PCR = 1.33? 
iW 63.994 63.998 63.994 (d) Calculate an estimate of PCR,. Use this ratio to draw con- 
12 64.004 64.000 64.007 clusions about process capability. 
13 63.983 64.002 63.998 (e) To make this process a six-sigma process, the variance o” 
14 64.006 63.967 63.994 ieee pee to be ieee eis a PCR, = 2.0. What 
should this new variance value be? 
= Cue solu! Ges (f) Suppose the mean shifts to 64.01. What is the probability 
16 64.000 63.984 64.005 that this shift will be detected on the next sample? What is 
17 63.994 64.012 63.986 the ARL after the shift? 


15-74. Rework Exercise 15-73 with X and S charts. 


15-75. Plastic bottles for liquid laundry detergent are 
formed by blow molding. Twenty samples of n = 100 bottles 
are inspected in time order of production, and the fraction de- 
fective in each sample is reported. The data are as follows: 


Sample Fraction Defective 
1 0.12 
2 0.15 
3 0.18 
4 0.10 
5 0.12 
6 0.11 
7 0.05 
8 0.09 
9 0.13 

10 0.13 
11 0.10 
12 0.07 
13) 0.12 
14 0.08 
15 0.09 
16 0.15 
17 0.10 
18 0.06 
19 0.12 
20 0.13 


(a) Set up a P chart for this process. Is the process in statisti- 
cal control? 

(b) Suppose that instead of n = 100, n = 200. Use the data 
given to set up a P chart for this process. Revise the con- 
trol limits if necessary. 

(c) Compare your control limits for the P charts in parts (a) and 
(b). Explain why they differ. Also, explain why your assess- 
ment about statistical control differs for the two sizes of n. 


15-76. Cover cases for a personal computer are manufac- 
tured by injection molding. Samples of five cases are taken 
from the process periodically, and the number of defects is 
noted. Twenty-five samples follow: 


Sample No. of Defects Sample No. of Defects 
1 3 7 2; 
2 Z 8 4 
3 0 9 1 
4 1 10 0 
5 4 11 2) 
6 3 12 3 
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13 2 20 0 
14 8 21 2 
15 0 DD) 1 
16 2 23 9 
17 4 24 3 
18 3 25 2 
19 EB) 


(a) Using all the data, find trial control limits for a U chart for 
the process. 

(b) Use the trial control limits from part (a) to identify out-of- 
control points. If necessary, revise your control limits. 

(c) Suppose that instead of samples of five cases, the sample 
size was 10. Repeat parts (a) and (b). Explain how this 
change alters your answers to parts (a) and (b). 

15-77. An article in Quality Engineering [“Is the Process 

Capable? Tables and Graphs in Assessing C,,,” (1992, Vol. 

4(4))]. Considered manufacturing data. Specifications for the 

outer diameter of the hubs were 60.3265 + 0.001 mm. A ran- 

dom sample with size n = 20 was taken and the data are 
shown in the following table. 


Sample x Sample ae e 
1 60.3262 11 60.3262 = 
2 60.3262 12 60.3262 
3 60.3262 13 60.3269 
4 60.3266 14 60.3261 
5 60.3263 15 60.3265 
6 60.3260 16 60.3266 
Wl 60.3262 17 60.3265 
8 60.3267 18 60.3268 
9 60.3263 19 60.3262 
10 60.3269 20 60.3266 


(a) Construct a control chart for individual measurements. 
Revise the control limits if necessary. 

(b) Compare your chart in part (a) to one that uses only the 
last (least significant) digit of each diameter as the mea- 
surement. Explain your conclusion. 

(c) Estimate » and o from the moving range of the revised 
chart and use this value to estimate PCR and PCR, and 
interpret these ratios. 

15-78. The following data from the U.S. Department of 

Energy Web site (http://www.eia.doe.gov) reported the total 

U.S. renewable energy consumption by year (trillion BTU) 

from 1973 to 2004. 
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Total Renewable Total Renewable 
Energy Energy 
Consumption Consumption 

Year (Trillion BTU) Year (Trillion Btu) 
1973 4433.121 1989 6294.209 
1974 4769.395 1990 6132.572 
1975 4723.494 1991 6158.087 
1976 4767.792 1992 5907.147 
1977 4249.002 1993 6155.959 
1978 5038.938 1994 6064.779 
1979 5166.379 1995 6669.261 
1980 5494.42 1996 7136.799 
1981 5470.574 1997 7075.152 
1982 5985.352 1998 6560.632 
1983 6487.898 1999 6598.63 
1984 6430.646 2000 6158.232 
1985 6032.728 2001 5328.335 
1986 6131.542 2002 5835.339 
1987 5686.932 2003 6081.722 
1988 5488.649 2004 6116.287 


(a) Using all the data, find trial control limits for a control 
chart for individual measurements, construct the chart, 
and plot the data. 

(b) Do the data appear to be generated from an in-control 
process? Comment on any patterns on the chart. 

15-79. The following dataset was considered in Quality 

Engineering (“Analytic Examination of Variance Components” 

(1994-1995, Vol. 7(2))]. A quality characteristic for cement 

mortar briquettes was monitored. Samples of size n = 6 were 

taken from the process, and 25 samples from the process are 
shown in the following table. 

(a) Using all the data, find trial control limits for X and S 
charts. Is the process in control? 


Batch x Ss 
1 572.00 73.25 
2 583.83 79.30 
3} 720.50 86.44 
4 368.67 98.62 
5 374.00 92.36 
6 580.33 93.50 
vii 388.33 110.23 
8 559.33 74.79 
9 562.00 76.53 
10 729.00 49.80 
11 469.00 40.52 
12 566.67 113.82 


13 578.33 58.03 
14 485.67 103.33 
15 746.33 107.88 
16 436.33 98.69 
17 556.83 99.25 
18 390.33 117.35 
19 562.33 75.69 
20 675.00 90.10 
21 416.50 89.27 
22 568.33 61.36 
23 762.67 105.94 
24 786.17 65.05 
25) 530.67 99.42 


(b) Suppose that the specifications are at 580 + 250. What 
statements can you make about process capability? 
Compute estimates of the appropriate process capability 
ratios. 

(c) To make this process a “6-sigma process,” the variance o 
would have to be decreased such that PCR, = 2.0. What 
should this new variance value be? 

(d) Suppose the mean shifts to 600. What is the probability 
that this shift will be detected on the next sample? What is 
the ARL after the shift? 

15-80. Suppose that an X control chart with 2-sigma limits 

is used to control a process. Find the probability that a false 

out-of-control signal will be produced on the next sample. 

Compare this with the corresponding probability for the chart 

with 3-sigma limits and discuss. Comment on when you 

would prefer to use 2-sigma limits instead of 3-sigma limits. 


15-81. Consider the diameter data in Exercise 15-73. 

(a) Construct an EWMA control chart with \ = 0.2 and 
L = 3. Comment on process control. 

(b) Construct an EWMA control chart with \ = 0.5 and 
L = 3 and compare your conclusion to part (a). 

15-82. Consider the renewable energy data in Exercise 15-78. 

(a) Construct an EWMA control chart withA = 0.2 and L = 3. 
Do the data appear to be generated from an incontrol 
process? 

(b) Construct an EWMA control chart with \ = 0.5 and 
L = 3 and compare your conclusion to part (a). 

15-83. Consider the hub data in Exercise 15-77. 

(a) Construct an EWMA control chart with \ = 0.2 and 
L = 3. Comment on process control. 

(b) Construct an EWMA control chart with \ = 0.5 and 
L = 3 and compare your conclusion to part (a). 

15-84. Consider the data in Exercise 15-18. Set up a CUSUM 

scheme for this process assuming that . = 16 is the process 

target. Explain how you determined your estimate of o and the 

CUSUM parameters K and H. 


2 


@ 


@ 


@ 


15-85. Consider the hardness measurement data in Exercise 
15-17. Set up a CUSUM scheme for this process using p» = 50 
and o = 2,so that K = 1 and H = 10. Is the process in control? 


15-86. Reconsider the viscosity data in Exercise 15-20. 
Construct a CUSUM control chart for this process using fy = 
500 as the process target. Explain how you determined your 
estimate of o and the CUSUM parameters H and K. 


15-87. The following data were considered in Quality 
Progress [“Digidot Plots for Process Surveillance” (1990, 
May, pp. 66—68)]. Measurements of center thickness (in mils) 
from 25 contact lenses sampled from the production process at 
regular intervals are shown in the following table. 


Sample i Sample a 
1 0.3978 14 0.3999 
0.4019 15 0.4062 
3 0.4031 16 0.4048 
4 0.4044 17 0.4071 
5 0.3984 18 0.4015 
6 0.3972 19 0.3991 
a 0.3981 20 0.4021 
8 0.3947 21 0.4009 
9 0.4012 22) 0.3988 
10 0.4043 23 0.3994 
11 0.4051 24 0.4016 
12 0.4016 25 0.4010 
13 0.3994 


(a) Construct a CUSUM scheme for this process with the 

target 9 = 0.4. Explain how you determined your esti- 

mate of o and the CUSUM parameters H and K. Is the 
process in control? 

Construct an EWMA control chart with \ = 0.5 and 

L = 3 and compare your conclusions to part (a). 

15-88. Suppose that a process is in control and an X chart is 

used with a sample size of 4 to monitor the process. Suddenly 

there is a mean shift of 1.50. 

(a) If 3-sigma control limits are in use on the X chart, what is 
the probability that this shift will remain undetected for 
three consecutive samples? 

(b) If 2-sigma control limits are in use on the X chart, what is 
the probability that this shift will remain undetected for 
three consecutive samples? 

(c) Compare your answers to parts (a) and (b) and explain 
why they differ. Also, which limits you would recommend 
using and why? 


(b) 


15-89. Consider the control chart for individuals with 

3-sigma limits. 

(a) Suppose that a shift in the process mean of magnitude 
o occurs. Verify that the ARL for detecting the shift is 
ARL = 43.9. 
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(b) Find the ARL for detecting a shift of magnitude 20 in the 
process mean. 

(c) Find the ARL for detecting a shift of magnitude 3o in the 
process mean. 

(d) Compare your answers to parts (a), (b), and (c) and ex- 
plain why the ARL for detection is decreasing as the mag- 
nitude of the shift increases. 


15-90. Consider a control chart for individuals, applied to a 
continuous 24-hour chemical process with observations taken 
every hour. 

(a) If the chart has 3-sigma limits, verify that the in-control 
ARL is ARL = 370. How many false alarms would occur 
each 30-day month, on the average, with this chart? 

(b) Suppose that the chart has 2-sigma limits. Does this re- 
duce the ARL for detecting a shift in the mean of magni- 
tude o? (Recall that the ARL for detecting this shift with 
3-sigma limits is 43.9.) 

(c) Find the in-control ARL if 2-sigma limits are used on the 
chart. How many false alarms would occur each month 
with this chart? Is this in-control ARL performance satis- 
factory? Explain your answer. 


15-91. The depth of a keyway is an important part quality char- 
acteristic. Samples of size n = 5 are taken every four hours from 
the process and 20 samples are summarized in the following table. 
(a) Using all the data, find trial control limits for X¥ and R 
charts. Is the process in control? 

Use the trial control limits from part (a) to identify out-of- 
control points. If necessary, revise your control limits. 
Then, estimate the process standard deviation. 

Suppose that the specifications are at 140 + 2. Using the 
results from part (b), what statements can you make about 
process capability? Compute estimates of the appropriate 
process capability ratios. 

To make this process a “6-sigma process,” the variance o* 
would have to be decreased such that PCR, = 2.0. What 
should this new variance value be? 


(b) 


(c 


VY 


(d) 


Sample X r 
1 139.7 el 
2 139.8 1.4 
3} 140.0 1.3 
4 140.1 1.6 
5 139.8 0.9 
6 139.9 1.0 
7 139.7 1.4 
8 140.2 1.2 
9 139.3 1.1 

10 140.7 1.0 
11 138.4 0.8 
12 138.5 0.9 

continued 


s 
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Sample K r 
13 137.9 12 
14 138.5 1.1 
15 140.8 1.0 
16 140.5 1.3 
17 139.4 1.4 
18 139.9 1.0 
19 137.5 1.5 
20 139.2 1.3 


(e) Suppose the mean shifts to 139.7. What is the probability 
that this shift will be detected on the next sample? What is 
the ARL after the shift? 


15-92. Consider a control chart for individuals with 3-sigma 
limits. What is the probability that there will not be a signal in 
three samples? In six samples? In 10 samples? 


15-93. Suppose a process has a PCR = 2, but the mean is 
exactly three standard deviations above the upper specifica- 
tion limit. What is the probability of making a product outside 
the specification limits? 


15-94. A process is controlled by a P chart using samples of 

size 100. The center line on the chart is 0.05. 

(a) What is the probability that the control chart detects a shift 
to 0.08 on the first sample following the shift? 

(b) What is the probability that the control chart does not de- 
tect a shift to 0.08 on the first sample following the shift, 
but does detect it on the second sample? 

(c) Suppose that instead of a shift in the mean to 0.08, the 
mean shifts to 0.10. Repeat parts (a) and (b). 

(d) Compare your answers for a shift to 0.08 and for a shift to 
0.10. Explain why they differ. Also, explain why a shift to 
0.10 is easier to detect. 


15-95. Suppose the average number of defects in a unit is 
known to be 8. If the mean number of defects in a unit shifts to 
16, what is the probability that it will be detected by a U chart 
on the first sample following the shift 

(a) if the sample size isn = 4? 

(b) if the sample size isn = 10? 


Use a normal approximation for U. 


15-96. Suppose the average number of defects in a unit is 
known to be 10. If the mean number of defects in a unit shifts 
to 14, what is the probability that it will be detected by a U 
chart on the first sample following the shift 

(a) if the sample size is n = 1? 

(b) if the sample size is n = 4? 

Use a normal approximation for U. 


15-97. An EWMA chart with \ = 0.5 and L = 3.07 is to 

be used to monitor a process. Suppose that the process mean 

is to = 10 ando = 2. 

(a) Assume that n = 1. What is the ARL without any shift 
in the process mean? What is the ARL to detect a shift to 
p = 12. 

(b) Assume that n = 4. Repeat part (a) and comment on your 
conclusions. 


15-98. The following table provides the costs for gasoline 
by month in the U.S. over recent years and the percentage of 
the cost due to refining, distribution and marketing, taxes, 
and crude oil. The table is from the U.S. Department of 
Energy Web site (http://tonto.eia.doe. gov/oog/info/gdu/ 
gaspump.html). There is some concern that the refining or 
distribution and marketing percentages of the retail price have 
shown patterns over time. 

(a) Construct separate control charts for the refining percent- 
age of the retail price and the distribution and marketing 
percentage of the retail price. Use control charts for indi- 
vidual measurements. Comment on any signs of assigna- 
ble causes on these charts. 

(b) Construct a control chart for the crude oil percentage of the 
retail price. Use a control chart for individual measurements. 
Comment on any signs of assignable causes on this chart. 

(c) Another way to study the data is to calculate refining, dis- 
tribution and marketing, and tax as costs directly. The 
costs of these categories might not depend strongly on the 
crude oil cost. Use the percentages provided in the table to 
calculate the cost each month associated with refining and 
distribution and marketing. Construct separate control 
charts for the refining and the distribution and marketing 
costs each month. Use control charts for individual mea- 
surements. Comment on any signs of assignable causes on 
these charts and comment on any differences between 
these charts and the ones constructed in part (a). 


What We Pay For in a Gallon of Regular Gasoline 


Retail Price Refining Distribution & Marketing Taxes Crude Oil 
Mo/Year (Dollars per gallon) (percentage) (percentage) (percentage) (percentage) 
Jan-00 1.289 7.8 13.0 32.1 47.1 
Feb-00 1.377 17.9 75 30.1 44.6 
Mar-00 1.517 15.4 12.8 Dales) 44.6 
Apr-00 1.465 10.1 20.2 28.3 41.4 
May-00 1.485 20.2 9.2 27.9 42.7 


15-10 IMPLEMENTING SPC 


697 


What We Pay For in a Gallon of Regular Gasoline 


Mo/Year 


Jun-00 
Jul-00 
Aug-00 
Sep-00 
Oct-00 
Nov-00 
Dec-00 
Jan-01 
Feb-01 
Mar-01 
Apr-01 
May-01 
Jun-01 
Jul-01 
Aug-01 
Sep-01 
Oct-01 
Nov-01 
Dec-01 
Jan-02 
Feb-02 
Mar-02 
Apr-02 
May-02 
Jun-02 
Jul-02 
Aug-02 
Sep-02 
Oct-02 
Nov-02 
Dec-02 
Jan-03 
Feb-03 
Mar-03 
Apr-03 
May-03 
Jun-03 
Jul-03 
Aug-03 
Sept-03 
Oct-03 
Nov-03 
Dec-03 
Jan-04 
Feb-04 
Mar-04 


Retail Price 
(Dollars per gallon) 


1.633 
1.551 
1.465 
1.550 
1.532 
1.517 
1.443 
1.447 
1.450 
1.409 
1.552 
1.702 
1.616 
1.421 
1.421 
1.522 
1.315 
1.171 
1.086 
1.107 
1.114 
1.249 
1.397 
1.392 
1.382 
1.397 
1.396 
1.400 
1.445 
1.419 
1.386 
1.458 
1.613 
1.693 
1.589 
1.497 
1.493 
1.513 
1.620 
1.679 
1.564 
1.512 
1.479 
1.572 
1.648 
1.736 


Refining 
(percentage) 


22.2 
13.2 
15.8 
15.4 
13.7 
10.4 

8.0 
17.8 
17.3 
18.8 
31.6 
26.4 
13.2 
10.0 
20.0 
18.0 
10.0 
10.0 
11.7 
13.0 
12.1 
19.4 
15.5 
11.9 
15.0 
15.0 
11.4 
10.8 
13.9 
11.1 
11.7 
11.5 
15.0 
14.8 
13.2 
15.3 
15.1 
15.3 
22.5 
13.9 
14.9 
11.7 
11.5 
15.9 
19.1 
19.0 


Distribution & Marketing 
(percentage) 


8.8 
15.8 

qs 

9.0 
10.1 
11.8 
17.9 
10.4 
11.0 

9.7 

4.6 
14.0 
24.1 
20.0 

9.0 
17.0 
20.8 
18.0 
12.7 
11.8 
11.2 

6.1 
13.0 
14.2 
13.0 
12.6 
13.4 
12.6 
11.7 
18.0 
12.3 
10.3 

9.5 
14.8 
19.8 
16.3 
12.3 
11.9 

8.2 
DOI 
16.1 
15.3 
12.6 

9.9 

9.2 
11.3 


Taxes Crude Oil 
(percentage) (percentage) 
25.8 43.1 
ile?) 43.8 
28.8 47.8 
DY 48.3 
27.5 48.6 
27.8 50.0 
29.2 44.8 
29.2 42.7 
29.1 42.6 
30.0 41.5 
27.1 36.7 
24.7 35.0 
26.0 36.7 
30.0 40.0 
30.0 41.0 
28.0 37.0 
31.9 37.2 
36.0 36.0 
38.7 36.9 
379) B72 
37.7 39.1 
33.6 40.9 
30.1 41.4 
30.2 43.7 
30.4 41.6 
30.1 42.3 
30.0 45.0 
30.0 46.7 
29.1 45.3 
29.6 41.3 
30.3 45.7 
28.8 49.4 
26.0 49.5 
24.8 45.5 
26.4 40.5 
28.1 40.4 
28.1 44.5 
27.8 44.9 
25.9 43.3 
25.0 38.3 
26.9 42.2 
27.8 45.2 
28.4 47.5 
26.7 47.5 
25.5 46.2 
24.2 45.5 


continued 
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What We Pay For in a Gallon of Regular Gasoline 


Retail Price Refining Distribution & Marketing Taxes Crude Oil 
Mo/Year (Dollars per gallon) (percentage) (percentage) (percentage) (percentage) 
Apr-04 1.798 22.0 9.9 23.4 44.6 
May-04 1.983 30.6 7.8 212 40.4 
Jun-04 1.969 21.3 16.7 21.3 40.7 
Jul-04 1.911 20.9 11.3 21.9 45.8 
Aug-04 1.878 13.9 12.2 22.4 51.5 
Sep-04 1.870 14.8 9.1 De 53.6 
Oct-04 2.000 13.0 9.3 21.0 56.7 
Nov-04 1.979 10.7 14.6 212 53.6 
Dec-04 1.841 8.9 18.1 23.9 49.1 
Jyan-05 1.831 Goll Ted 24.0 50.9 
Feb-05 1.910 16.1 9.3 23.0 51.6 
Mar-05 2.079 19.3 6.2 21.2 53.4 
Apr-05 2.243 20.9 9.6 19.6 49.8 
May-05 2.161 17.9 12.8 20.4 49.0 
Jun-05 2.156 18.5 6.9 20.4 54.2 
Jul-05 2.290 17.9 8.0 19.2 54.9 
Aug-05 2.486 24.3 2.1 17.7 55.9 
Sep-05 2.903 D3} WS 15.2 50.0 
Oct-05 2.717 15.1 17.8 16.2 50.9 
Nov-05 DST 8.3 13.1 19.5 57.1 
Dec-05 2.185 13.5 7.9 20.1 58.4 
Jan-06 2.316 13.4 6.6 19.8 60.1 
Feb-06 2.280 9.8 11.4 20.1 58.6 
Mar-06 2.425 Dio 4.5 18.9 54.8 
Apr-06 2.742 25.8 3.1 16.7 54.2 
May-06 2.907 2129 8.8 15.8 53.4 
Jun-06 2.885 22.0 7.9 15.9 54.1 
Jul-06 2.981 26.3 6.3 15.4 52.0 
Aug-06 2.952 15.2 13.5 15.9 55.4 
Sep-06 2555 6.3 18.8 18.3 56.7 
Oct-06 2.245 10.9 10.6 20.8 57.7 
Nov-06 2229 14.6 VS 20.4 ) 
Dec-06 2.313 12.9 9.4 19.7 58.0 
Jan-07 2.240 10.6 15.2 20.3 539 
Feb-07 2.278 18.0 5.8 20.0 56.3 
Mar-07 2.563 23.6 8.5 ISS) 2) 
Apr-07 2.845 28.1 7.6 14.0 50.3 
May-07 3.146 27.9 13.3 27 46.1 
Jun-07 3.056 22.7 13.7 13.0 50.5 
Jul-07 2.965 18.4 11.4 13.4 56.8 
Aug-07 2.786 13.5 11.8 14.3 60.4 
Sep-07 2.803 12.8 8.6 14.2 64.3 
Oct-07 2.803 10.1 8.1 14.2 67.6 
Nov-07 3.080 10.0 8.7 13.0 68.3 
Dec-07 3.018 8.1 10.5 13.2 68.1 


Jan-08 3.043 78 11.1 13.1 67.9 
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What We Pay For in a Gallon of Regular Gasoline 


Retail Price Refining Distribution & Marketing Taxes Crude Oil 
Mo/Year (Dollars per gallon) (percentage) (percentage) (percentage) (percentage) 
Feb-08 3.028 9.9 72 13.2 69.7 
Mar-08 3.244 8.0 eo 12.3 71.8 
Apr-08 3.458 10.0 5.8 11.5 72.7 
May-08 3.766 10.0 47 10.6 74.7 
Jun-08 4.054 8.5 6.8 9.8 74.8 
Jul-08 4.062 3) 11.2 9.8 75.8 
Aug-08 3.779 6.1 10.2 10.6 73.1 
Sep-08 3.703 14.2 8.2 10.8 66.8 
Oct-08 3.051 3.3 25.0 13.1 58.6 
Nov-08 2.147 So 24.7 18.6 60.4 
Dec-08 1.687 0.7 19.5 23.6 56.2 
Jan-09 1.788 13.4 10.7 ps) 53.6 
Feb-09 1.923 16.8 14.9 20.7 47.6 
Mar-09 1.959 12.0 12.3 20.4 Sb) 
Apr-09 2.049 12.0 12.1 19.5 56.4 
May-09 2.266 17.6 =39) 19.5 66.8 
Jun-09 2.631 13.7 10.3 15.1 60.9 
Jul-09 Davey 10.1 14.0 15.8 60.1 
Aug-09 2.616 11.0 9.7 15.4 63.9 
Sep-09 2.554 6.7 13h5) 15.7 64.0 
fm) 15-99. The following table shows the number of e-mails a (c) Use the trial control limits from part (b) to identify out-of- 
= student received at each hour from 8:00 A.M. to 6:00 PM. control points. If necessary, revise your control limits, as- 
The samples are collected for five days from Monday to suming that any samples that plot outside the control lim- 
Friday. its can be eliminated. 


15-100. The following are the number of defects observed i} 
on 15 samples of transmission units in an automotive manu- 


Hour M T Ww Th F . . ae : 
facturing company. Each lot contains five transmission units. 
1 2 2 2, 3 1 
Z 2 4 0 1 2 
Bl? Ee eee 
4 4 4 3 3 > ample efects ample efects 
5 1 1 2 2 1 : E ul S 
6 1 3 2 2 l 2 10 12 10 
7 3 2 1 1 0 3) 24 13 11 
8 2 3 2 3 1 ad : es 7 
9 1 3 3 2 0 2 2 5 s 
10 2 3 2 3 0 ° a 
7 10 
8 7 
(a) Use the rational subgrouping principle to comment on 9 9 
why an X chart that plots one point each hour with a sub- 10 15 


group of size 5 is not appropriate. 

(b) Construct an appropriate attribute control chart. Use all 
the data to find trial control limits, construct the chart, and (a) Using all the data, compute trial control limits for a U con- 
plot the data. trol chart, construct the chart, and plot the data. 
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(b) Determine whether the process is in statistical control. If 
not, assume assignable causes can be found and out-of- 
control points eliminated. Revise the control limits. 


15-101. Consider an X control chart with UCL = 32.802, 

UCL = 24.642, and n = 5. Suppose the mean shifts to 30. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL to detect the shift? 

15-102. The number of visits (in millions) on a Web site is 

recorded every day. The following table shows the samples for 

25 consecutive days. 


7 10.46 19 12.53 

8 10.58 20 10.76 

9 9.95 21 11.92 
10 9.50 22 13.24 
11 11.26 23 10.64 
12 10.02 24 11.31 
13 10.95 25 11.26 
14 8.99 26 11.79 
15 9.50 27 10.53 
16 9.66 28 11.82 
17 10.42 29 11.47 
18 11.30 30 11.76 


(a) Estimate the process standard estimation. 
(b) Set up a CUSUM control chart for this process, assuming 
the target is 10. Does the process appear to be in control? 


MIND-EXPANDING EXERCISES 


Number Number 
Sample of Visits Sample of Visits 
1 10.12 4 5 
2 9.92 5 9.60 
3 9.76 6 8.60 
15-103. Suppose a process is in control, and 3-sigma 


control limits are in use on the X chart. Let the mean 
shift by 1.50. What is the probability that this shift will 
remain undetected for three consecutive samples? What 
would its probability be if 2-sigma control limits were 
used? The sample size is 4. 


15-104. Consider an X control chart with k-sigma 
control limits. Develop a general expression for the 
probability that a point will plot outside the control lim- 
its when the process mean has shifted by 6 units from 
the center line. 


15-105. Suppose that an X chart is used to control a 
normally distributed process and that samples of size n 
are taken every n hours and plotted on the chart, which 
has k-sigma limits. 

(a) Find a general expression for the expected number 
of samples and time that will be taken until a false 
action signal is generated. 

(b) Suppose that the process mean shifts to an out-of- 
control state, say 4; = Wo + do. Find an expres- 
sion for the expected number of samples that 
will be taken until a false action is generated. 

(c) Evaluate the in-control ARL for k = 3. How, does 
this change if k = 2? What do you think about the 
use of 2-sigma limits in practice? 

(d) Evaluate the out-of-control ARL for a shift of 1 
sigma, given that n = 5. 


15-106. Suppose a P chart with center line at p with 
k-sigma control limits is used to control a process. There 
is a critical fraction defective p, that must be detected 
with probability 0.50 on the first sample following the shift 
to this state. Derive a general formula for the sample 
size that should be used on this chart. 


15-107. Suppose that a P chart with center line at p 
and k-sigma control limits is used to control a process. 
What is the smallest sample size that can be used on this 
control chart to ensure that the lower control limit is 
positive? 

15-108. A process is controlled by a P chart using 
samples of size 100. The center line on the chart is 0.05. 
What is the probability that the control chart detects a 
shift to 0.08 on the first sample following the shift? 
What is the probability that the shift is detected by at 
least the third sample following the shift? 


15-109. Consider a process where specifications on a 
quality characteristic are 100 + 15. We know that the 
standard deviation of this normally distributed quality 
characteristic is 5. Where should we center the process 
to minimize the fraction defective produced? Now sup- 
pose the mean shifts to 105 and we are using a sample 
size of 4 on an X chart. What is the probability that such 
a shift will be detected on the first sample following the 
shift? What is the average number of samples until an 
out-of-control point occurs? Compare this result to the 
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MIND-EXPANDING EXERCISES 


average number of observations until a defective occurs 
(assuming normality). 

15-110. NP Control Chart. An alternative to the con- 
trol chart for fraction defective is a control chart based on 
the number of defectives, or the NP control chart. The 
chart has centerline at np, and the control limits are 


UCL = np + 3V np(1 — p) 
LCL = np — 3V np(1 — p) 


and the number of defectives for each sample is plotted 
on the chart. 

(a) Verify that the control limits given above are correct. 
(b) Apply this control chart to the data in Example 15-4. 
(c) Will this chart always provide results that are equiv- 

alent to the usual P chart? 

15-111. C Control Chart. An alternative to the U 
chart is a chart based on the number of defects. The 
chart has center line at nu, and the control limits are 


UCL = na + 3V nu 


LCL = na — 3 nu 


(a) Apply this chart to the data in Example 15-5. 

(b) Will this chart always provide results equivalent to 
the U chart? 

15-112. Standardized Control Chart. Consider the 

P chart with the usual 3-sigma control limits. Suppose 

that we define a new variable: 


P,—P 
Z,=—= 


‘  |P(1 — P) 


as the quantity to plot on a control chart. It is proposed 
that this new chart will have a center line at 0 with the 
upper and lower control limits at +3. Verify that this 
standardized control chart will be equivalent to the orig- 
inal P chart. 

15-113. Unequal Sample Sizes. One application of 
the standardized control chart introduced in Exercise 
15-112 is to allow unequal sample sizes on the control 
chart. Provide details concerning how this procedure 
would be implemented and illustrate using the follow- 
ing data: 


Sample, i 1 2 3 4 5 6 7 8 9 10 
Nj 20 DS 20 aS 50 30 2) 25 ws) 20 
Di 0.2 0.16 0.25 0.08 0.3 0.1 0.12 0.16 0.12 0.15 
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IMPORTANT TERMS AND CONCEPTS 


ARL 

Assignable causes 

Attributes control 
charts 

Average run length 

C chart 

Cause-and-effect 
diagram 

Center line 

Chance causes 

Control chart 

Control limits 

Cumulative sum control 
chart 


Defect concentration 
diagram 
Defects-per-unit chart 
Deming’s 14 points 
Exponentially 
weighted moving 
average control 
chart (EWMA) 
False alarm 
Fraction-defective 
control chart 
Implementing SPC 
Individuals control 
chart (X chart) 


Moving range 

NP chart 

P chart 

Pareto diagram 

PCR 

PCR, 
Problem-solving tools 
Process capability 
Process capability ratio 
Quality control 

R chart 

Rational subgroup 
Run rule 

S chart 


Shewhart control chart 

Six-sigma process 

Specification limits 

Statistical process 
control (SPC) 

Statistical quality 
control 

U chart 

V mask 

Variables control charts 

Warning limits 

Western Electric rules 

X chart 
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Table I Summary of Common Probability Distributions 


Probability Section 
Name Distribution Mean Variance in Book 
Discrete 
Cai 1 ; (b + a) (b-at+1/’-1 a5 
= = ee — 
niform nr 7 D 
Binomial (") p(l — p)"* np np(1 — p) 3-6 
x 
x=0,1,....171,0Spsl1 
Geometric (1 — py 'p 1/p (1 — p)/p’ 3-7 
y= 2 OS pa 
— 1 
Negative binomial & ') (1 — py 'p" r/p r(1 — p)/p? 3-7 
— 
x=rnrti1,r+2,...,.0Sp=l 
oles) 
H tri ee (1 (4=*) 3-8 
ergeometric eed n, np(1 — - 
yperg (") Pp PLSD) ea 
K 
n ih ees 
where p N 
x = max(0,n — N+ K),1,... 
min(K,n), K =N,n=N 
. e AAt 
Poisson xX =0,1,2,...,0<A IN nN 3-9 
x! 
Continuous 
1 (b + a) (b — ay 
if ,asxs 4- 
Uniform — asx=x=b 5 D 5 
1 If (x=B)2 
Normal 1 AU") bu o 4-6 
oV2T 
—-D2<I KKH -MKC P< HO<Oo 
Exponential he*\05x%,0<A 1/r 1/a? 4-8 
Nx" Lew Ae ‘5 
Erlang — —— + 0 = er =H 1225.55. r/X r/X 4-9.1 
(r—- 1)! 
Ax’ leat : 
Gamma =a QO<=RO<ZOZA r/X r/X 4-9.2 
r 
Bot 1 2 
Weibull £(2) e eh ar (1 + *) er (1 + 2) 4-10 
5 \8 B B 
1 2 
0<x0<B,0<8 -e[r(1+4)| 
aon 1 ac 6 2 rs 2 ve 
Tosrennal 1 exo( [ 205 ] ) oro? e20+0'(e0"_1) Al] 
xw V 27 20 
T(a + Qa 
Beta ore xe — yb! = ; B 4-12 
Pa) P(B) a+B (a + B)'(a +B +1) 
0O=x=1,0<a,0<8 
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Table II Cumulative Binomial Probabilities P(X = x) 
IP 
n i 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 
1 0 0.9000 0.8000 0.7000 0.6000 0.5000 0.4000 0.3000 0.2000 0.1000 0.0500 0.0100 
2 0 0.8100 0.6400 0.4900 0.3600 0.2500 0.1600 0.0900 0.0400 0.0100 0.0025 0.0001 
1 0.9900 0.9600 0.9100 0.8400 0.7500 0.6400 0.5100 0.3600 0.1900 0.0975 0.0199 
3 0 0.7290 0.5120 0.3430 0.2160 0.1250 0.0640 0.0270 0.0080 0.0010 0.0001 0.0000 
1 0.9720 0.8960 0.7840 0.6480 0.5000 0.3520 0.2160 0.1040 0.0280 0.0073 0.0003 
2 0.9990 0.9920 0.9730 0.9360 0.8750 0.7840 0.6570 0.4880 0.2710 0.1426 0.0297 
4 0 0.6561 0.4096 0.2401 0.1296 0.0625 0.0256 0.0081 0.0016 0.0001 0.0000 0.0000 
1 0.9477 0.8192 0.6517 0.4752 0.3125 0.1792 0.0837 0.0272 0.0037 0.0005 0.0000 
2 0.9963 0.9728 0.9163 0.8208 0.6875 0.5248 0.3483 0.1808 0.0523 0.0140 0.0006 
3. 0.9999 0.9984 0.9919 0.9744 0.9375 0.8704 0.7599 0.5904 0.3439 0.1855 0.0394 
5 0 0.5905 0.3277 0.1681 0.0778 0.0313 0.0102 0.0024 0.0003 0.0000 0.0000 0.0000 
1 0.9185 0.7373 0.5282 0.3370 0.1875 0.0870 0.0308 0.0067 0.0005 0.0000 0.0000 
2 0.9914 0.9421 0.8369 0.6826 0.5000 0.3174 0.1631 0.0579 0.0086 0.0012 0.0000 
3. 0.9995 0.9933 0.9692 0.9130 0.8125 0.6630 0.4718 0.2627 0.0815 0.0226 0.0010 
4 1.0000 0.9997 0.9976 0.9898 0.6988 0.9222 0.8319 0.6723 0.4095 0.2262 0.0490 
6 0 0.5314 0.2621 0.1176 0.0467 0.0156 0.0041 0.0007 0.0001 0.0000 0.0000 0.0000 
1 0.8857 0.6554 0.4202 0.2333 0.1094 0.0410 0.0109 0.0016 0.0001 0.0000 0.0000 
2 0.9842 0.9011 0.7443 0.5443 0.3438 0.1792 0.0705 0.0170 0.0013 0.0001 0.0000 
3. 0.9987 0.9830 0.9295 0.8208 0.6563 0.4557 0.2557 ~=0.0989 =-0.0159 0.0022 0.0000 
4 0.9999 0.9984 0.9891 0.9590 0.9806 0.7667 0.5798 0.3446 0.1143 0.0328 0.0015 
5 1.0000 0.9999 0.9993 0.9959 0.9844 0.9533 0.8824 0.7379 0.4686 0.2649 0.0585 
7 0 0.4783 0.2097 0.0824 0.0280 0.0078 0.0016 0.0002 0.0000 0.0000 0.0000 0.0000 
1 0.8503 0.5767 0.3294 0.1586 0.0625 0.0188 0.0038 0.0004 0.0000 0.0000 0.0000 
2 0.9743 0.8520 0.6471 0.4199 0.2266 0.0963 0.0288 0.0047 0.0002 0.0000 0.0000 
3. 0.9973 0.9667 0.8740 0.7102 0.5000 0.2898 0.1260 0.0333 0.0027 0.0002 0.0000 
4 0.9998 0.9953 0.9712 0.9037 0.7734 0.5801 0.3529 = 0.1480 0.0257 0.0038 0.0000 
5 1.0000 0.9996 0.9962 0.9812 0.9375 0.8414 0.6706 0.4233 0.1497 0.0444 0.0020 
6 1.0000 1.0000 0.9998 0.9984 0.9922 0.9720 0.9176 0.7903 0.5217 0.3017 0.0679 
8 0 0.4305 0.1678 0.0576 0.0168 0.0039 0.0007 0.0001 0.0000 0.0000 0.0000 0.0000 
1 0.8131 0.5033 0.2553 0.1064 0.0352 0.0085 0.0013. 0.0001 0.0000 0.0000 0.0000 
2 0.9619 0.7969 0.5518 0.3154 0.1445 0.0498 0.0113 0.0012 0.0000 0.0000 0.0000 
3. 0.9950 0.9437 0.8059 0.5941 0.3633 0.1737 0.0580 0.0104 0.0004 0.0000 0.0000 
4 0.9996 0.9896 0.9420 0.8263 0.6367 0.4059 0.1941 0.0563 0.0050 0.0004 0.0000 
5 1.0000 0.9988 0.9887 0.9502 0.8555 0.6846 0.4482 0.2031 0.0381 0.0058 0.0001 
6 1.0000 0.9999 0.9987 0.9915 0.9648 0.8936 0.7447 0.4967 0.1869 0.0572 0.0027 
7 1.0000 1.0000 0.9999 0.9993 0.9961 0.9832 0.9424 0.8322 0.5695 0.3366 0.0773 
9 0 0.3874 0.1342 0.0404 0.0101 0.0020 0.0003 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.7748 0.4362 0.1960 0.0705 0.0195 0.0038 0.0004 0.0000 0.0000 0.0000 0.0000 
2 0.9470 0.7382 0.4628 0.2318 0.0889 0.0250 0.0043. 0.0003 0.0000 0.0000 0.0000 
3. 0.9917 0.9144 0.7297 0.4826 0.2539 0.0994 0.0253 0.0031 0.0001 0.0000 0.0000 
4 0.9991 0.9804 0.9012 0.7334 0.5000 0.2666 0.0988 0.0196 0.0009 0.0000 0.0000 
5 0.9999 0.9969 0.9747 0.9006 0.7461 0.5174 0.2703 0.0856 0.0083 0.0006 0.0000 
6 1.0000 0.9997 0.9957 0.9750 0.9102 0.7682 0.5372 0.2618 0.0530 0.0084 0.0001 
T 1.0000 1.0000 0.9996 0.9962 0.9805 0.9295 0.8040 0.5638 0.2252 0.0712 0.0034 
8 1.0000 1.0000 1.0000 0.9997 0.9980 0.9899 0.9596 0.8658 0.6126 0.3698 0.0865 
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Table II Cumulative Binomial Probabilities P(X < x) (continued) 


P 
n Xx 0.1 0.2 0.3 0.4 0.5, 0.6 0.7 0.8 0.9 0.95 0.99 
10 O 0.3487 0.1074 0.0282 0.0060 0.0010 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.7361 0.3758 0.1493 0.0464 0.0107 0.0017 0.0001 0.0000 0.0000 0.0000 0.0000 
2 0.9298 0.6778 0.3828 0.1673 0.0547 0.0123 0.0016 0.0001 0.0000 0.0000 0.0000 
3 0.9872 0.8791 0.6496 0.3823 0.1719 0.0548 0.0106 0.0009 0.0000 0.0000 0.0000 
4 0.9984 0.9672 0.8497 0.6331 0.3770 0.1662 0.0473 0.0064 0.0001 0.0000 0.0000 
5 0.9999 0.9936 0.9527 0.8338 0.6230 0.3669 0.1503 0.0328 0.0016 0.0001 0.0000 
6 1.0000 0.9991 0.9894 0.9452 0.8281 0.6177 0.3504 0.1209 0.0128 0.0010 0.0000 
7 1.0000 0.9999 0.9984 0.9877 0.9453 0.8327 0.6172 0.3222 0.0702 0.0115 0.0001 
8 1.0000 1.0000 0.9999 0.9983 0.9893 0.9536 0.8507 0.6242 0.2639 0.0861 0.0043 
9 1.0000 1.0000 1.0000 0.9999 0.9990 0.9940 0.9718 0.8926 0.6513 0.4013 0.0956 
11 O 0.3138 0.0859 0.0198 0.0036 0.0005 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.6974 0.3221 0.1130 0.0302 0.0059 0.0007 0.0000 0.0000 0.0000 0.0000 0.0000 
2 0.9104 0.6174 0.3127 0.1189 0.0327 0.0059 0.0006 0.0000 0.0000 0.0000 0.0000 
3 0.9815 0.8389 0.5696 0.2963 0.1133: 0.0293 0.0043 0.0002 0.0000 0.0000 0.0000 
4 0.9972 0.9496 0.7897 0.5328 0.2744 0.0994 0.0216 0.0020 0.0000 0.0000 0.0000 
5 0.9997 0.9883 0.9218 0.7535 0.5000 0.2465 0.0782 0.0117 0.0003 0.0000 0.0000 
6 1.0000 0.9980 0.9784 0.9006 0.7256 0.4672 0.2103 0.0504 0.0028 0.0001 0.0000 
7 1.0000 0.9998 0.9957 0.9707 0.8867 0.7037 0.4304 0.1611 0.0185 0.0016 0.0000 
8 1.0000 1.0000 0.9994 0.9941 0.9673 0.8811 0.6873 0.3826 0.0896 0.0152 0.0002 
9 1.0000 1.0000 1.0000 0.9993 0.9941 0.9698 0.8870 0.6779 0.3026 0.1019 0.0052 
10 1.0000 1.0000 1.0000 1.0000 0.9995 0.9964 0.9802 0.9141 0.6862 0.4312 0.1047 
12 O 0.2824 0.0687 0.0138 0.0022 0.0002 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.6590 0.2749 0.0850 0.0196 0.0032 0.0003 0.0000 0.0000 0.0000 0.0000 0.0000 
2 0.8891 0.5583 0.2528 0.0834 ~=0.0193 = 0.0028 ~=—0.0002~—-: 0.0000 +=0.0000 0.0000 0.0000 
3 0.9744 0.7946 0.4925 0.2253 0.0730 ~—:0.0153 0.0017 0.0001 0.0000 0.0000 0.0000 
4 0.9957 0.9274 0.7237 0.4382 0.1938 0.0573 0.0095 0.0006 0.0000 0.0000 0.0000 
5 0.9995 0.9806 0.8822 0.6652 0.3872 0.1582 0.0386 0.0039 0.0001 0.0000 0.0000 
6 0.9999 0.9961 0.9614 0.8418 0.6128 0.3348 0.1178 0.0194 0.0005 0.0000 0.0000 
7 ~~ 1.0000 0.9994 0.9905 0.9427 0.8062 0.5618 0.2763 0.0726 0.0043 0.0002 0.0000 
8 1.0000 0.9999 0.9983 0.9847 0.9270 0.7747 0.5075 0.2054 0.0256 0.0022 0.0000 
9 1.0000 1.0000 0.9998 0.9972 0.9807 0.9166 0.7472 0.4417 0.1109 0.0196 0.0002 
10 1.0000 1.0000 1.0000 0.9997 0.9968 0.9804 0.9150 0.7251 0.3410 0.1184 0.0062 
11 = =1.0000 1.0000 1.0000 1.0000 0.9998 0.9978 0.9862 0.9313 0.7176 0.4596 0.1136 
13 O 0.2542 0.0550 0.0097 0.0013 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.6213 0.2336 0.0637 0.0126 0.0017 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 
2 0.8661 0.5017 0.2025 0.0579 0.0112 0.0013 0.0001 0.0000 0.0000 0.0000 0.0000 
3 0.9658 0.7473 0.4206 0.1686 0.0461 0.0078 0.0007 0.0000 0.0000 0.0000 0.0000 
4 0.9935 0.9009 0.6543 0.3530 0.1334 0.0321 0.0040 0.0002 0.0000 0.0000 0.0000 
5 0.9991 0.9700 0.8346 0.5744 0.2905 0.0977 0.0182 0.0012 0.0000 0.0000 0.0000 
6 0.9999 0.9930 0.9376 0.7712 0.5000 0.2288 0.0624 0.0070 0.0001 0.0000 0.0000 
7 1.0000 0.9988 0.9818 0.9023 0.7095 0.4256 0.1654 0.0300 0.0009 0.0000 0.0000 
8 1.0000 0.9988 0.9960 0.9679 0.8666 0.6470 0.3457 0.0991 0.0065 0.0003 0.0000 
9 1.0000 1.0000 0.9993 0.9922 0.9539 0.8314 0.5794 0.2527 0.0342 0.0031 0.0000 
10. 1.0000 1.0000 0.9999 0.9987 0.9888 0.9421 0.7975 0.4983 0.1339 =0.0245 0.0003 
11 = =1.0000 1.0000 1.0000 0.9999 0.9983 0.9874 0.9363 0.7664 0.3787 0.1354 0.0072 
12 1.0000 1.0000 1.0000 1.0000 0.9999 0.9987 0.9903 0.9450 0.7458 0.4867 0.1225 


Table II Cumulative Binomial Probabilities P(X < x) (continued) 


P 
n Xx 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 
14 OO 0.2288 0.0440 0.0068 0.0008 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.5846 0.1979 0.0475 0.0081 0.0009 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 
2 0.8416 0.4481 0.1608 0.0398 0.0065 0.0006 0.0000 0.0000 0.0000 0.0000 0.0000 
3 0.9559 0.6982 0.3552 0.1243 -—-0.0287 =: 0.0039 =: 0.0002. —s: 0.0000 =0.0000 + 0.0000 0.0000 
4 0.9908 0.8702 0.5842 0.2793 0.0898 0.0175 0.0017. 0.0000 0.0000 0.0000 0.0000 
5 0.9985 0.9561 0.7805 0.4859 0.2120 0.0583 0.0083 0.0004 0.0000 0.0000 0.0000 
6 0.9998 0.9884 0.9067 0.6925 0.3953 0.1501 0.0315 0.0024 0.0000 0.0000 0.0000 
7 1.0000 0.9976 0.9685 0.8499 0.6047 0.3075 0.0933 0.0116 0.0002 0.0000 0.0000 
8 1.0000 0.9996 0.9917 0.9417 0.7880 0.5141 0.2195 0.0439 0.0015 0.0000 0.0000 
9 1.0000 1.0000 0.9983 0.9825 0.9102 0.7207 0.4158 0.1298 0.0092 0.0004 0.0000 
10 1.0000 1.0000 0.9998 0.9961 0.9713 0.8757 0.6448 0.3018 0.0441 0.0042 0.0000 
11 1.0000 1.0000 1.0000 0.9994 0.9935 0.9602 0.8392 0.5519 0.1584 0.0301 0.0003 
12 1.0000 1.0000 1.0000 0.9999 0.9991 0.9919 0.9525 0.8021 0.4154 0.1530 0.0084 
13 1.0000 1.0000 1.0000 1.0000 0.9999 0.9992 0.9932 0.9560 0.7712 0.5123 0.1313 
15 0 0.2059 0.0352 0.0047 0.0005 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.5490 0.1671 0.0353 0.0052 0.0005 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
2 0.8159 0.3980 0.1268 0.0271 0.0037 0.0003 0.0000 0.0000 0.0000 0.0000 0.0000 
3 0.9444 0.6482 0.2969 0.0905 0.0176 0.0019 0.0001 0.0000 0.0000 0.0000 0.0000 
4 0.9873 0.8358 0.5155 0.2173 0.0592 0.0093 0.0007 0.0000 0.0000 0.0000 0.0000 
5 0.9978 0.9389 0.7216 0.4032 0.1509 0.0338 0.0037 0.0001 0.0000 0.0000 0.0000 
6 0.9997 0.9819 0.8689 0.6098 0.3036 0.0950 0.0152 0.0008 0.0000 0.0000 0.0000 
7 1.0000 0.9958 0.9500 0.7869 0.5000 0.2131 0.0500 0.0042 0.0000 0.0000 0.0000 
8 1.0000 0.9992 0.9848 0.9050 0.6964 0.3902 0.1311 0.0181 0.0003 0.0000 0.0000 
9 1.0000 0.9999 0.9963 0.9662 0.8491 0.5968 0.2784 0.0611 0.0022 0.0001 0.0000 
10 1.0000 1.0000 0.9993 0.9907 0.9408 0.7827 0.4845 0.1642 0.0127 0.0006 0.0000 
11. 1.0000 1.0000 0.9999 0.9981 0.9824 0.9095 0.7031 0.3518 0.0556 0.0055 0.0000 
12 1.0000 1.0000 1.0000 0.9997 0.9963 0.9729 0.8732 0.6020 01841 0.0362 0.0004 
13, 1.0000 1.0000 1.0000 1.0000 0.9995 0.9948 0.9647 0.8329 0.4510 0.1710 0.0096 
14 1.0000 1.0000 1.0000 1.0000 1.0000 0.9995 0.9953 0.9648 0.7941 0.5367 0.1399 
20 O 0.1216 0.0115 0.0008 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 0.3917 0.0692 0.0076 0.0005 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
2 0.6769 0.2061 0.0355 0.0036 0.0002 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
3 0.8670 0.4114 0.1071 0.0160 0.0013 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
4 0.9568 0.6296 0.2375 0.0510 0.0059 0.0003 0.0000 0.0000 0.0000 0.0000 0.0000 
5 0.9887 0.8042 0.4164 0.1256 0.0207 0.0016 0.0000 0.0000 0.0000 0.0000 0.0000 
6 0.9976 0.9133 0.6080 0.2500 0.0577 0.0065 0.0003 0.0000 0.0000 0.0000 0.0000 
7 0.9996 0.9679 0.7723 0.4159 0.1316 0.0210 0.0013 0.0000 0.0000 0.0000 0.0000 
8 0.9999 0.9900 0.8867 0.5956 0.2517 0.0565 0.0051 0.0001 0.0000 0.0000 0.0000 
9 1.0000 0.9974 0.9520 0.7553 0.4119 0.1275 0.0171 0.0006 0.0000 0.0000 0.0000 
10 1.0000 0.9994 0.9829 0.8725 0.5881 0.2447 0.0480 0.0026 0.0000 0.0000 0.0000 
11. 1.0000 0.9999 0.9949 0.9435 0.7483 0.4044 0.1133 0.0100 0.0001 0.0000 0.0000 
12. 1.0000 1.0000 0.9987 0.9790 0.8684 0.5841 0.2277 ~=—0.0321 0.0004 0.0000 0.0000 
13 1.0000 1.0000 0.9997 0.9935 0.9423 0.7500 0.3920 0.0867 0.0024 0.0000 0.0000 
14 1.0000 1.0000 1.0000 0.9984 0.9793 0.8744 0.5836 0.1958 0.0113 0.0003 0.0000 
15 1.0000 1.0000 1.0000 0.9997 0.9941 0.9490 0.7625 0.3704 0.0432 0.0026 0.0000 
16 1.0000 1.0000 1.0000 1.0000 0.9987 0.9840 0.8929 0.5886 0.1330 0.0159 0.0000 
17. 1.0000 1.0000 1.0000 1.0000 0.9998 0.9964 0.9645 0.7939 0.3231 0.0755 0.0010 
18 1.0000 1.0000 1.0000 1.0000 1.0000 0.9995 0.9924 0.9308 0.6083 0.2642 0.0169 
19 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 0.9992 0.9885 0.8784 0.6415 0.1821 
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Pz) = P(Z Sz) = | e? du 
_» V2T 
® (z) 
z 0 
Table III Cumulative Standard Normal Distribution 

& —0.09 —0.08 —0.07 —0.06 —0.05 —0.04 —0.03 —0.02 —0.01 —0.00 
—3.9 0.000033 0.000034 0.000036 0.000037 0.000039 0.000041 0.000042 0.000044 0.000046 0.000048 
—3.8 0.000050 0.000052 0.000054 0.000057 0.000059 0.000062 0.000064 0.000067 0.000069 0.000072 
—3.7 0.000075 0.000078 0.000082 0.000085 0.000088 0.000092 0.000096 0.000100 0.000104 0.000108 
—3.6 0.000112 0.000117 0.000121 0.000126 0.000131 0.000136 0.000142 0.000147 0.000153 0.000159 
—3.5 0.000165 0.000172 0.000179 0.000185 0.000193 0.000200 0.000208 0.000216 0.000224 0.000233 
—3.4 0.000242 0.000251 0.000260 0.000270 0.000280 0.000291 0.000302 0.000313 0.000325 0.000337 
—3.3 0.000350 0.000362 0.000376 0.000390 0.000404 0.000419 0.000434 0.000450 0.000467 0.000483 
—3.2 0.000501 0.000519 0.000538 0.000557 0.000577 0.000598 0.000619 0.000641 0.000664 0.000687 
—3.1 0.000711 0.000736 0.000762 0.000789 0.000816 0.000845 0.000874 0.000904 0.000935 0.000968 
—3.0 0.001001 0.001035 0.001070 0.001107 0.001144 0.001183 0.001223 0.001264 0.001306 0.001350 
—2.9 0.001395 0.001441 0.001489 0.001538 0.001589 0.001641 0.001695 0.001750 0.001807 0.001866 
—2.8 0.001926 0.001988 0.002052 0.002118 0.002186 0.002256 0.002327 0.002401 0.002477 0.002555 
—2.7 0.002635 0.002718 0.002803 0.002890 0.002980 0.003072 0.003167 0.003264 0.003364 0.003467 
—2.6 0.003573 0.003681 0.003793 0.003907 0.004025 0.004145 0.004269 0.004396 0.004527 0.004661 
—2.5 0.004799 0.004940 0.005085 0.005234 0.005386 0.005543 0.005703 0.005868 0.006037 0.006210 
—2.4 0.006387 0.006569 0.006756 0.006947 0.007143 0.007344 0.007549 0.007760 0.007976 0.008198 
—2.3 0.008424 0.008656 0.008894 0.009137 0.009387 0.009642 0.009903 0.010170 0.010444 0.010724 
—2.2 0.011011 0.011304 0.011604 0.011911 0.012224 0.012545 0.012874 0.013209 0.013553 0.013903 
—2.1 0.014262 0.014629 0.015003 0.015386 0.015778 0.016177 0.016586 0.017003 0.017429 0.017864 
—2.0 0.018309 0.018763 0.019226 0.019699 0.020182 0.020675 0.021178 0.021692 0.022216 0.022750 
-1.9 0.023295 0.023852 0.024419 0.024998 0.025588 0.026190 0.026803 0.027429 0.028067 0.028717 
—1.8 0.029379 0.030054 0.030742 0.031443 0.032157 0.032884 0.033625 0.034379 0.035148 0.035930 
—-1.7 0.036727 0.037538 0.038364 0.039204 0.040059 0.040929 0.041815 0.042716 0.043633 0.044565 
—1.6 0.045514 0.046479 0.047460 0.048457 0.049471 0.050503 0.051551 0.052616 0.053699 0.054799 
-1.5 0.055917 0.057053 0.058208 0.059380 0.060571 0.061780 0.063008 0.064256 0.065522 0.066807 
—-14 0.068112 0.069437 0.070781 0.072145 0.073529 0.074934 0.076359 0.077804 0.079270 0.080757 
—-1.3 0.082264 0.083793 0.085343 0.086915 0.088508 0.090123 0.091759 0.093418 0.095098 0.096801 
—1.2 0.098525 0.100273 0.102042 0.103835 0.105650 0.107488 0.109349 0.111233 0.113140 0.115070 
—1.1 0.117023 0.119000 0.121001 0.123024 0.125072 0.127143 0.129238 0.131357 0.133500 0.135666 
—-1.0 0.137857 0.140071 0.142310 0.144572 0.146859 0.149170 0.151505 0.153864 0.156248 0.158655 
—0.9 0.161087 0.163543 0.166023 0.168528 0.171056 0.173609 0.176185 0.178786 0.181411 0.184060 
—0.8 0.186733 0.189430 0.192150 0.194894 0.197662 0.200454 0.203269 0.206108 0.208970 0.211855 
—0.7 0.214764 0.217695 0.220650 0.223627 0.226627 0.229650 0.232695 0.235762 0.238852 0.241964 
—0.6 0.245097 0.248252 0.251429 0.254627 0.257846 0.261086 0.264347 0.267629 0.270931 0.274253 
—0.5 0.277595 0.280957 = 0.284339 = 0.287740 = 0.291160 3=— 0.294599 =: 0.298056 = 0.301532 = 0.305026 ~=—- 0.308538 
—0.4 0.312067 0.315614 0.319178 0.322758 0.326355 0.329969 0.333598 0.337243 0.340903 0.344578 
—0.3 0.348268 0.351973 0.355691 0.359424 0.363169 0.366928 0.370700 0.374484 0.378281 0.382089 
—0.2 0.385908 0.389739 0.393580 0.397432 0.401294 0.405165 0.409046 0.412936 0.416834 0.420740 
—0.1 0.424655 0.428576 0.432505 0.436441 0.440382 0.444330 0.448283 0.452242 0.456205 0.460172 

0.0 0.464144 0.468119 0.472097 0.476078 0.480061 0.484047 0.488033 0.492022 0.496011 0.500000 
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a 


0.00 


0.01 


0.02 


0.03 


0.04 


0.05 


0.06 


0.07 


0.08 


0.09 


0.0 
0.1 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
1.1 
1.2 
1.3 
1.4 
1.5 
1.6 
1.7 
1.8 
1.9 
2.0 
2.1 
DP) 
2.3 
2.4 
2.5 
2.6 
2.7 
2.8 
2.9 
3.0 
3.1 
3) 
3.3 
3.4 
3:5 
3.6 
3.7 
3.8 
3.9 


0.500000 
0.539828 
0.579260 
0.617911 
0.655422 
0.691462 
0.725747 
0.758036 
0.788145 
0.815940 
0.841345 
0.864334 
0.884930 
0.903199 
0.919243 
0.933193 
0.945201 
0.955435 
0.964070 
0.971283 
0.977250 
0.982136 
0.986097 
0.989276 
0.991802 
0.993790 
0.995339 
0.996533 
0.997445 
0.998134 
0.998650 
0.999032 
0.999313 
0.999517 
0.999663 
0.999767 
0.999841 
0.999892 
0.999928 
0.999952 


0.503989 
0.543795 
0.583166 
0.621719 
0.659097 
0.694974 
0.729069 
0.761148 
0.791030 
0.818589 
0.843752 
0.866500 
0.886860 
0.904902 
0.920730 
0.934478 
0.946301 
0.956367 
0.964852 
0.971933 
0.977784 
0.982571 
0.986447 
0.989556 
0.992024 
0.993963 
0.995473 
0.996636 
0.997523 
0.998193 
0.998694 
0.999065 
0.999336 
0.999533 
0.999675 
0.999776 
0.999847 
0.999896 
0.999931 
0.999954 


0.507978 
0.547758 
0.587064 
0.625516 
0.662757 
0.698468 
0.732371 
0.764238 
0.793892 
0.821214 
0.846136 
0.868643 
0.888767 
0.906582 
0.922196 
0.935744 
0.947384 
0.957284 
0.965621 
0.972571 
0.978308 
0.982997 
0.986791 
0.989830 
0.992240 
0.994132 
0.995604 
0.996736 
0.997599 
0.998250 
0.998736 
0.999096 
0.999359 
0.999550 
0.999687 
0.999784 
0.999853 
0.999900 
0.999933 
0.999956 


0.511967 
0.551717 
0.590954 
0.629300 
0.666402 
0.701944 
0.735653 
0.767305 
0.796731 
0.823815 
0.848495 
0.870762 
0.890651 
0.908241 
0.923641 
0.936992 
0.948449 
0.958185 
0.966375 
0.973197 
0.978822 
0.983414 
0.987126 
0.990097 
0.992451 
0.994297 
0.995731 
0.996833 
0.997673 
0.998305 
0.998777 
0.999126 
0.999381 
0.999566 
0.999698 
0.999792 
0.999858 
0.999904 
0.999936 
0.999958 


0.515953 
0.555760 
0.594835 
0.633072 
0.670031 
0.705401 
0.738914 
0.770350 
0.799546 
0.826391 
0.850830 
0.872857 
0.892512 
0.909877 
0.925066 
0.938220 
0.949497 
0.959071 
0.967116 
0.973810 
0.979325 
0.983823 
0.987455 
0.990358 
0.992656 
0.994457 
0.995855 
0.996928 
0.997744 
0.998359 
0.998817 
0.999155 
0.999402 
0.999581 
0.999709 
0.999800 
0.999864 
0.999908 
0.999938 
0.999959 


0.519939 
0.559618 
0.598706 
0.636831 
0.673645 
0.708840 
0.742154 
0.773373 
0.802338 
0.828944 
0.853141 
0.874928 
0.894350 
0.911492 
0.926471 
0.939429 
0.950529 
0.959941 
0.967843 
0.974412 
0.979818 
0.984222 
0.987776 
0.990613 
0.992857 
0.994614 
0.995975 
0.997020 
0.997814 
0.998411 
0.998856 
0.999184 
0.999423 
0.999596 
0.999720 
0.999807 
0.999869 
0.999912 
0.999941 
0.999961 


0.532922 
0.563559 
0.602568 
0.640576 
0.677242 
0.712260 
0.745373 
0.776373 
0.805106 
0.831472 
0.855428 
0.876976 
0.896165 
0.913085 
0.927855 
0.940620 
0.951543 
0.960796 
0.968557 
0.975002 
0.980301 
0.984614 
0.988089 
0.990863 
0.993053 
0.994766 
0.996093 
0.997110 
0.997882 
0.998462 
0.998893 
0.999211 
0.999443 
0.999610 
0.999730 
0.999815 
0.999874 
0.999915 
0.999943 
0.999963 


0.527903 
0.567495 
0.606420 
0.644309 
0.680822 
0.715661 
0.748571 
0.779350 
0.807850 
0.833977 
0.857690 
0.878999 
0.897958 
0.914657 
0.929219 
0.941792 
0.952540 
0.961636 
0.969258 
0.975581 
0.980774 
0.984997 
0.988396 
0.991106 
0.993244 
0.994915 
0.996207 
0.997197 
0.997948 
0.998511 
0.998930 
0.999238 
0.999462 
0.999624 
0.999740 
0.999821 
0.999879 
0.999918 
0.999946 
0.999964 


0.531881 
0.571424 
0.610261 
0.648027 
0.684386 
0.719043 
0.751748 
0.782305 
0.810570 
0.836457 
0.859929 
0.881000 
0.899727 
0.916207 
0.930563 
0.942947 
0.953521 
0.962462 
0.969946 
0.976148 
0.981237 
0.985371 
0.988696 
0.991344 
0.993431 
0.995060 
0.996319 
0.997282 
0.998012 
0.998559 
0.998965 
0.999264 
0.999481 
0.999638 
0.999749 
0.999828 
0.999883 
0.999922 
0.999948 
0.999966 


0.535856 
0.575345 
0.614092 
0.651732 
0.687933 
0.722405 
0.754903 
0.785236 
0.813267 
0.838913 
0.862143 
0.882977 
0.901475 
0.917736 
0.931888 
0.944083 
0.954486 
0.963273 
0.970621 
0.976705 
0.981691 
0.985738 
0.988989 
0.991576 
0.993613 
0.995201 
0.996427 
0.997365 
0.998074 
0.998605 
0.998999 
0.999289 
0.999499 
0.999650 
0.999758 
0.999835 
0.999888 
0.999925 
0.999950 
0.999967 
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Table IV Percentage Points x2, of the Chi-Squared Distribution 


Qa 
v ERB) .990 SIS) .950 .900 500 .100 .050 025 .010 .005 
1 00+ 00+ 00+ 00+ 02 AS 271) 3.84 5.02 6.63 7.88 
2 .O1 02 05 10 21 1.39 4.61 5.99 7.38 9.21 10.60 
3 07 ll 2D 5) 58 237 6.25 7.81 9.35, 11.34 12.84 
4 21 30 48 71 1.06 3.36 7.78 9.49 11.14 13.28 14.86 
2) 41 oS) 83 1.15 1.61 4.35 9.24 11.07 12.83 15.09 16.75 
6 .68 87 1.24 1.64 2.20 5.35 10.65 12.59 14.45 16.81 18.55 
7 99 1.24 1.69 DENG, 2.83 6.35 12.02 14.07 16.01 18.48 20.28 
8 1.34 1.65 2.18 2013 3.49 7.34 13.36 15.51 17.53 20.09 21.96 
9 173} 2.09 2.70 sh3)3) 4.17 8.34 14.68 16.92 19.02 21.67 23.59 
10 2.16 2.56 3.25 3.94 4.87 9.34 15.99 18.31 20.48 23.21 25.19 
11 2.60 3.05 3.82 4.57 5.58 10.34 17.28 19.68 21.92 24.72 26.76 
12 3.07 3.57 4.40 9:23 6.30 = 11.34 18.55 21.03 23.34 26.22 28.30 
13 3h)// 4.11 5.01 5.89 7.04 = 12.34 19.81 22.36 24.74 27.69 29.82 
14 4.07 4.66 5.63 6.57 7.79 13.34 21.06 23.68 26.12 29.14 31.32 
15 4.60 D3) 6.27 7.26 8.55 14.34 22.31 25.00 27.49 30.58 32.80 
16 5.14 5.81 6.91 7.96 9.31 15.34 23.54 26.30 28.85 32.00 34.27 
17 5.70 6.41 7.56 8.67 10.09 16.34 24.77 2759 30.19 33.41 Bo 
18 6.26 7.01 8.23 9.39 10.87 17.34 25.99 28.87 31.53 34.81 37.16 
19 6.84 7.63 8.91 10.12 11.65 = 18.34 27.20 30.14 32.85 36.19 38.58 
20 7.43 8.26 9.59 10.85 12.44 19.34 28.41 31.41 34.17 37.57 40.00 
21 8.03 8.90 10.28 11.59 13.24 20.34 29.62 32.67 35.48 38.93 41.40 
22 8.64 9.54 10.98 12.34 14.04 21.34 30.81 33.92 36.78 40.29 42.80 
23 9.26 10.20 11.69 13.09 14.85 22.34 32.01 35.17 38.08 41.64 44.18 
24 9.89 10.86 12.40 13.85 15.66 23.34 33.20 36.42 39.36 42.98 45.56 
25) 10.52 11.52 13.12 14.61 16.47 24.34 34.28 37.65 40.65 44.31 46.93 
26 11.16 12.20 13.84 15.38 17.29 25.34 35.56 38.89 41.92 45.64 48.29 
27. 11.81 12.88 14.57 16.15 18.11 26.34 36.74 40.11 43.19 46.96 49.65 
28 12.46 13.57 15.31 16.93 18.94 27.34 37.92 41.34 44.46 48.28 50.99 
29 13.12 14.26 16.05 17.71 19.77 = 28.34 39.09 42.56 45.72 49.59 52.34 
30 13.79 14.95 16.79 18.49 20.60 29.34 40.26 43.77 46.98 50.89 53.67 
40 20.71 22.16 24.43 26.51 29.05 39.34 51.81 55.76 59.34 63.69 66.77 
50 27.99 29.71 32.36 34.76 37.69 49.33 63.17 67.50 71.42 76.15 79.49 
60 3phos) 37.48 40.48 43.19 46.46 59.33 7440 79.08 83.30 88.38 91.95 
70 43.28 45.44 48.76 51.74 55.33 69.33 85.53 90.53 95.02 100.42 104.22 
80 51.17 53.54 57.15 60.39 64.28 79.33 96.58 101.88 106.63 112.33 116.32 
90 59.20 61.75 65.65 69.13 73.29 89.33, 107.57) 113.14 118.14 124.12 128.30 
100 67.33 70.06 74.22 VILE 82.36 99.33 118.50 124.34 12956 135.81 140.17 


v = degrees of freedom. 
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a 
ta, Vv 
Table V_ Percentage Points t,,, of the t Distribution 
a 

Bd .40 sy) .10 AO) 025 01 005 .0025 001 .0005 

1 25) 1.000 3.078 6.314 12.706 31.821 63.657 127.32 318.31 636.62 
2 .289 816 1.886 2.920 4.303 6.965 9.925 14.089 23.326 31.598 
3 UY .765 1.638 2.353 3.182 4.541 5.841 7.453 10.213 12.924 
4 271 741 1.533 2.132 2.776 3.747 4.604 5.598 TA73 8.610 
5 .267 , 1.476 2.015 2.571 3.365 4.032 4.773 5.893 6.869 
6 .265 .718 1.440 1.943 2.447 3.143 3.707 4.317 5.208 5.959 
7 263 711 1.415 1.895 2.365 2.998 3.499 4.029 4.785 5.408 
8 .262 .706 1.397 1.860 2.306 2.896 3.355 3.833 4.501 5.041 
9 261 .703 1.383 1.833 2.262 2.821 3.250 3.690 4.297 4.781 
10 .260 .700 1.372 1.812 2.228 2.764 3.169 3.581 4.144 4.587 
11 .260 .697 1.363 1.796 2.201 2.718 3.106 3.497 4.025 4.437 
12 259 .695 1.356 1.782 2.179 2.681 3.055 3.428 3.930 4.318 
13 Pa) .694 1.350 1.771 2.160 2.650 3.012 3.372 3.852 4.221 
14 258 .692 1.345 1.761 2.145 2.624 2.977 3.326 3.787 4.140 
15 258 .691 1.341 1.753 2.131 2.602 2.947 3.286 3.733 4.073 
16 258 .690 1.337 1.746 2.120 2.583 2.921 3.252 3.686 4.015 
17 PT .689 1.333 1.740 2.110 2.567 2.898 3.222 3.646 3.965 
18 257 .688 1.330 1.734 2.101 2.552 2.878 3.197 3.610 3.922 
19 i .688 1.328 1.729 2.093 2.539 2.861 3.174 3.579 3.883 
20 257 .687 1.325 1.725 2.086 2.528 2.845 3.153 3.552 3.850 
21 PT .686 1.323 1.721 2.080 2.518 2.831 3.135 3.527 3.819 
22 256 .686 1.321 1.717 2.074 2.508 2.819 3.119 3.505 3.792 
23 256 .685 1.319 1.714 2.069 2.500 2.807 3.104 3.485 3.767 
24 256 .685 1.318 1.711 2.064 2.492 2.797 3.091 3.467 3.745 
aS) 256 .684 1.316 1.708 2.060 2.485 Dao, 3.078 3.450 3.725 
26 256 .684 1.315 1.706 2.056 2.479 2.779 3.067 3.435 3.707 
a 256 .684 1.314 1.703 2.052 2.473 DATA 3.057 3.421 3.690 
28 256 .683 1.313 1.701 2.048 2.467 2.763 3.047 3.408 3.674 
29 256 .683 1.311 1.699 2.045 2.462 2.756 3.038 3.396 3.659 
30 .256 .683 1.310 1.697 2.042 2.457 2.750 3.030 3.385 3.646 
40 255 .681 1.303 1.684 2.021 2.423 2.704 2.971 3.307 3.551 
60 254 .679 1.296 1.671 2.000 2.390 2.660 2.915 3.232 3.460 
120 254 .677 1.289 1.658 1.980 2.358 2.617 2.860 3.160 3.373 
© 253 .674 1.282 1.645 1.960 2.326 2.576 2.807 3.090 3.291 


v = degrees of freedom. 
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Chart VII Operating Characteristic Curves 
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(a) O.C. curves for different values of n for the two-sided normal test for a level of significance a = 0.05. 
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(b) O.C. curves for different values of n for the two-sided normal test for a level of significance a = 0.01. 


Source: Charts VIa, e, f, k, m, and g are reproduced with permission from “Operating Characteristics for 
the Common Statistical Tests of Significance,” by C. L. Ferris, E E. Grubbs, and C. L. Weaver, Annals of 
Mathematical Statistics, June 1946. 

Charts VIB, c, d, g, h, i, j, | n, 0, p, and r are reproduced with permission from Engineering Statistics, 2nd 
Edition, by A. H. Bowker and G. J. Lieberman, Prentice-Hall, 1972. 


718 APPENDIX A STATISTICAL TABLES AND CHARTS 


Chart VII Operating Characteristic Curves (continued) 
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(c) O.C. curves for different values of n for the one-sided normal test for a level of significance a = 0.05. 
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(d) O.C. curves for different values of n for the one-sided normal test for a level of significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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(e) O.C. curves for different values of n for the two-sided t-test for a level of significance a = 0.05. 
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(f) O.C. curves for different values of n for the two-sided t-test for a level of significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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(g) O.C. curves for different values of n for the one-sided t-test for a level of significance a = 0.05. 
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(h) O.C. curves for different values of n for the one-sided t-test for a level of significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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Xd 
(i) O.C. curves for different values of n for the two-sided chi-square test for a level of significance a = 0.05. 
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A 
(j) O.C. curves for different values of n for the two-sided chi-square test for a level of significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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(k) O.C. curves for different values of n for the one-sided (upper-tail) chi-square test for a level of 
significance a = 0.05. 
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(J) O.C. curves for different values of n for the one-sided (upper-tail) chi-square test for a level of 
significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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(m) O.C. curves for different values of n for the one-sided (lower-tail) chi-square test for a level of 
significance a = 0.05. 
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(n) O.C. curves for different values of n for the one-sided (lower-tail) chi-square test for a level of 
significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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(0) O.C. curves for different values of n for the two-sided F-test for a level of significance a = 0.05. 
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(p) O.C. curves for different values of n for the two-sided F-test for a level of significance a = 0.01. 
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Chart VII Operating Characteristic Curves (continued) 
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(gq) O.C. curves for different values of n for the one-sided F-test for a level of significance a = 0.05. 
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(r) O.C. curves for different values of n for the one-sided F-test for a level of significance a = 0.01. 
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Table VIII = Critical Values for the Sign Test 


rt 
Qa 0.10 0.05 0.01 Two-sided tests | 0.10 0.05 0.01 Two-sided tests 
n 0.05 0.025 0.005 One-sided tests n 0.05 0.025 0.005 One-sided tests 
5 0 23 7 6 4 
6 0 0 24 7 6 5 
7 0 0 25 7 7 5 
8 1 0 0 26 8 7 6 
9 1 1 0 Oe 8 7 6 
10 1 1 0 28 9 8 6 
11 2} 1 0 29 9 8 7 
12 2 2 1 30 10 9 7 
13 3 2) 1 31 10 9 7 
14 3 2 1 32 10 9 8 
15 3 3 D 33 11 10 8 
16 4 3 2) 34 11 10 9 
17 4 4 2) 35 12 11 9 
18 5 4 3 36 12 11 9 
19 5 4 3 37 13 12 10 
20 5 5 3 38 13 12 10 
21 6 5 4 39 13 12 11 
22 6 5 4 40 14 13 11 
Table IX Critical Values for the Wilcoxon Signed-Rank Test 
we 
a 0.10 0.05 0.02 0.01 Two-sided tests 
n* 0.05 0.025 0.01 0.005 One-sided tests 
4 
5 0 
6 D 0 
ui 3 2 0 
8 5 3 1 0 
9 8 5 3 1 
10 10 8 5 3 
11 13 10 7 5 
12 17 13 9 wT 
13 21 17 12 9 
14 25 21 15 12 
15 30 25 19 15 
16 35 29 23 19 
17 41 34 27 23 
18 47 40 32 2g 
19 53 46 37 32 
20 60 52 43 37 
21 67 58 49 42 
22 75 65 55 48 
23 83 73 62 54 
24 91 81 69 61 
25 100 89 76 68 


*Tfn > 25,W (or W_) is approximately normally distributed with mean n(n + 1)/4 


and variance n(n + 1)(2n + 1)/24. 


Table X Critical Values for the Wilcoxon Rank-Sum Test 


Wo.05 
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12 17 26 35) 46 58 71 85 99 1s) 
13 18 27 37 48 60 73 88 103 119 
14 ig) 28 38 50 63 76 91 106 123 
15 20 29 40 52 65 79 94 110 127 
16 21 31 42 54 67 82 97 114 131 
17 21 32 43 56 70 84 100 117 135 
18 22 33 45 58 v2 87 103 121 139 
19 23 34 46 60 74 90 107 124 

20 24 3) 48 62 il 93 110 

21 25 37 50 64 79 95 

22 26 38 51 66 82 

23 27 39 53 68 

24 28 40 55 

25 28 42 

26 29 

27 

28 


137 
141 
145 
150 
154 


160 
164 185 
169 


*For n, andn, > 8, W, is approximately normally distributed with mean 3n(, + ny + 1) and variance nyn,(n, + ny + 1)/12. 


Table X Critical Values for the Wilcoxon Rank-Sum Test (continued) 


Wo01 


Ny 4 5 6 W 8 8) 10 11 12 13 14 15 
5 15 
6 10 16 23 
tl 10 17 24 32 
8 11 17 25 34 43 
g) 11 18 26 35) 45 56 
10 12 19 27 37 47 58 71 
11 12 20 28 38 49 61 74 87 
12 13 21 30 40 51 63 76 90 106 
13 14 22 31 41 33 65 W) 93 109 125 
14 14 22 32 43 54 67 81 96 112 129 147 
15 15 23 3 44 56 70 84 99 115 133 151 171 
16 15 24 34 46 58 72 86 102 119 137 155 
17 16 25 36 47 60 74 89 105 122 140 
18 16 26 37 49 62 76 92 108 125 
19 i Pil 38 50 64 78 94 111 
20 18 28 39 52 66 81 97 
21 18 29 40 23 68 83 
22 19 29 42 55 70 
23 19 30 43 57 
24 20 31 44 
25 20 Bz 
26 21 
27 
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Table XI Factors for Constructing Variables Control Charts 


Factor for Control Limits 


X Chart R Chart S Chart 
n* A, Ay dy D; Ds Cr n 
2 3.760 1.880 1.128 0 3.267 0.7979 2) 
3 2.394 1.023 1.693 0 2.575 0.8862 3 
4 1.880 .729 2.059 0 2.282 0.9213 4 
5 1.596 S77 2.326 0 2.115 0.9400 5 
6 1.410 483 2.534 0 2.004 0.9515 6 
7 1.277 419 2.704 .076 1.924 0.9594 7 
8 1.175 373 2.847 136 1.864 0.9650 8 
9 1.094 337 2.970 184 1.816 0.9693 9 
10 1.028 308 3.078 223 1.777 0.9727 10 
11 973 285 3.173 .256 1.744 0.9754 11 
12 925 .266 3.258 284 1.716 0.9776 12 
13 884 .249 3.336 308 1.692 0.9794 13 
14 848 235 3.407 329 1.671 0.9810 14 
15 816 223 3.472 348 1.652 0.9823 15 
16 788 PD 3.532 364 1.636 0.9835 16 
17 .762 .203 3.588 379 1.621 0.9845 17 
18 .738 194 3.640 .392 1.608 0.9854 18 
19 717 187 3.689 404 1.596 0.9862 19 
20 .697 .180 37/35) 414 1.586 0.9869 20 
21 .679 173 3.778 425 1.575 0.9876 21 
22 .662 .167 3.819 434 1.566 0.9882 22 
23 647 162 3.858 443 1.557 0.9887 23 
24 .632 JIS 3.895 452 1.548 0.9892 24 
25 619 153 3.931 459 1.541 0.9896 25 


*n > 25: Ay = 3/Vn where n = number of observations in sample. 
1 
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Table XII Factors for Tolerance Intervals 
Values of k for Two-Sided Intervals 
Confidence Level 
0.90 0.95 0.99 
Sample Probability of Coverage 
Size 0.90 0.95, 0.99 0.90 0.95 0.99 0.90 0.95, 0.99 

2, 15.978 18.800 24.167 32.019 37.674 48.430 160.193 188.491 242.300 

3 5.847 6.919 8.974 8.380 9.916 12.861 18.930 22.401 29.055 
4 4.166 4.943 6.440 5.369 6.370 8.299 9.398 11.150 14.527 

5 3.949 4.152 5.423 4.275 5.079 6.634 6.612 7.855 10.260 

6 3.131 3.723 4.870 3.712 4.414 STS) 5.337 6.345 8.301 

7 2.902 3.452 4.521 3.369 4.007 5.248 4.613 5.488 7.187 

8 2.743 3.264 4.278 3.136 3.732 4.891 4.147 4.936 6.468 

9 2.626 3.125 4.098 2.967 3.532 4.631 3.822 4.550 5.966 
10 2.535 3.018 3.959 2.839 3.379 4.433 3.582 4.265 5.594 
11 2.463 2.933 3.849 2.737 3.259 4.277 3.397 4.045 5.308 
12 2.404 2.863 3.758 2.655 3.162 4.150 3.250 3.870 5.079 
13 2.355 2.805 3.682 2.587 3.081 4.044 3.130 3.727 4.893 
14 2.314 2.756 3.618 Desyls) 3.012 3.955 3.029 3.608 4.737 
15 2.278 2.713 3.562 2.480 2.954 3.878 2.945 3.507 4.605 
16 2.246 2.676 3.514 2.437 2.903 3.812 2.872 3.421 4.492 
17 2.219 2.643 3.471 2.400 2.858 3.754 2.808 3.345 4.393 
18 2.194 2.614 3.433 2.366 2.819 3.702 DS} B19 4.307 
19 2.172 2.588 3.399 2.337 2.784 3.656 2.703 3.221 4.230 
20 2.152 2.564 3.368 2.310 Dey) 3.615 2.659 3.168 4.161 
21 2.135 2.543 3.340 2.286 2.723 3.577 2.620 3.121 4.100 
22) 2.118 2.524 3.315 2.264 2.697 3.543 2.584 3.078 4.044 
23 2.103 2.506 3.292 2.244 2.673 3.512 2.551 3.040 3.993 
24 2.089 2.489 3.270 DIS 2.651 3.483 D522 3.004 3.947 
25 2.077 2.474 3.251 2.208 2.631 3.457 2.494 2.972 3.904 
30 2.025 2.413 3.170 2.140 229) 3.350 2.385 2.841 3.733 
40 1.959 2.334 3.066 2.052 2.445 3.213 2.247 2.677 3.518 
50 1.916 2.284 3.001 1.996 2.379 3.126 2.162 2.576 3.385 
60 1.887 2.248 2.955 1.958 2.333 3.066 2.103 2.506 3.293 
70 1.865 DDD, 2.920 1.929 DDS) 3.021 2.060 2.454 3.225 
80 1.848 2.202 2.894 1.907 2.272 2.986 2.026 2.414 3.173 
90 1.834 2.185 2.872 1.889 2.251 2.958 1.999 2.382 3.130 
100 1.822 2.172 2.854 1.874 2.233 2.934 1.977 2.355 3.096 
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Table XII Factors for Tolerance Intervals (continued) 


Values of k for One-Sided Intervals 


Confidence Level 
0.90 0.95 0.99 
Sample Probability of Coverage 
Size 0.90 0.95 0.99 0.90 0.95 0.99 0.90 0.95 0.99 

2 10.253 13.090 18.500 20.581 26.260 37.094 103.029 131.426 185.617 
3 4.258 5.311 7.340 6.155 7.656 10.553 13.995 17.370 23.896 
4 3.188 3257 5.438 4.162 5.144 7.042 7.380 9.083 12.387 
5 2.742 3.400 4.666 3.407 4.203 5.741 5.362 6.578 8.939 
6 2.494 3.092 4.243 3.006 3.708 5.062 4.411 5.406 7.335 
7 2.333 2.894 3.972 2.755 3.399 4.642 3.859 4.728 6.412 
8 2219 2.754 3.783 2.582 3.187 4.354 3.497 4.285 5.812 
9 2.133 2.650 3.641 2.454 3.031 4.143 3.240 3.972 5.389 
10 2.066 2.568 3.532 2355) 2.911 3.981 3.048 3.738 5.074 
11 2.011 2.503 3.443 2.275 2.815 3.852 2.898 3.556 4.829 
12 1.966 2.448 3.371 2.210 2.736 3.747 Qi 3.410 4.633 
13 1.928 2.402 3.309 2.155 2.671 3.659 2.677 3.290 4.472 
14 1.895 2.363 Bi) 2.109 2.614 BSS 2593 3.189 4.337 
15 1.867 2.329 3.212 2.068 2.566 3.520 2.521 3.102 4.222 
16 1.842 Day 3.172 2.033 2.524 3.464 2.459 3.028 4.123 
17 1.819 2.272 3.137 2.002 2.486 3.414 2.405 2.963 4.037 
18 1.800 2.249 3.105 1.974 2.453 3.370 235i 2.905 3.960 
19 1.782 2.227 3.077 1.949 2.423 3.331 2.314 2.854 3.892 
20 1.765 2.028 3.052 1.926 2.396 3295) 2.276 2.808 3.832 
21 1.750 2.190 3.028 1.905 2.371 3.263 2.241 2.766 3.777 
22 1.737 2.174 3.007 1.886 2.349 3233) 2.209 ZI29 3/27 
23 1.724 2.159 2.987 1.869 2.328 3.206 2.180 2.694 3.681 
24 L772 2.145 2.969 1.853 2.309 3.181 2.154 2.662 3.640 
25 1.702 2.132 2.952 1.838 2.292 3.158 2.129 2.633 3.601 
30 1.657 2.080 2.884 levi 2.220 3.064 2.030 2.515 3.447 
40 1.598 2.010 2.793 1.697 2.125 2.941 1.902 2.364 3.249 
50 1.559 1.965 DBS) 1.646 2.065 2.862 1.821 2.269 3.125 
60 1.532 1.933 2.694 1.609 2.022 2.807 1.764 2.202 3.038 
70 1.511 1.909 2.662 1.581 1.990 2.765 22) 2.153 2.974 
80 1.495 1.890 2.638 1.559 1.964 2.733 1.688 2.114 2.924 
90 1.481 1.874 2.618 1.542 1.944 2.706 1.661 2.082 2.883 
100 1.470 1.861 2.601 1.527 1.927 2.684 1.639 2.056 2.850 


CHAPTER 2 


Section 2-1 


2-1. 


Let a, b denote a part above, 

below the specification, respec- 

tively 

S = {aaa, aab, aba, abb, baa, 
bab, bba, bbb} 

Let a denote an acceptable 

power supply 

Let f, m, c denote a supply with 

a functional, minor, or cosmetic 

error, respectively. 

S = {a,f,m, c} 

Sequences of y or n of length 24 

with 24 outcomes 

Sis the sample space of 100 pos- 

sible two digit integers. 

S = {0,1,2,..., 1£09} in ppb 

S = {1.0, 1.1, 1.2, ..., 14.0} 

S = {0,1,2,...,} in milliseconds 

c = connect, b = busy, S = 

{c, bc, bbc, bbbc, bbbbc, .. .} 

(a) S = nonnegative integers 
from 0 to the largest integer 
that can be displayed by the 
scale § = {0, 1, 2,3,...} 


2-23. 


Appendix B 


Answers to 


Selected 


Exercises 


(b) S (c) {12, 13, 14, 15} 
(d) {0,1,2,..., 11} 
(e) S 
(f) {0,1,2,...,7} 
(g) © 
(h)@ (i) {8,9,10,...} 
Let d denoted a distorted bit 
and let o denote a bit that is not 
distorted. 
dddd, dodd, oddd, oodd, 
dddo, dodo, oddo, oodo, 


(a) S= 


ddod, dood, odod, oood, 
ddoo, dooo, odoo, 0000 
(b) No, for example A, M A, = 
{dddd, dddo, ddod, ddoo} 
dddad, dodad, 
dddo, dodo, 
ddod, dood, 
ddoo, dooo 
oddd, oodd, 
oddo, oodo, 


(c) Ay = 


d) A, = 
(d) odod, oood, 


odoo, 0000 
(e) Ay MAM A3M Ag = 
{dddd} 


2-25. 


2-27. 


2-29. 


2-31. 


(f) (Ay M Ag) U (A3 Ag) = 
{dddd, dodd, dddo, oddd, 
ddod, oodd, ddoo} 

Let P denote being positive and 

let N denote being negative. The 

sample space is {PPP, PPN, 

PNP, NPP, PNN, NPN, NNP, 


NNN}. 

(a) A = {PPP} 
(b) B = {NNN} 
(c) ANB=@ 


(d) AU B = {PPP, NNN} 
(a) A'M B= 10, B’ = 10, 
AUB=92 
(a) A’ ={x|x = 72.5} 
(b) B’ = {x|x = 52.5} 
(c) AN B={x|525<x< 
72.5} 
(d) AUB = {x|x > 0} 
Let g denote a good board, m a 
board with minor defects, and j 
a board with major defects. 
(a) S = {gg, gm, gj, mg, mm, 
mj, jg, jm, jj} 
(b) S = {gg, gm, gj, mg, mm, 
mj, jg, jm} 


731 


732 


2-35. 
2-37. 
2-39. 
2-41. 


2-43. 
2-45. 
2-47. 


2-49, 
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120 

144 

14,400 

(a) 416,965,528 

(b) 113,588,800 

(c) 130,721,752 

(a) 21 (b) 2520 (c) 720 
(a) 1000 (b) 160 (c) 720 
(a) 0.416 (b) 0.712 

(c) 0.206 

0.0082 


Section 2-2 


2-51. 
2-53. 


2-55. 


2-57. 
2-59, 


2-61. 
2-63. 
2-65. 
2-67. 


2-71. 
2-73. 


900 

(a) 673 (b) 1672 (c) 6915 
(d) 8399 (b) 1578 

(a) 0.4 (b) 0.8 (c) 0.6 

(d) 1 (e) 02 

(a) 1/10 (b) 5/10 

(a) S = {1, 2, 3, 4, 5, 6, 7, 8} 
(b) 2/8 (c) 6/8 

(a) 0.83 (b) 0.85 
(1/10*)*(1/267) = 5.7 x 1078 
(a) 44+4xX34+4x3X3=52 
(b) 36/52 (c) No 

(a) 0.30 (b) 0.77. (c) 0.70 
(d) 0.22 (e) 0.85 (f) 0.92 
0.9889 

(a) 0.0792 (b) 0.1969 

(c) 0.8142 (d) 0.9889 

(e) 0.1858 


Section 2-3 


2-75. 


2-77. 
2-79. 


2-81. 
2-83. 


2-85. 


(a) 0.9 (b) 0 (c) 0 

(d) 0 (e) 0.1 

(a) 0.70 (b) 0.95 (c) No 
(a) 350/370 (b) 362/370 
(c) 358/370 (d) 345/370 
(a) 13/130 (b) 0.90, No 
(a) 0.7255 (b) 0.8235 

(c) 0.7255 

(a) 0.2264 (b) 0.9680 

(c) 0.9891 


Section 2-4 


2-87. 


2-89. 
2-91. 


2-93. 


2-95. 


(a) 86/100 (b) 79/100 
(c) 70/79 (d) 70/86 
(a) 0.903 (b) 0.591 
(a) 12/100 (b) 12/28 
(c) 34/122 
(a) 0.5625 
(c) 0.3333 
(a) 20/100 (b) 19/99 
(c) 0.038 (d) 0.2 


(b) 0.1918 


2-97. (a) 0.02 (b) 0.000458 
(c) 0.9547 
2-99. No 
2-101. (a) 0.6087 (b) 0.3913 
(c) 0.5 (d) 0.5 
2-103. (a) 0.0987 (b) 0.0650 
Section 2-5 
2-105. (a) 0.2 (b) 0.3 
2-107. 0.014 
2-109. 0.028 
2-111. (a) 0.2376 (b) 0.0078 
2-113. (a) 0.2 (b) 0.2 
2-117. (a) 0.0109 (b) 0.2264 
(c) 0.9891 (d) 0.1945 
2-119. (a) 0.0792 (b) 0.8142 
(c) 0.9208 (d) 0.8031 
2-121. 0.2 
Section 2-6 
2-123. independent 
2-125. (a) notindependent. (b) yes 
2-127. (a) not independent. (b) 0.733 
2-129. (a) 0.59 (b) 0.328 (c) 0.41 
2-131. (a) 0.00307 (b) 0.04096 
2-133. (a) 0.01 (b) 0.49 (c) 0.09 
2-135. (a) 0.00003 (b) 0.00024 
(c) 0.00107 
2-137. 0.9702 
2-139. not independent. 
2-141. independent. 
Section 2-7 
2-143. 0.89 
2-145. (a) 0.97638 (b) 0.20755 
2-147. (a) 0.615 (b) 0.618 
(c) 0.052 
2-149. (a) 0.9847 (b) 0.1184 
2-151. 0.2540 
2-153. 0.5 


Supplemental Exercises 


2-155. 
2-157. 


2-161. 


2-163. 


2-165. 


2-167. 
2-169. 


0.014 

(a) 0.82 (b) 0.90 
(d) 0.80 (e) 0.92 
(a) 0.2 (b) 0.202 
(c) 0.638 (d) 0.2 
(a) 0.03 (b) 0.97 (c) 0.40 
(d) 0.05 (e) 0.012 (f) 0.018 
(g) 0.0605 
(a) 0.18143 
(c) 0.86494 
0.000008 
(a) 50 (b) 37 (c) 93 


(c) 0.18 
(f) 0.98 


(b) 0.005976 


2-171. 
2-173. 
2-175. 
2-177. 
2-179. 
2-181. 
2-183. 
2-185. 


2-187. 
2-189. 


2-191. 
2-193. 


2-195. 


2-197. 


S = {A, A'D,, A'Dp, 
A'D, A'D4, A'Ds} 

(a) 0.19 (b) 0.15 (c) 0.99 

(d) 0.80 (e) 0.158 

(a) No (b) No 

(c) 40/240 (d) 200/240 

(e) 234/240 (f) 1 

(a) 0.282 (b) 0.718 

0.996 

(a) 0.0037. (b) 0.8108 

(a) 0.0778 (b) 0.00108 

(c) 0.947 

(a) 0.9764 (b) 0.3159 

(a) 0.207 (b) 0.625 

(a) 0.453 (b) 0.262 

(c) 0.881 (d) 0.547 

(e) 0.783 (f) 0.687 

1.58 X 1077 

(a) 0.67336 

(b) 2.646 x 1078 

(c) 0.99973 

(a) 36’ (b) 70(26°) 

(c) 100(26°) 

(a) 0.994, 0.995 

(b) 0.99, 0.985 

(c) 0.998, 0.9975 


Mind-Expanding Exercises 


2-199, (a) n=3 (b) n=3 

2-201. 0.306, 0.694 

CHAPTER 3 

Section 3-1 

3-1. {0, 1, 2,..., 1000} 

3-3. {0,1,2,..., 99999} 

3-5. {1,2,...,491} 

39, {61,3 

3-9. {0,1,2,..., 15} 

3-11. {0,1,2,..., 10000} 

3-13. {0,1,2,...,40000} 

Section 3-2 

3-15. (a) 1 (b) 7/8 (©) 3/4 
(d) 1/2 

3-17. (a) 9/25 (b) 4/25 
(c) 12/25 (d) 1 

3-19. f(0) = 0.033, f(1) = 0.364, 
F(2) = 0.603 

3-21. P(X = 0) = 0.008, 
P(X = 1) = 0.096, 
P(X = 2) = 0.384, 
P(X = 3) = 0.512 

3-23. P(X = 50) = 0.5, 


P(X = 25) = 0.3, 
P(X = 10) = 0.2 


3-25. 


3-27. 


P(X = 15) = 06, 
P(X = 5) = 0.3, 

P(X = —0.5)=0.1 
P(X = 0) = 0.00001, 
P(X = 1) = 0.00167, 
P(X = 2) = 0.07663, 
P(X = 3) = 0.92169 


Section 3-3 


3-29, 


3-31. 


3-33. 


3-35. 


3-37. 


3-39. 


3-41. 


X = waiting time (hours) 
0.038, x = 1 

0.102, x = 2 

0.172, x = 3 

0.204, x = 4 

0.174, x =5 

0.124, x =6 

0.080, x = 7 

0.036, x = 8 

0.028, x = 9 

0.022, x = 10 

0.020, x = 15 

X = Non-failed well depth 

P(X = 255) = (1515+ 1343)/ 
7726 = 0.370 

P(X = 218) = 26/7726 = 0.003 
P(X = 317) = 3290/7726 = 0.426 
P(X = 231) = 349/7726 = 0.045 
P(X = 267) = (280+887)/7726 
= 0.151 

P(X = 217) = 36/7726 = 0.005 
(a) 7/8 (b) 1 (c) 3/4 

(d) 3/8 


fQ) = 


0, x<0O 
0.008, OsSx<1 
0.104, 1sx<2 
0.488, 25x<3 

1, 35x 
where f, (0) = 0.008, 

fC) = 0.096, 
Ff, (2) = 0.384, 
fF. B) = 0.512 
0, x< 10 
0.2, 10=x< 25 
0.5, 25=x< 50 
1, 50 =x 
where P(X = 50 million) = 0.5, 
P(X = 25 million) = 0.3, 
P(X = 10 million) = 0.2 
(b) 0.5 (c) 0.5 


F(x) = 


F(x) = 


(a) 1 
(d) 0.5 
(a) 1 (b) 0.75 (c) 0.25 
(d) 0.25 (e) 0 (f) 0 


0, x <266 
0.24, 2665x <271 
0.54, 271 sx <274 

1, 274sx 


3-43. F(x) = 


0, x< 15 
0.05, 15sx<3 
0.30, 35x<45 
0.65, 455x<5 
0.85, Ssx<7 

1 Tsx 


3-45. F(x) = 


Section 3-4 
3-47. 
3-49, 


w= 2,07 =2 
w =0,07 = 1.5 
3-51. pp = 2.8, 0° = 1.36 
3-53. w = 1.57,07 = 0.311 
3-55. 24 
3-57. pw = 0.0004, o? = 0.00039996 
3-59. (a) w = 18.694, 
o° = 735.9644, 
o = 27.1287 
(b) p = 37.172, 
o* = 2947.996, 
o = 54.2955 
E(X) = 4.808, V(X) = 6.147 
E(X) = 281.83, VX) = 976.24 


3-61. 
3-63. 


Section 3-5 

3-65. w= 2,07 = 0.667 

3-67. w =3.5,07 = 1.25 

3-69. (a) p = 687.5, 0° = 56.25 
(b) p = 87.5, 02 = 56.25 
E(X) = 4.5, E(Y) = 22.5, 
oy = 14.36 

pw =7,o0 = 1414 


3-71. 
3-73. 


Section 3-6 
3-77. (a) 0.9298 (b) 0 
(c) 0.0112 (d) 0.0016 
(a) 2.40 x 1078 
(b) 0.9999 
(c) 9.91 x 1078 
(d) 1.138 X 1074 
(a) 0 (b) 10 
(a) 0.215 (b) 0.9999 (c) 4 
(a) 0.410 (b) 0.218 (c) 0.37 
(a) 1 (b) 0.999997 
(c) E(X) = 12.244, 
V(X) = 2.179 
(a) Binomial, p = 10*/36”, 
n= 1E09 (b) 0 
(c) E(X) = 4593.9, 
V(X) = 4593.9 


3-79. 


3-81. 
3-85. 
3-87. 
3-89. 


3-91. 
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3-93. 
3-95. 


(a) 0.9961 (b) 0.9886 
(a) 0.142 (b) 0.322 
(c) 0.963 

(a) 0.009 (b) 0.382 
(c) 0.972 (d) 3 


3-97. 


Section 3-7 
3-99. (a) 0.5 (b) 0.0625 
(c) 0.0039 (d) 0.75 (e) 0.25 
3-101. (a) 5 (b) 5 
3-103. (a) 0.0064 (b) 0.9984 
(c) 0.008 
3-105. (a) 0.0167 (b) 0.9039 
(c) 50 
3-107. (a) 0.13 (b) 0.098 
(c) 7.69 ~ 8 
3-109. (a) 3.91 x 10°” 
(b) 200 (c) 2.56 x 10'8 
3-111. (a) 3 x 108 
(b) 3 x 10!° 
3-115. (a) 10 (b) 0.039 
(d) 0.271 


(c) 0.039 


Section 3-8 
3-117. (a) 0.4191 
(c) 0.001236 
(d) E(X) = 0.8, V(X) = 0.6206 
3-121. (a) 0.1201 (b) 0.8523 
3-123. (a) 0.087 (b) 0.9934 
(c) 0.297 (d) 0.9998 
3-125. (a) 0.7069 (b) 0.0607 
(c) 0.2811. 
3-127. (a) 0.0041 
(c) 0.0165 


(b) O 


(b) 0.3091 


Section 3-9 
3-129. (a) 0.0183 (b) 0.2381 
(c) 0.1954 (d) 0.0298 
3-131. E(X) = V(X) = 2.996. 
3-133. (a) 0.264 (b) 48 
3-135. (a) 0.4566 (b) 0.047 
3-137. (a) 0.2 (b) 99.89% 
3-139. (a) 0.6065 (b) 0.0067 
(c) P(W = 0) = 0.0067, 
P(W = 1) = 0.0437, 
P(W = 1) = 0.0504 
3-141. (a) 0.026 (b) 0.287 
(c) 0.868 


Supplemental Exercises 
3-143. E(X) = 1/4, VX) = 0.0104 
3-145. (a) n=50,p =0.1 (b) 0.112 
(c) P(X = 49) = 4.51 x 107% 
3-147. (a) 0.000224 (b) 0.2256 
(c) 0.4189 
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3-149, (a) 0.1024 (b) 0.1074 
3-151. (a) 3000 (b) 1731.18 
3-153. (a) P(X = 0) = 0.0498 
(b) 0.5768 
(c) PX Sx) 2=0.9,x =5 
(d) o = = 6. Not 
appropriate. 


3-155. (a) 0.1877 (b) 0.4148 (c) 15 


3-157. (a) 0.0110 (b) 8/3 
3-159. 40000 

3-161. 0.1330 

3-163. (a) 500 (b) 222.49 
3-165. 0.1 


3-167. f,(0) = 0.16, fy(1) = 0.19, 
fx(2) = 0.20, fr(3) = 0.31, 


Fx(4) = 0.14 
3-169. f.(2) = 0.2, f\(5.7) = 0.3, 


f(6.5) = 0.3, f(8.5) = 0.2 


3-171. (a) 0.0433 (b) 3.58 
3-173. (a) f,(0) = 0.2357, 
FxQ) = 0.3971, 

Fx) = 0.2647, 

Fx@) = 0.0873, 
FIA = 0.01424, 

Fx (5) = 0.00092 

(b) f,(0) = 0.0546, 

fxC) = 0.1866, 

fx(2) = 0.2837, 

fx) = 0.2528, 

Fx(4) = 0.1463, 

Fx (5) = 0.0574, 

fx (6) = 0.0155, 

Fx (7) = 0.0028, 

Fx(8) = 0.0003, 

Fx(9) = 0.0000, 

fx(10) = 0.0000 

3-175. 37.8 seconds 
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3-181. (a) 131 (b) 123 


CHAPTER 4 


Section 4-2 


4-1. (a) 0.3679 (b) 0.2858 (c) 0 


(d) 0.9817 (e) 0.0498 
4-3. (a) 0.5 (b) 0.1464 
(c) 0.7072 (d) 0.8536 
(e) 1.12 radians 
4-5. (a) 0.4375 (b) 0.7969 
(c) 0.5625 (d) 0.7031 


(d) 0 (e) 1 (f) 0.9655 
4-9. (a) 0.5 (b) 49.8 
4-11. (a) 0.1 


(e) 0.5 
4-7. (a) 0.5 (b) 0.4375 (c) 0.125 


APPENDIX B ANSWERS TO SELECTED EXERCISES 


Section 4-3 


4-13. 
4-15. 


4-17. 


(a) 0.56 (b) 0.7 (c) 0 (d) 0 
F(x) = 0 for x = 0; 1-e* 
forx >0 


F(x) = 0 for x = =o 


0.5 sin x 


+05for—~<x< 
T 5 or = = 
2 x 


lforx=— 
2 


(a) 0.56 (b) 0.7 
(d) 0 

0.2 

F(x) = 0 for x < 0; 0.25x? for 
0s=x<2;lfor2=x 


(c) 0 


» f(QX) = 0.2 for0 <x <4; fx) = 


0.04 for4 =x <9 


Section 4-4 


4-27. 
4-29. 
4-31. 
4-33. 
4-35. 


4-37. 


E(X) = 2, V(X) = 4/3 

E(X) = 0, V(X) = 0.6 

E(X) = 4, V(X) = 3.2 

E(X) =2 

(a) E(X) = 109.39, UX) = 33.19 
(b) 54.70 

(a) E(X) = 5.1, V(X) = 0.01 
(b) 0.3679 


Section 4-5 


4-39. 


4-41. 


4-43. 


4-45. 


4-47. 


(a) E(X) = 0, V(X) = 0.577 

(b) 0.90 (c) F(x) =0 
for x < —1;0.5x + 0.5 
for -1 Sx<1;1lforl =x 

(a) F(x) = 0 for x < 0.95; 
10x—9.5 for 0.95 =x < 
1.05; 1 for 1.05 = x 

(b) 0.3 (c) 0.96 

(d) E(X) = 1.00, V(X) = 
0.00083 

(a) F(x) = 0 for x < 0.2050; 
100x — 20.50 for 0.2050 = 
x < 0.2150; 1 for 0.21.50 = x 

(b) 0.25 (c) 0.2140 

(d) E(X) = 0.2100, VX) = 
8.33 X 10° 

(a) F(X) = x/90 for0 =x = 90 

(b) E(X) = 45, V(X) = 675 

(c) 1/3 (d) 0.333 

(a) Oy = 34.64 (b) 1/3 

(c) 1/2 


Section 4-6 


4-49, 


(a) 0.90658 (b) 0.99865 
(c) 0.07353 (d) 0.98422 
(e) 0.95116 


4-51. (a) 0.90 (b) 0.5 (c) 1.28 
(d) —1.28 (e) 1.33 

4-53. (a) 0.93319 (b) 0.69146 
(c) 0.9545 (d) 0.00132 
(e) 0.15866 

4-55. (a) 0.93319 (b) 0.89435 
(c) 0.38292 (d) 0.80128 
(e) 0.54674 

4-57. (a) 0.99379 (b) 0.13591 
(c) 5835 

4-59. (a) 0.0228 (b) 0.019 
(c) 152.028 
(d) small (less than 5%) 

4-61. (a) 0.0082 (b) 0.72109 
(c) 0.564 

4-63. (a) 12.309 (b) 12.1545 

4-65. (a) 0.00621 (b) 0.308538 
(c) 133.33 

4-67. (a) 0.1587 (b) 1.3936 
(c) 0.9545 

4-69. (a) 0.00043 (b) 6016 
(c) 1/8 

4-71. (a) 0.02275 (b) 0.324 
(c) 11.455 

4-73. [23.5, 24.5], no effect from 
stdev 

4-75. (a) 0.0248 (b) 0.1501 
(c) 92.0213 

Section 4-7 

4-77. (a) 0.0853 (b) 0.8293 
(c) 0.0575 

4-79. (a) 0.1446 (b) 0.4761 
(c) 0.3823 

4-81. (a) 0.2743 (b) 0.8413 

4-83. 0.022 

4-85. 0.5 

4-87. (a) 0 (b) 0.156 (c) 13,300 
(d) 8.3 days/year (e) 0.0052 

4-89. (a) 0.012 (b) 0.9732 
(c) 536.78 

Section 4-8 

4-91. (a) 0.3679 (b) 0.1353 
(c) 0.9502 
(d) 0.95, x = 29.96 

4-93. (a) 0.3679 (b) 0.2835 
(c) 0.1170 

4-95. (a) 0.1353 (b) 0.4866 
(c) 0.2031 (d) 34.54 

4-97. (a) 0.0498 (b) 0.8775 

4-99. (a) 0.0025 (b) 0.6321 
(c) 23.03 (d) same as part (c) 


(e) 6.93 


4-101. 


4-103. 


4-105. 


4-107. 


4-109. 


(a) 15.625 
(c) 3X 10°° 
(a) 0.2212 (b) 0.2865 

(c) 0.2212 (d) 0.9179 

(e) 0.2337 

(a) 0.3528 (b) 0.04979 
(c) 46.05 (d) 6.14 x 10°° 
(e) e  (f) same 

(a) 0.3679 (b) 0.1353 

(c) 0.0498 

(d) does not depend on 0 
(a) 0.435 (b) 0.135 

(c) 0.369 (d) 0.865 


(b) 0.1629 


Section 4-9 


4-111. 


4-113. 


4-115. 


4-117. 


4-119. 
4-123. 


(a) 120 (b) 1.32934 

(c) 11.6317 

(a) Erlang \ = 5 calls/min, 
r=10 (b) E(X) =2, 


V(X) = 0.4 
(c) 0.2 minute (d) 0.1755 
(e) 0.2643 
(a) 50000 (b) 0.6767 
(a) 5X 10° (b) V(X) = 
5 X 10!°, o = 223607 
(c) 0.0803 


(a) 0.1429 (b) 0.1847 
(a) 1.54 (b) 0.632 


Section 4-10 


4-125. 


4-127. 
4-129. 


4-131. 


4-135. 


4-137. 


E(X) = 12000, V(X) = 
3.61 X 10!° 
1000 


(a) 803.68 hours 

(b) 85319.64 (c) 0.1576 
(a) 443.11 (b) 53650.5 
(c) 0.2212 
(a) 0.5698 
(c) 0.4724 
(a) 0.0468 


(b) 0.1850 


(b) 0.1388 


Section 4-11 


4-139. 


4-141. 


4-143. 


4-147. 


(a) 0.0016 (b) 0.0029 

(c) E(X) = 12.1825, V(X) = 
1202455.87 

(a) 0.03593 (b) 1.65 

(c) 2.7183 
12.6965 

(a) 0 = 8.4056, w? = 1.6094 

(b) 0.2643 (c) 881.65 

(a) E(X) = 4.855, V(X) = 4.090 
(b) 0.9263 (c) 0.008 


Section 4-12 


4-149, 


(a) 0.0313 (b) 0.4559 
(c) E(X) = 0.7143, 
V(X) = 0.0454 


4-151. (a) Mode = 0.8333, E(X) = 
0.6818, V(X) = 0.0402 
(b) Mode = 0.6316, E(X) = 
0.6154, V(X) = 0.0137 
4-153. 0.0272 


Supplemental Exercises 


4-155. (a) 0.99379 (b) 0.621% 

4-157. (a) 0.15866 (b) 90.0 

(c) 0.9973 (d) (0.9973)!° 

(e) 9.973 

(a) 0.0217 (b) 0.9566 

(c) 229.5 

0.8488 

(a) 620.4 (b) 105154.9 

(c) 0.4559 

(a) 0.0625 (b) 0.75 

(d) F(x) = 0 for x < 2; 
x/4—x+1for2<=x<4; 
lfor4=x 

(e) E(X) = 10/3, V(X) = 0.2222 

(a) 0.3935 (b) 0.9933 

(a) 0 = 3.43, w” = 0.96 

(b) 0.946301 

(a) 0.6915 

(c) 1.86 

(a) 0.0062 

(b) 0.012 

(c) 5.33 

0.0008 to 0.0032 

(a) 0.5633 (b) 737.5 

(a) 0.9906 (b) 0.8790 


4-159. 


4-161. 
4-163. 


4-165. (c) 0.5 


4-167. 
4-169. 


4-171. (b) 0.683 


4-173. 


4-175. 
4-177. 
4-179. 


Mind-Expanding Exercises 
4-183. (a) ko? (b) ko? + kp. — my? 


CHAPTER 5 


Section 5-1 
5-1. (a) 3/8 
(d) 1/8 
(e) VX) = 0.4961 VY) = 
1.8594 
(f) f() = 1/4, f(.5) = 3/8, 
f(2.5) = 1/4, fB) = 1/8 
(g) f(2) = 1/3, f3) = 2/3 
(h) 1 (i) 21/3 
(j) Not independent 
5-3. (a) 3/8 (b) 3/8 (c) 7/8 
(d) 5/8 (e) V(X) = 0.4219 
V(Y) = 1.6875 
(f) f(-1) = 1/8, f(-0.5) = 1/4, 
f0.5) = 1/2, fl) = 1/8 
(g) 1 G1 @ 05 
(j) Not independent 


(b) 5/8 (c) 3/8 


5-7. 


5-11. 
5-13. 


5-15. 


5-17. 


5-19. 


5-21. 
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(b) fx(0) = 0.970299, 
fx) = 0.029403, 
fx(2) = 0.000297, 
fx) = 0.000001 
(c) 0.03 (d) f(0) = 0.920824, 
fQ) = 0.077543, f(2) = 
0.001632 
(e) 0.080807 
(g) Not independent 
(b) f(0) = 2.40 x 10°°, 
fQ) = 1.36 x 10°, 
f(2) = 2.899 x 104, 
Ff) = 0.0274, f(4) = 0.972 
(c) 3.972 (d) equals f(y) 
(e) 3.988 (f) 0.0120 
(g) Independent 
(b) fx(O) = 0.2511, 
fx() = 0.0405, 
fx(2) = 0.0063, 
fx) = 0.0009, 
fx (4) = 0.0001 
(c) 0.0562 
(d) fyg(0) = 2/3, fyg(1) = 1/3, 
fyp(2) = fy(3) = frp(4) =0 
(e) 0.0003 (f) 0.0741 
(g) Not independent 
(c) 0.308 (d) 5.7 
(a) 0.4444 (b) 0.6944 
(c) 0.5833 (d) 0.3733 
(e) 2 (f) 0 
(g) fila) = 2x/9;0<x <3 
(h) fris(y) = 2y/9,0<y <3 
(i) 2@) 4/9 
(k) fep(a) = 2x/9,0<x <3 
(a) 1/81 (b) 5/27 
(c) 0.790 (d) 16/81 
(e) 12/5 (f) 8/5 
(g) FQ) = 4x°/81;0<x<3 
(h) fyx=i(y) = 2y;0<y <1 
(i) 1 (jo 
(k) fepa2lx) = 2x/9,0 <x <3 
(a) 0.9879 (b) 0.0067 
(c) 0.000308 (d) 0.9939 
(ec) 0.04 (f) 8/15 
(g) f(x) = Se x > 0 
(h) fry) = 30° 1 y 
(i) 4/3 
(j) 0.9502, f,(2) = 15e~°/2; 
O0<y 
(k) fav-29) = 20S O< x <2 
(a) 1/30 (b) 1/12 (c) 19/9 
(d) 97/45 (g) 1 (h) 0.25 
(a) P(X > 5, Y> 5) = 0.0439, 
P(X > 10, Y> 10) = 0.0019 
(b) 0.0655 
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5-23. 


5-25. 


5-27. 
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(a) 0.25. (b) 0.0625 
(d) 1 (e) 2/3 
(g) 0.0625 

(h) fxy2, 2) = 2x30<x<1 
(i) 0.25 

(a) 0.75 (b) 3/4 

(c) 0.875 (d) 0.25 

(g) 1 forx > 0 

(a) 0.032 (b) 0.0267 


(c) 1 
(f) 0.25 


Section 5-2 


5-29. 
5-31. 
5-33. 
5-35. 
5-37. 
5-39. 


0.8851 

c = 1/36, p = —0.0435 
p = —0.5, negative 

c = 8/81, p = 0.4924 
Oxy = Pxy = 0 


Oxy = Pxy = 0 


Section 5-3 


5-43. 


5-45. 


5-47. 


5-49. 


(a) p; = 0.05, p, = 0.85, 
p3 = 0.10 

(d) E(X) = 1, V(X) = 0.95 

(f) 0.07195 

(g) 0.7358 

(h) E(Y) = 17 

(i) 0 

(j) (X=2,Y=1)= 
0.0540, P(X = 2| ¥ = 17) 
= 0.2224 

(k) E(X|¥= 17) =1 

(b) 0.1944 

(c) 0.0001 

(e) E(X) =2.4 

(f) E(Y) = 1.2 

(g) 0.7347 

(h) 0 

(i) P(X = 0|¥ = 2) = 0.0204, 
P(X = 1|Y=2) = 0.2449, 
P(X = 2| Y= 2) = 0.7347 

(j) 1.7143 

(a) 0.7887 (b) 0.7887 

(c) 0.6220 

0.8270 


Section 5-4 


5-55. 


5-57. 
5-59. 
5-61. 
5-63. 


5-65. 


(a) 18 (b) 77 
(d) 0.873 

(a) E(T) = 4, 0, = 0.1414 
(a) 0 (b) 1 

E(X) = 1290, V(X) = 19600 
(a) 0.002 (b) n=6 

(c) 0.9612 

(a) 0.0027 (b) No (c) 0 


(c) 0.5 


Section 5-5 
5-67. fy(y) = A y = 3,5, 7,9 


5-69. 


(b) 18 
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5-75. 


5-77. 


5-79, 


5-81. 


5-83. 


5-85. 


5-87. 
5-91. 


5-95. 


(a) 3/8 (b) 3/4 (c) 3/4 

(d) 3/8 

(e) E(X) = 7/8, VX) = 39/64, 
E(Y) = 7/8, VY) = 39/64 

(h) 2/3 (i) not independent 

(j) 0.7949 

(a) 0.0560 (b) Z~Bin(20, 0.1) 

(c) 2 (d) 0.863 

(e) Z|X~Bin(4, 0.25) 

(f) 1 (g) not independent 

(a) 1/108 (b) 0.5787 

(c) 3/4 (d) 0.2199 

(e) 9/4. (f) 4/3 

3/4 

(a) 0.085 

(c) 3 

(a) 0.499 (b) 0.5 

(a) 0.057 (b) 0.057 

(a) E(T) = 1.5, VT) = 0.078 

(b) 0.0367 

(c) E(P) = 4, V(P) = 0.568 

(a) p; = 0.13, p, = 0.72, 
p3=0.15,x +y+z= 12 

(b) not possible (c) 0.736 

(d) 0 (e) 0.970 (f) 0.285 

(g) 0.345 


(b) Z~Bin(10, 0.3) 


CHAPTER 6 
Section 6-1 


6-1. 
6-3. 


No, usually not, Ex: {2, 3} 
No, usually not, Ex: {1, 2, 3, 
1000}. 

Yes, Ex: {5,5, 5,5, 5,5, 5}. 
x = 74.0044, s = 0.00473 
x = 7068.1, s = 226.5 

x = 43.975, s = 12.294 
w= 5.44 

xX, = 287.89, s; = 160.154 
X, = 325.01, s, = 121.20 
x = 7.184, s = 0.02066 

(a) x = 65.86, s = 12.16 
(b) x = 66.86, s = 10.74 


Section 6-2 


6-21. 
6-23. 


Symmetric with a single mode 
X = 1436.5, lower quartile: 
Q, = 1097.8, and upper 
quartile: Q; = 1735.0 


6-25. 


6-27. 


6-29. 


6-31. 


6-33. 


6-35. 


X = 89.250, lower quartile: 
Q,= 86.100, and upper 
quartile: Q; = 93.125 

median: X¥ = 1436.5, mode: 
1102, 1315, and 1750, mean: 

x = 1403.7 

X = 366.57, s = 940.02, 

and X = 41.455 

95th percentile = 5479 

x = 260.3, 5 = 13.41, 

X = 260.85, and 90th percentile 
= 277.2 

X = 89.45, 5 = 2.8, * = 90, 
and proportion = 22/40 = 55% 


Section 6-4 


6-53. 


6-55. 


6-57. 


(a) x = 2.415, s? = 0.285, 
and s = 0.543 

(a) X = 952.44, s? = 9.55, 
and s = 3.09 

(b) % = 953 

(a) ¥ = 67.50, lower quartile: 
Q, = 58.50, and upper 
quartile: Q; = 75.00 

(b) ¥ = 68.00, lower quartile: 
Q, = 60.00, and upper 
quartile: Q; = 75.00 
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6-87. 


6-93. 


(a) Sample 1 Range = 4 
Sample 2 Range = 4 
(b) Sample 1: s = 1.604 
Sample 2: s = 1.852 
(b) x = 9.325, s = 4.486 
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6-105. 


6-109. 
6-111. 
6-113. 


s* (old) = 50.61, s? (new) = 
5061.1 

¥ = 431.89, s5 = 34.028 

X = 69 

(a) X = 89.29 (b) x = 89.19 
(c) No 


CHAPTER 7 
Section 7-2 


7-1. 
7-3. 
7-5. 
7-7. 
7-9. 
7-11. 
7-13. 
7-15. 


8/103 
0.8186 
0.4306 
0.191 
n=12 
0.2312 

(a) 0.5885 
0.983 


(b) 0.1759 


Section 7-3 


7-17. 


7-19. 
7-21. 
7-27. 
7-29. 


7-33. 


(a) N= 25, Mean = 150.47, 
S? = 105.06, SS = 2521.5 

(b) 150.468 

Bias = o°(2— — 1) 

V(6,) = 07/7 is smallest 

Bias = o7/n 

(a) 423.33 

(c) 1.85 

(e) 0.2917 

(d) 0.01 


(b) 9.08 
(d) 424 


(e) 0.0413 


Section 7-4 


7-35. 
7-39. 
7-41. 
7-47. 


7-49, 


x 

unbiased 

(a) 6 = &x?/(2n) 

(b) same as part (a) 

(b) Ag(m + x + 1)/ 
(m+ y+ 1) 

(a) 5.046 (b) 5.05 


Supplemental Exercises 


7-55. 
7-57. 
7-59. 
7-61. 
7-65. 


0.8664 

5.6569 

n= 100 

6 = x/3 

= 21.86, Asoo9 = 109,300, 
p= 07 


CHAPTER 8 


Section 8-1 


8-1. 


8-3. 
8-5. 


8-7. 
8-9. 
8-11. 
8-13. 


8-15. 
8-17. 


8-19. 
8-21. 


(a) 96.76% 

(c) 93.56% 

(a) 1.29 (b) 1.65 (c) 2.33 

(a) 1 CI = 50, 2" CI = 50 

(b) higher confidence implies a 
wider Cl 

(a) 4 (b) 7 

(a) Longer (b) No (c) Yes 

(87.85, 93.11] 

(a) [74.0353, 74.0367] 

(b) [74.035, %) 

(a) (3232.11, 3267.89] 

(b) (3226.4, 3273.6] 

267 

72 

(a) (13.383, 14.157] 

(b) [13.521, %) 

(c) 1 (d) 2 


(b) 98.72% 


Section 8-2 


8-23. 


(a) 2.179 (b) 2.064 
(c) 3.012 (d) 4.073 


8-25. (a) Mean = 25.1848, 
Variance = 2.5760 
(b) 24.037 < p < 26.333 

8-27. [58197.33, 62082.07] 

8-29. [1.094, 1.106] 

8-31. (—~, 125.312] 

8-33. [443.520, 528.080] 

8-35. [301.06, 333.34] 

8-37. (b) [2237.3, 2282.5] 
(c) [2241.4, %) 

8-39. (b) [4.051, 4.575] 
(c) [4.099, 2%) 

8-41. (b) [2.813, 2.991] 


Section 8-3 

8-43. (a) 18.31 (b) 13.85 
(c) 10.12, 30.14 

8-45. [0.0055, ~) 

8-47. [0.31, 0.46] 

8-49. [0.626, 1.926] 

8-51. o = 0.0122 


Section 8-4 

8-53. (a) [0.02029, 0.06637] 
(b) [—2, 0.0627] 

8-55. (a) [0.501, 0.571] 
(b) [0.506, %) 

8-57. (a) [0.225, 0.575] 
(c) 2401 

8-59. 666 


Section 8-6 
8-61. [52131.1, 68148.3] 
8-63. [1.068, 1.13] 
8-65. [292.049, 679.551] 
8-67. [263.7, 370.7] 
8-69. [2193.5, 2326.5] 
8-71. 90% PI = [2.71, 3.09] 
90% CI = [2.85, 2.95] 
99% CI = [2.81, 2.99] 
8-73. [49555.54, 70723.86] 
8-75. [1.06, 1.14] 
8-77. TI = [237.18, 734.42] 
CI = [443.42, 528.08] 
8-79. TI = [247.60, 386.60] 
CI = [301.06, 333.34] 
8-81. TI = [2.49, 3.31] 
CI = [2.84, 2.96] 


Supplemental Exercises 
8-85. (a) 0.0997 and 0.064 
(b) 0.044 and 0.014 
(c) 0.0051 and < 0.001 
8-87. (a) Normality 
(b) [16.99, 2) 
(c) [16.99, 33.25] 


(b) 2305 


8-89. 


8-91. 


8-93. 
8-95. 


8-97. 


8-99. 
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(d) (—%, 343.74] 

(e) (28.23, 343.74] 

(f) 16.91 < p < 29.09 
15.85 <0? < 192.97 

(g) mean: [16.88, 33.12], 
variance: [28.16, 342.94] 

(a) [13.74, 16.92] 

(b) [13.24, 17.42] 

(a) Yes 

(b) [197.84, 208.56] 

(c) [185.41, 220.99] 

(d) [171.21, 235.19] 

(0.0956, 2) 

(a) Yes 

(b) [1.501, 1.557] 

(c) [1370, 1.688] 

(d) [1.339, 1.719] 

(a) [0.0004505, 0.009549] 

(b) 518 

(c) 26002 

(a) Normality 

(c) [18.478, 26.982] 

(e) [19.565, 123.289] 


Mind-Expanding Exercises 


8-101. (b) [28.62, 101.98] 
8-103. (a) 46 (b) [10.19, 10.41], 


p = 0.6004 


8-105. 950 of CIs and 0.9963 


CHAPTER 9 


Section 9-1 


9-1. 


9-3. 


9-5. 


9-7. 


9-11. 


9-13. 


9-15. 


(a) Yes (b) No (c) No 
(d) No (e) No 
(a) Ho:~ = 20nm 

Ay:~ <20nm = (b) No 
(a) a = 0.02275. 
(b) B = 0.15866 


()B=05 _ 

(a) 11.4175 = X, = 11.42 

(b) 11.5875 = X, < 11.59 
<= 


(d) 11.7937 = X. = 11.84 

(a) P-value = 0.0135 

(b) P-value = 0.000034 

(c) P-value = 0.158655 

(a) 0.09296 

(b) B = 0.04648 

(c) B = 0.00005 

(a) B = 0.005543 

(b) B = 0.082264 

(c) As n increases, 8 decreases 
(a) a = 0.05705 (b) B = 0.5 
(c) B = 0.05705. 


9-19. 


9-21. 


9-23. 


9-25. 


9-27. 
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(a) pw = 191.40 

(b) p = 185.37 

(c) p = 186.6 

(d) p S 183.2 

(a) P-value = 0.2148 

(b) P-value = 0.008894 

(c) P-value = 0.785236 

(a) a = 0.0164 

(b) 1 — B = 0.21186 

(c) a will increase and the 
power will increase with 
increased sample size. 

(a) P-value = 0.238 

(b) P-value = 0.0007 

(c) P-value = 0.2585 

(a) a = 0.29372 

(b) B = 0.25721 

(a)a ~0O 

(b) B = 0.99506 


Section 9-2 


9-29. 


9-31. 


9-33. 


9-35. 


9-37. 


9-39. 


9-41. 


9-43. 


(a) Ho: wp = 10, H;: pp > 10 

(b) Hp: p= 7,H,: np #7 

(c) Hy: w=5,Aj:n~<5 

(a) a=z, = —2.33 

(b) a= z = —1.64 

(c) a= Zz = —1.29 

(a) P-value = 0.04 

(b) P-value = 0.066 

(c) P-value = 0.69 

(a) P-value = 0.98 

(b) P-value = 0.03 

(c) P-value = 0.65 

(a) StDev = 0.7495, z) = 
—0.468, P-value = 0.68, 
fail to reject Hp 

(b) one-sided 

(c) 19.42 =p S 20.35 

(d) 0.640 

(a) 0.6827 (b) one-sided 

(c) P-value = 0.002, reject Hp 

(d) 98.8518 = p 

(e) P-value = 0.15, fail to 
reject Hy 

(a) 7 = —0.95 > —1.96, fail 
to reject Hp 

(b) B = 0.80939 

(c) n= 16 

(a) Z% = 1.26 < 1.65 fail to 
reject Hy 

(b) P-value = 0.1038 

(c) B = 0.000325 

(d)n=1 

(e) 39.85 =p 


9-45. 


9-47. 


(a) Z = 1.56 > —1.65, fail to 
reject Hy 

(b) P-value = 0.94 

(c) Power = 0.97062 

(d) n=5 

(e) pw = 104.53 

(a) z = 1.77 > 1.65, reject Ho 

(b) P-value = 0.04 

(c) Power = | 

(d) n=2 

(e) 4.003 = p 


Section 9-3 


9-49, 


9-51. 


9-53. 


9-55. 


9-57. 


9-59. 


9-61. 


9-63. 


(a) critical value > 2.539 

(b) critical value > 1.796 

(c) critical value > 1.345 

(a) 0.05S=p=0.1 

(b) 0.05 =p =0.1 

(c) 05 =p =0.8 

(a) 0.95 = p = 0.975 

(b) 0.025 = p = 0.05 

(c) 0.6 = p = 0.75 

(a) 9 

(b) .05 < P-value < = 0.1, fail 
to reject Hy 

(c) two-sided 

(d) 11.89 =p S 13.23 

(e) fo = 1.905, reject Hy 

(f) reject Ho 

(a) to = 0.6665, fail to reject Ho 

(b) equal to P-value © 0.5 

(c) 246.84 S p S 404.15, fail 
to reject Hy 

(a) |fo| =3.48 > 2.064 
reject Hy P-value = 0.002 

(b) Yes 

(c) power = 1 

(d) n = 20 

(e) 98.065 = p = 98.463 

(a) |t| = 1.456 < 2.064, fail 
to reject Ho; 
0.1 < P-value < 0.2. 

(b) Yes 

(c) power = 0.80 

(d) n = 100 

(e) 129.406 = p S 130.100 

(a) |f| = 1.55 < 2.861, 
fail to reject Hp; 
0.10 < P-value < 0.20 

(b) Yes, see normal probability 
plot 

(c) power = 0.30 

(d) n = 40 

(ec) 19S p= 4.62 


9-65. 


9-67. 


9-69. 


(a) to = 0.15 < 1.753., fail to 
reject Hp; P-value > 0.40. 
(b) n = 4. Yes 
to = 3.46 > 1.833, reject Hp 
0.0025 < P-value < 0.005 
(a) t) = —14.69 < 1.6604, 
fail to reject Hp; P-value 
> 0.995 
(b) Yes 
(c) power = 1 
(d) n = 15 


Section 9-4 


9-71. 


9-73. 


9-75. 


9-77. 


9-79. 


9-81. 


9-83. 


(a) critical values 6.84 and 38.58 

(b) critical values 3.82 and 21.92 

(c) critical values 6.57 and 23.68 

@) Xai = 7.63 

(b) Xi-an—1 = 4.57 

CO) Niet S779 

(a) 0.5 < P-value < 0.9 

(b) 0.5 < P-value < 0.9 

(c) 0.005 < P-value < 0.01 

(a) xp = 0.23 < 26.30, fail to 
reject Hp; P-value > 0.995 

(b) 0.07 =o 

(a) x = 109.52 > 71.42, 
reject Hp; P-value < 0.01 

(b) 0.31 <0 < 0.46 

(a) xo = 12.46 > 7.26, fail to 
reject Ho; 
0.1 < P-value < 0.4 

(b) o = 5240 

(a) xj = 11.52 < 19.02, fail to 
reject Hp; 0.2 < P-value 

(b) 0.45 

(c) n = 30 


Section 9-5 


9-85. 


9-87. 


9-89. 


9-91. 


(a) one-sided 

(b) appropriate 

(c) p = 0.3564, z = —1.1867 
P-value = 0.118, 
p = 0.6105 

(d) 0.2354 

(a) z = —1.31 > —1.65, fail 
to reject Ho; 
P-value = 0.095 

(b) p = 0.0303 

(a) Z% = 2.06 > 1.65, reject 
Ho; P-value = 0.0196 

(b) 0.7969 = p 

(a) z = —0.94 < 2.33 fail to 
reject Hp; P-value = 0.826 

(b) 0.035 = p 


9-93. zz = 1.58 > —2.33, fail to 

reject Hp 

(a) z = 0.54 < 1.65, fail to 
reject Hp; P-value = 0.295 


(b) B = 0.639, n = 118 


9-95. 


Section 9-7 
9-97. (a) x5 = 6.955 < 15.09, fail to 


reject Hp 
(b) P-value = 0.2237 (from 
Minitab) 
9-99. (a) xi = 10.39 > 7.81 
reject Ho 
(b) P-value = 0.0155 (from 
Minitab) 
9-101. (a) x3 = 769.57 >>> 36.42, 
reject Hp 
(b) P-value = 0 


Section 9-8 
9-103. x3 + 8.05.65 fail to reject Hp; 
P-value = 0.070 (from Minitab) 
9-105. x3 > X5.01.9 reject Ho; 
P-value = 0.002 (from Minitab) 
9-107. xi + X6.01.3» fail to reject Ho; 
P-value = 0.013 
9-109. (a) x5 > xXb05,1 
(b) P-value < 0.005 


Section 9-9 
9-111. (a) P-value = 2P(R* <7|p = 
0.5) = 0.132 
(b) 0.180 
9-113. (a) P-value = 2*P(R* = 3|p = 
0.5) ~ 1 
(b) z = 0, P-value ~ 1 
9-115. Ignore ties 
(a) w=9>w__, = 5, fail to 
reject Hy 
(b) 0.110 
9-117. Ignore ties 
(a) w =27<w. 


ay = 4l, 
7 0.05,n=17 
reject Hy 


Supplemental Exercises 
9-119. (a) 15 
(b) SE Mean = 1.1525, 
tf) = —1.449,0.1 < 
P-value < 0.2, 
95.874 <= pw = 100.786 
(c) fail to reject Hy 
(d) fail to reject Hp 
9-121. (a) ft) = 0.5694, 0.25 < 
P-value < 0.4 
(b) 10.726 S p S 14.222 


9-123. (a) n = 25, 0.9783; n = 100, 
0.9554; n = 400, 0.8599; 
n = 2500, 0.2119 
(b) n = 25, 0.3783; n = 100, 
0.2643; n = 400, 0.1056; 
n = 2500, 0.0009; signifi- 
cant when n = 2500. 
9-125. (a) 0.4522 (b) 0.4404 
(c) 0.3557 (d) 0.2981 
9-127. (a) 0.6406 (b) 0.2877 
(c) 0.0537 
9-129. (a) n = 100, 0.3632; n = 150, 
0.2119; n = 300, 0.0352 
(b) n = 100, 0.6293; n = 150, 
0.4522; n = 300, 0.1292 


(c) 1 (d) 24,5 
9-131. (a) d= 2,8 =0;d=3,B =0 
(b) 2 


(c) d=1,8 ~ 0.1; 

d=158 ~ 0.04;n=4 
9-133. x3 = 0.000009, reject Hy 
9-135. (a) 


Value 0 1 2 =3 


Observed 3 7 4 6 


Expected 3.6484 6.2128 5.2846 4.6954 


x5 = 0.88971, fail to reject Hy 
(b) P-value = 0.6409 (from 
Minitab) 
9-137. (a) normal distribution used 
because sample size is 
large, 7) = 6.12, fail to 


reject Hy 
(b) P-value ~ 1 
(c) 
Obs. Exp. 

Interval Frequency. Frequency. 

x = 45.50 9 75 
45.50 <x = 51.43 5 75 
51.43 <x = 55.87 7 75 
55.87 <x = 59.87 11 75 
59.87 <x <= 63.87 4 75 
63.87 <x = 68.31 9 TS 
68.31 <x S 74.24 8 TS 

x = 74.24 6 TS 


x5 = 5.06, fail to reject Hy 
9-139. (a) Ho: w = 0.635 vs. 
H,: p > 0.635 
(b) normal distribution used 
because sample size is 
large, Z = —5.31, fail to 
reject Hy 
(c) P-value ~ 1 
9-141. (a) t) = —6.10, P-value < 
0.001, reject Hp 
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(b) d = 4.54, Power = 1 
(c) d=2.27,n=5 
(e) From a normal probability 
plot, assumption is 
reasonable 
9-143. (a) t) = 0.37, fail to reject Hy 
(b) From a normal probability 
plot, assumption is 
reasonable 
(c) 0.25 < P-value < 0.4 
9-145. (a) xo = 58.81, reject Hy 
(b) P-value < 0.01 
9-147. (a) xi = 0.509, fail to reject Hy 
(b) x3 = 0.509, P-value < 
0.01, reject Ho 


CHAPTER 10 


Section 10-1 
10-1. (a) —1.96 <z = —0.9 < 1.96, 
do not reject Ho; 
P-value = 0.368 
(b) —9.79 = py — po = 3.59 
(c) Power = 0.14 
(d) ny = ny = 180 
10-3. (a) z = 0.937 < 2.325, 
do not reject Ho; 
P-value = 0.174 
(b) wi — po = —4.74 
(c) Power = 0.04 
(d) Use ny = ny = 339 
10-5. (a) z = —5.84 < 1.645, do 
not reject Hp; P-value = 1 
(b) pi — Po = 6.8 
(c) Power = 0.9988 
(d) The sample size is 
adequate 
(a) Z% = —7.25 < —1.645 
reject Hp; P-value = 0 
(b) 3.684 = py — po 
= —2.116 
(c) ny =n = 11 
10-9. (a) —5.83 S py, — po 


10-7. 


= —0.57; 

P-value = 0.0173 
(b) Yes 
(c) Power = 0.9616; n = 10 
(d) Normal 


Section 10-2 

10-11. (a) df = 26.45 ~ 26, 
fy — Po = —1.688, 
0.0025 < P-value < 0.005, 
one-sided 


740 


(b) reject Hy (c) Yes 
(d) 0.005 < P-value < 0.01, 
reject Hy 
10-13. (a) tf = —1.94 < —1.701 
0.025 < P-value < 0.05 
(b) py — pr = —0.196 
(c) Power = 0.95 
(d) n =n, =n = 21 
10-15. (a) —2.042 < t) = 0.230 
< 2.042, do not reject 
Ho; P-value > 0.80 
(b) —0.394 S py — py = 0.494 
10-17. (a) t) = —3.11 < —2.485, 
reject Hp 
(b) —5.688 = py — p2 
= —0.3122 
10-19. (a) Assumptions verified 
(b) tf = —2.83 < —2.101 
reject Hp; 0.010 < 
P-value < 0.020 
(c) —0.7495 = py) — py 
= —0.1105 
10-21. (a) t = —5.498 < —2.021 
reject Hp; P-value < 0.0010 
(b) ny = ny = 38 
10-23. (a) to = 3.03 > 2.056 reject Ho; 
0.005 < P-value < 0.010 
(b) to = 3.03 > 1.706, reject Ho 
10-25. (a) ft = 7.0 > 2.048, reject Hp; 
P-value = 0 
(b) 14.93 S py — po S 27.28 
(c)n>8 
10-27. (a) t = 2.82 > 2.326 reject Ho; 
P-value = 0.025 
(b) wi — Bo = 0.178 
10-29. (a) Normal 
(b) fo = 2.558 > 2.101 reject 
Ho; P-value ~ 0.02 
(c) 1.86 S py — po = 18.94 
(d) Power = 0.05 
(e) n=51 


Section 10-3 
10-31. (a) w) = 75 > Wooos = 51, fail 
to reject Hy 
(b) P-value = 2[1 — P(Z< 


0.58)] = 0.562 
10-33. (a) w, = 77 = Woo) = 78; 
reject Hy 
(b) P-value = 2[1 — P(Z< 
2.19)] = 0.034 


10-35. (a) Min (258, 207) > 
Wo.05 = 185, fail to reject Hy 
(b) P-value = 0.0155 
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Section 10-4 

10-37. (a) 0.1699 = py = 0.3776 
(b) t-test is appropriate. 

10-39. —727.46 S py = 2464.21 

10-41. (a) tf = 5.465 > 1.761 reject 


Ho; P-value = 0 
(b) 18.20 = pg 
10-43. (a) tf) = 8.387 > 1.833 
reject Ho 
(b) to = 3.45 > 1.833 reject Hy 
(c) Yes 


10-45. (a) Normal 
(b) —0.379 S py = 0.349 
(c)65n 
10-47. (a) P-value = P(R* = 7r* = 14| 
p = 0.5) = 0.0005, 
reject Hy 


Section 10-5 
10-49. (a) fo.255,10 = 1.59 
(b) fo.10,24,9 = 2.28 
(c) fo.05,8,15 = 2.64 
(d) fo75,5,10 = 0.529 
(€) fo.90,24,9 = 90.525 
(f) foosgis = 9.311 
10-51. fo = 0.805 > 0.166, fail to 
reject Hp; o,/o2 = 2.20 
10-53. (a) fy = 1.21 > 0.333, fail to 
reject Hp; 
oT 
0.403 < 1 <3.63 
02 
(b) Power = 0.65 
(c) n= 31 
10-55. (a) fo = 0.923 > 0.365, do not 
reject Hy 
(b) 0.3369 =" < 2.640 
07 


10-57. (a) 0.6004 < = < 1,428 
(b) 0.5468 = a < 1.5710 


= 1 
(c) 0.661 = w 
10-59. 0.4058 < fo = 1.78 < 2.46, do 
not reject Ho 
10-61. 0.248 < fy = 0.640 < 4.04, do 
not reject Ho; 
O71 
0.159 = — = 2.579 
G2 
10-63. 0.333 < fo = 1.35 < 3, do not 
reject Hp; 
2 
al 
0.45 = — = 4.05 


07 


10-65. (a) 0.248 < fy = 3.337 < 4.03, 
do not reject Ho 
(b) No 


Section 10-6 
10.67. (a) one-sided 
(b) z = 1.4012, 
P-value = 0.0806, 
(c) reject Hy at a = 0.10, fail 
to reject Hy at a = 0.05 
10-69. (a) z = 4.45 > 1.96 reject Ho; 
P-value ~ 0 
(b) 0.039 = p, — p2 = 0.1 
10-71. (a) z = 5.36 > 2.58 reject Ho; 
P-value ~ 0 


Supplemental Exercises 
10-73. (a) df = 38, tp = —0.988, 
0.2 < P-value < 0.5, 
—2.478 = wy — po 
= 0.758 
(b) one-sided 
(c) fail to reject Hy 
10-75. (a) normality, equality of vari- 
ance, and independence of 
the observations. 
(b) 1.40 = py, — po S 8.36 
(c) Yes 


2. 
(d) 0.1582 <2) < 5,157 
7) 


(e) No 
10-77. (a) to = 2.554 > 1.895, 
reject Hp 
(b) f% = 2.554 < 2.998, do not 
reject Ho 
(c) t = —1.986 < 1.895, do 
not reject Hp 
(d) t% = —1.986 < 2.998, do 
not reject Ho 
10-79. (a) z = 6.55 > 1.96, reject Hp 
(b) z = 6.55 > 2.58, reject Hy 
(c) zis so large 
10-81. (a) —0.0335 = p, — po 
= 0.0329 
(b) —0.0282 = p,; — pz 
= 0.0276 
(c) 95% CI: —0.0238 = p; — po 
= 0.0232 
90% CI:—0.0201 = p; — po 
= 0.0195 
10-83. (a) Yes 
(b) Yes if similar populations 


10-85. 


10-87. 


10-89. 


10-91. 


10-93. 


(a) 0.0987 = wy — py 
= 0.2813 

(b) 0.0812 S p; — po 
= 0.299 

(c) by — Bo = 0.2813 

(d) z = 3.42 > 1.96, reject Hp; 
P-value = 0.00062 


(e) ny =m =9 
(a) z = —5.36 < —2.58, 
reject Hp 


(b) conclusions are the same 

(c) n = 60 

(a) No 

(b) data appear normal with 
equal variances 

(c) It is more apparent the data 


follow normal distributions. 
2 


(a) 18.114 < 2Y < 294.35 
Ou 


(e) fo = 72.78 > 4.03, 
reject Hp 

(a) Normality appears valid. 

(b) 0.50 < P-value < 0.80, do 
not reject Hy 

(c) n = 30 

(a) It may not be assumed that 
eae 

(b) % = —2.74 < —2.131, 
reject Hp 

(c) Power = 0.95 

(d) n = 26 


Mind-Expanding Exercises 


10-99. (c) 0.519 < 6 < 3.887 
CHAPTER 11 
Section 11-2 
11-1. (a) Bo = 48.013, B; = —2.330 
(b) 37.99 (c) 39.39 (d) 6.71 
11-3. (a) Bo = 14.195, B; = 10.092, 
o* = 27.24 
(b) 89.95 (c) —10.092 
(d) 0.99 (e) —7.98, 3.13 
11-5. (a) By = —6.3355, 
B, = 9.20836, o? = 3.7746 
(b) 500.124 (c) 9.20836 
(d) —1.618 
11-7. (a) By = —16.5093, 
B, = 0.0693554, 
o* = 7.3212 
(b) 1.39592 (c) 49.38 
11-9. (b) Bo = 234.071, 


B, = —3.50856, 
o” = 398.25 


11-11. 


11-13. 


11-15. 


11-17. 


11-19. 


(c) 128.814 
(d) 156.833 and 15.1175 
(b) Bo = 2625.39, 

B, = 36.962, o7 = 9811.2 


(c) 1886.15 

(a) Bo = 0.658, 8, = 0.178, 
o” = 0.083 

(b) 3.328 (c) 0.534 

(d) 1.726 and 0.174 

(b) Bo = 2.02, B; = 0.0287, 
o” = 0.0253 

(a) $ = 39.2 — 0.0025x 

(b) 8, = —0.0025 

(b) Bo = 2132.41, 
B; = —36.9618 


Section 11-4 


11-21. 


11-23. 


11-25. 


11-27. 


11-29. 


11-31. 


(a) fo = 12.4853, P-value < 
0.001; t; = 20.387, 
P-value < 0.001; MS; = 
2.194; fy = 415.91, 
P-value © 0 

(b) reject Hy: B = 0 

(c) 2.194 

(a) fo = 74.63, P-value = 
0.000002, reject Hp 

(b) 6? = 27.2, 
se(B,) = 0.2696 

(c) se(Bo) = 0.9043 

(a) fo = 61.41, P-value = 0, 
reject Ho 

(b) se(B,) = 1.288 
se(Bo) = 9.059 

(c) ft = 0.072, fail to reject Ho 

(a) fo = 74334.4, P-value ~ 0, 
reject Hp 

(b) se(Bo) = 1.66765, 
se(B,) = 0.0337744 

(c) tf = —23.37, P-value = 
0.000, reject Ho 

(d) to = —3.8, P-value < 
0.005, reject Hp 

(a) fo = 44.0279, P-value = 
0.00004, reject Ho 

(b) se(Bo) = 9.84346, 
se(B,) = 0.0104524 

(c) tf) = —1.67718, P-value = 
0.12166, fail to reject Hy 

(a) fo = 155.2, P-value < 
0.00001, reject Ho 

(b) se(Bo) = 2.96681, 
se(B,) = 45.3468 

(c) fo = —2.3466, P-value = 
0.0306, fail to reject Ho 


11-33. 


11-35. 


11-37. 
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(d) ft = 57.8957, P-value < 
0.00001, reject Hp 

(e) to = 2.7651, P-value = 
0.0064, reject Ho 

(a) P-Value = 0.0000, 
reject Hp 

(b) 6? = 0.083 
se(Bo) = 0.1657, 
se(B,) = 0.014 

(c) Reject Ho 

(a) P-value = 0.310, No 

(b) 6? = 30.69 

(c) se(Bo) = 9.141 

0.55 


Sections 11-5 and 11-6 


11-39. 


11-41. 


11-43. 


11-45. 


11-47. 


11-49. 


11-51. 


(a) (—2.9713, —1.7423) 
(b) (46.7145, 49.3115) 

(c) (41.3293, 43.0477) 

(d) (39.1275, 45.2513) 

(a) [10.02, 15.28] 

(b) [—4.30, 32.69] 

(c) [85.59, 104.27] 

(d) (62.77, 127.10] 

(a) (9.10130, 9.31543) 

(b) (—11.6219, —1.04911) 
(c) (498.72024, 501.52776) 
(d) (495.57344, 504.67456) 
(a) (0.03689, 0.10183) 

(b) (—47.0877, 14.0691) 
(c) (44.0897, 49.1185) 

(d) (37.8298, 55.3784) 

(a) (201.552, 266.590) 

(b) (—4.67015, —2.34696) 
(c) (111.8339, 145.7941) 
(a) (—43.1964, —30.7272) 
(b) (2530.09, 2720.68) 

(c) (1823.7833, 1948.5247) 
(d) (1668.9013, 2103.4067) 
(a) (0.1325, 0.2235) 

(b) (0.119, 1.196) 

(c) (1.87, 2.29) 


Section 11-7 


11-53. 
11-55. 
11-57. 
11-59. 
11-61. 


11-65. 


(a) R* = 67.2% 

(a) R* = 99.986 % 

(a) R* = 87.94% 

(a) R? = 85.22% 

(a) R* = 89.6081% 

(c) R* = 95.73% 

(d) 6? old = 9811.21, 
6 new = 4022.93 

(a) fo = 207, reject Ho 
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Section 11-8 


11-67. 


11-69. 


11-71. 


11-73. 


11-75. 


(a) to = 4.81, P-value < 
0.0005, reject Hy 

(b) z = 1.747, P-value = 
0.04, reject Ho 

(c) p = 2.26, reject Ho 

(a) to = 5.475, P-value = 
0.000, reject Ho 

(b) (0.3358, 0.8007) 

(c) Yes 

(a) § = —0.0280411 + 
0.990987x 

(b) fo = 79.838, reject Ho 

(c) 0.903 

(d) to = 8.9345, reject Ho 

(e) % = 3.879, reject Ho 

(f) (0.7677, 0.9615) 

(a) y = 5.50 + 6.73x 

(b) 0.948 

(c) t) = 8.425, reject Hy 

(d) (0.7898, 0.9879) 

(a) r= 0.82 

(b) ft) = 7.85, reject Ho, 
P-value < 0.005 

(c) (0.664, 0.908) 

(d) zp = 1.56, fail to reject Ho, 
P-value = 0.119 


Section 11-9 


11-77. 


11-79. 


(a) Yes (b) No (c) Yes 
(d) Yes 
(b) y = —0.8819 + 0.00385x 


(c) fo = 122.03, reject Hp 


Section 11-10 


11-81. 


11-83. 


11-85. 


(a) Bo = 5.340, 
6, = —0.00155 

(b) Test that all slopes zero: 
P-value ~ 0 


(a) By = —7.047, 
8, = —0.00074, 
6B. = —0.9879 


(b) Test that all slopes zero: 
P-value = 0.036 

(c) 0.771 

(b) ¥ = —0.966824 + 1.54376x 

(c) fo = 252263.9, P-value ~ 
0, reject Hy 

(d) [1.5367, 1.5509] 

(e) to = —199.34, reject Ho 


Supplemental Exercises 


11-87. 


y* = 1.2232 + 0.5075x where 
y* = Vy 


11-89. 
11-91. 


11-93. 


11-95. 


3 = 0.7916x 

(b) } = 0.6714 + —2964x 

(c) R? = 21.5% 

(b) y = —0.699 + 1.66x 

(c) fo = 22.75, reject Ho, 
P-value = 0.001 

(d) (3.399, 5.114) 

(c) all data: (7741.74, 10956.26), 
outlier removed: (8345.22, 
11272.79) 


CHAPTER 12 
Section 12-1 


12-1. 


12-3. 
12-5. 


12-7. 


12-9, 


12-11. 


12-13. 


(b) y = 171.055 + 3.714x, 
— 1.126x, 
(c) 189.49 
(b) 2 
(a) Satisfaction = 144 — 1.11 
Age — 0.585 Severity + 1.30 
Anxiety 
(b) 49.5 
(c) 5.90, 0.13, 0.13, 1.06 
(d) No, standard errors differ 
(a) = 49.90 — 0.01045x, 
— 0.0012x, — 0.00324x,; 
+ 0.292x4 — 3.855x5 
+ UTR 
(b) G = 4.965 
) = 19.67, 
) = 0.02338, 
) = 0.01631, 
) = 0.0009459, 
4) = 1.765, 
) 
) 
7 


(c) 29.86 
(a) $ = 47.82 — 9.604x, 

+ 0.44152x, + 18.294x, 
(b) 6? = 12.3 


(d) 91.3 

(a) § = —440.39 + 19.147x, 

+ 68.080x5 

6? = 55563 se(Bo) = 94.20, 

se(B,) = 3.460, and 

se(Bo) = 5.241 

(c) 186.675 

(a) § = —0.1105 + 0.4072x, 
+ 2.108x> 


(b) 


12-15. 


12-17. 


12-19. 


(b) 6? = 0.00008 
se(Bo) = 0.2501, 
se(B,) = 0.1682, and 

se(B) = 5.834 
(c) 0.97074 
(a) $ = 238.56 + 0.3339x, 


= 
io) 
~— 
Rous 
ek & 8 
ton 


(d) 484.0 — 7.656xy 
222x) — 0.004149 


147.0, 


<a 


ll 


a 


(e) 


Section 12-2 


12-21. 


12-23. 


(a) fo = 53.0872, P-value ~ 0, 
= 15.02, P-value ~ 0; 
tp = —23.43, P-value ~ 0; 


MS, = 25.5833; fo = 445.2899, 
P-value ~ 0 
(b) fo = 445.2899, reject Hy 


(c) t-test for each regressor is 
significant 
(a) fo =184.25, 
P-value = 0.000, Reject Ho 
(b) t(8,) = 16.21, 
P-value < 0.001, reject Hp 
to(B2) = —11.04, 
P-value < 0.001, 
reject Ho 


12-25. 


12-27. 


12-29, 


12-31. 


12-33. 


12-35. 


12-37. 


12-39. 


(a) P-value = 8.16 E-5, 3.82 
E-8, and 0.3378 

(b) to = 0.98, fail to reject Hy 

(a) fo = 19.53, reject Ho 

(a) fo = 828.31, reject Ho 

(b) fo = —2.58, reject Ho 
ty = 1.84, fail to reject Ho 
to = 13.82, reject Hp 

(a) fo = 99.67, reject Hp, 
P-value ~ 0 

(b) t = 5.539, reject Ho 
to = 12.99, reject Hp 

(a) fo = 9.28, reject Ho, 
P-value = 0.015 

(b) ty = 0.49, fail to reject Hy 
ty = —3.94, reject Ho 

(a) fo = 36.59, reject Hp 

(b) t = 12.30, reject Ho 
to = 11.94, reject Hp 
to = —7.84, reject Ho 

(c) fo = 142.66, reject Ho 

(a) fo = 97.59, reject Ho, 
P-value = 0.002 

(b) to = —6.42, reject Ho 
to = —2.57, fail to 
reject Hp 

(c) fo = 6.629, fail to reject Ho 

(d) fo = 7.714, fail to reject Ho 

(e) fo = 1.11, fail to reject Ho 

(f) 147.0 

(a) fo = 65.55, P-value ~ 0, 
reject Hy 

(b) Age: t, = —8.40, 
Severity: t, = —4.43, 
Anxiety: f, = 1.23, not 
all necessary 


ll 


Sections 12-3 and 12-4 


12-41. 


12-43. 


12-45. 


(a) (49.927 < By < 292.183) 
(0.033 < B, < 7.393) 
(—2.765 < B, < 0.513) 

(b) (158.82, 220.13) 

(c) (126.06, 252.88) 

(a) (—20.477 < B, < 1.269) 
(—0.245 < B, < 1.076) 
(14.428 < B; < 22.159) 

(b) (77.582, 105.162) 

(c) (82.133, 100.611) 

(a) (—6.9467 < B, < —0.3295) 
(—0.3651 < B, < 0.1417) 

(b) (—45.8276 < B, < 30.5156) 
(—1.3426 < B, < 0.8984) 
(—0.03433 < B; < 0.04251) 


12-47. 


12-49. 


12-51. 


12-53. 


12-55. 


(a) (12.1363 < B, < 26.1577) 
(57.4607 < B, < 78.6993) 

(b) (—233.4, 63.2) 

(c) (—742.09, 571.89) 

(a) (0.0943 < B, < 0.7201) 
(—8.743 < B) < 12.959) 

(b) (0.861, 0.896) 

(c) (0.855, 0.903) 

(a) (—2.173 < B, <2.841) 
(—5.270 < B, < —0.164) 

(b) (—36.7, 125.8) 

(c) (—112.8, 202.0) 

(d) CI: (107.4, 267.2) 

PI: (30.7, 344.0) 

(a) —10.18 < By < 16.62 
(1.00 < B; < 1.45) 

(3.85 < B, <5.00) 
(=5.11 < Bip =< — 307) 

(b) 0.3877 

(c) (81.3, 82.9) 

(a) [—9.052, 15.024], [0.999, 
1.398], [3.807, 5.384], 
[—4.808, —2.817] 

(b) 0.43 

(c) [80.965, 82.725] 


Section 12-5 


12-57. 
12-59. 
12-61. 
12-63. 
12-65. 
12-67. 
12-69. 
12-71. 


(a) 0.893 

(a) 0.95 

(a) 0.843 

(a) 0.997 

(a) 0.756 

(a) 0.985 (b) 0.99 
(b) 0.9937, 0.9925 
(a) 0.12 (b) Yes 


Section 12-6 


12-73. 


12-75. 


12-77. 


(a) ¥ = —1.633 + 1.232x 
—1.495° 

(b) fo = 1,858,613, reject Ho 

(c) t9 = —601.64, reject Ho 

(a) ~ = —1.769 + 0.421x, 

+ 0.222x, — 0.128x3 

— 0.02x). + 0.009x;3 

+ 0.003x23 — 0.019x7 
— 0.007x3 + 0.001x3 

(b) fo = 19.628, reject Ho 

(d) Jo = 1.612, fail to 
reject Hy 

(b) ¥ = 56.677 — 0.1457x, 
— 0.00525x. — 0.138x3 
— 4.179x4 


12-79. 


12-81. 


12-83. 


12-85. 


12-87. 


12-89. 
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(a) Min C,: x1, x 
C, = 3.0, MSg = 55563.92 
¥ = —440.39 + 19.147x, 
+ 68.080x> 
Min MS; is same as Min C, 

(b) Same as part (a) 

(c) Same as part (a) 

(d) Same as part (a) 

(e) All models are the same 

(a) Min C;: x 
C, = 1.1, MSz = 0.0000705 
y = —0.20052 
+ 0.467864x, 
Min MS; is same as Min C, 

(b) Same as part (a) 

(c) Same as part (a) 

(d) Same as part (a) 

(e) All models are the same 

(a) Min C,: xy 
C, = 1.2, MSp = 1178.55 
¥ = 253.06 — 2.5453x, 
Min MS; is same as Min C, 

(b) Same as part (a) 

(c) Same as part (a) 

(d) Same as part (a) 

(e) All models are the same 

(a) Min C,: x1, x2 
C, = 2.9, MSp = 10.49 
3 = —50.4 + 0.671%, 
+ 1.30x2 
Min MS; is same as 
Min C, 

(b) Same as part (a) 

(c) Same as part (a) 

(d) Same as part (a) 

(e) All models are the same 

(a) Min. MS; (MS; = 0.01858) 
model Att, PctComp, Yds, 
YdsperAtt, TD, PctTD, 
PctInt; Min. Cp (Cp = 5.3) 
model PctComp, YdsperAtt, 
PctTD, Pctint; 

(b) PctComp, YdsperAtt, 
PctTD, PctInt 

(c) PctComp, YdsperAtt, 
PctTD, PctInt 

(d) Att, PctComp, Yds, 
YdsperAtt, TD, PctInt 

(a) Min C, = 1.3: 
¥ = 61.001 — 0.02076X ia 
— 0.00354x.n, — 3.45 7Xaxle 
Min MS; = 4.0228 
¥ = 49.5 — 0.017547 xX.i¢ 


744 


— 0.003425 2X, + 1.29Xcmp 
= 3.18 4X arte as 0.0096x,92 
(b) ¥ = 63.31 — 0.0178x.i¢ 
— 0.00375 Xen, — 3-3Xaxte 
— 0.0084x.9 
(c) Same as Min MSE model in 
part (a) 
(d) ~ = 45.18 — 0.00321x,,, 
=>, 4 AXarle so 0.385 Xp/y 
(e) Min C, model is preferred 
(f) Min C, = 4.0, 
Min MS; = 2.267 
¥ = 10 — 0.0038023xX¢n, 
+ 3.936X cmp + 15.216X¢, 
— 0.011118x,92 — 7.401 Xpans 
+ 3.6131 X¢y1 + 2.342x¢r 2 
Stepwise: 
¥ = 39.12 — 0.0044x.4, 
+ 0.271 Xpiy — 4.5Xns 
+ 3.2X a1 + 1.7 Xan 
Forward selection: 
¥ = 41.12 — 0.00377 Xen, 
+ 0.336X py — 2-1 Xaxte 
— 3.4Xrans + 2.1 Xavt 
+ Xan 
Backward elimination is same 
as Min C, and Min MS; 
12-91. (a) } = —0.304 + 0.083x, 
— 0.031x; + 0.00425 
C, = 4.04 MS; = 0.004 
(b) ¥ = —0.256 + 0.078x, 
+ 0.022x, — 0.042x,; 
+ 0.0008x3 
C, = 4.66 MS; = 0.004 
(c) Prefer the model in part (a) 


Supplemental Exercises 


12-95. (a) 2 
(b) 0.0666, 0.0455 
(c) 6.685 
12-97. (a) fy = 1321.39, reject Ho, 


P-value < 0.00001 
(b) tf = —1.45, fail to reject Hp 
ty = 19.95, reject Ho 
to = 2.53, fail to reject Ho 
12-99. (a) § = 4.87 + 6.12x; — 6.53x5 
— 3.56x; — 1.44x; 
(b) fo = 21.79, reject Ho 
to = 5.76, reject Hy 
ty = —5.96, reject Ho 
to = —2.90, reject Ho 
to = —4.99, reject Ho 
12-101. (a) $° = 21.068 — 1.404x3 
+ 0.0055x4 + 0.000418x5 


12-103. 


12-105. 
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MS; = 0.013156 
C, = 4.0 
(b) Same as part (a) 
(c) x4, x5 with C, = 4.1 and 
MS; = 0.0134 
(d) The part (c) model is 
preferable 
(e) Yes 
(a) ¥ = 300.0 + 0.85x, 
+ 10.4%), § = 405.8 
(b) fo = 55.37, reject Hy 
(c) 0.9022 
(d) MS; = 10.65 
(e) fo = 0.291, fail to 
reject Hp 
(a) fo = 18.28, reject Ho 
(b) fo = 2, do not reject Hy 
(c) MS;(reduced) = 0.005 
MS, (Full) = 0.004 


Mind-Expanding Exercises 


12-109. 


R? = 0.449 


CHAPTER 13 
Section 13-2 


13-1. 


13-3. 


13-5. 


13-7. 


13-9, 


13-11. 


13-13. 


13-15. 


13-27. 


(a) 4 

(b) 5 

(c) fo = 1.58, 0.1 < P-value 
< 0.25 

(d) fail to reject Hp 

(a) fo = 14.76, reject Ho 

(a) fo = 12.73, reject Ho 

(b) P-value = 0 

(a) fo = 16.35, reject Ho 

(c) 95%: (140.71, 149.29) 
99%: (7.36, 24.14) 

(a) fo = 1.86, fail to 
reject Hp 

(b) P-value = 0.214 

(a) fo = 8.30, reject Ho 

(b) P-value = 0.002 

(d) (69.17, 81.83) 

(e) (8.42, 26.33) 

(a) fo = 0.72, fail to 
reject Hy 

(b) P-value = 0.486, 
SSp = 0.146 

(a) fo = 2.62, fail to 
reject Hp 

(b) P-value = 0.083 

n=3 


Section 13-3 
13-29. (a) fo = 5.77, reject Ho 
(b) 0.01412 
(c) 0.0148 
(a) fo = 0.75, fail to reject Ho 
(b) 0 (c) 24 
(a) Hy: 02 = 0A): 02 #0 
(c) Set equal sample sizes, 
o* = 0.164, 02 = 0 


13-31. 


13-33. 


Section 13-4 
13-35. fy = 147.35, P-value ~ 0, reject 
Hy at a = 0.05 or a = 0.01 
13-37. (a) fo = 8.92, reject Ho 
13-39. (a) fo = 3.00, fail to reject Ho 
13-41. (a) fo = 1.61, fail to reject Hp 
13-43. (a) 3. (b)3 
(c) fo = 23.15, P-value < 0.01 
(d) reject Hy 


Supplemental Exercises 
13-45. (a) fo = 76.09, reject Ho 
(c) (132.97, 147.83) 
13-47. (a) fo = 7.84, reject Ho 
(b) P-value = 0.007 
13-49. (a) fo = 6.23, reject Ho 
(c) Algorithm 5 
13-51. (a) Power = 0.2 (b) n = 50 


CHAPTER 14 


Section 14-3 
14-1. (a) 1. Ap: tT) = 1% =0 
H: at least one T; # 0 
> Bi = By = Bs = 0 
Hy, : at least one B; # 0 
(b) f(G) = 273.79, 
f(P) = 8.84, 
f(GP) = 1.26 reject Hy 
for only main effects 
(a) f(M) = 7.91, 
f(T) = 28.97, 
(MT) = 3.56 reject Hy 
for both main effects and 
the interaction 
(a) Yi = w+ 7) + By + (TB)y 
i= 1,2,3 
j= 1,2,3,4 
k = 1,2,3,4,5,6 
(b) f(Z) = 40.07, 
f(T) = 0.32, fUT) = 1.70 
reject Hp for only insulation 


14-3. 


14-5. 


+ Six 


14-7. (a) f(D) = 25.23, 
f(A) = 543.52, 
f(DA) = 3.54 
reject Hp for both main 
effects and the interaction 


14-9. (—3.40, 7.64) 


Section 14-4 


14-11. (a) f(H) = 7.64, 
f(C) = 31.66, 
f(F) = 19.92, 
f(HC) = 2.92, 
f(HF) = 2.97, 
f(CF) = 0.96 


H, C, F, HF are significant 
at a = 0.05. The P-value 
for HC is 0.075. 


Section 14-5 
14-13. Significant effects: 
A = 17.00, C = 10.875, 
D = 8.375, AD = 9.125 
14-15. (a) Cleaning Method = —5.93 
Test Position = —1.280 
Clean*test = —1.220 
(b) Cleaning Method is the only 
significant factor 
14-17. (a) Significant effects: 
A = 11.8125, 
B = 33.9375, C = 9.6875, 
AB = 7.9375 
14-19. None of the effects are 
significant 
14-21. (b) A, Band AB 
(c) y = 400 + 40.124x, 
— 32.75x, + 26.625xx» 
14-23. fo = 5.11, do not reject Hp 
14-25. (a) For model with A, B, C, AB, 
s = 2.92; 
s (center pts) = 2.86 
(b) Fo = 192.71, curvature is 
significant 
14-27. (a) Large effects: C = —39.79, 
D = —198.47, E = —64.86 


(b) y = 546.90 — 39.79x, 
198.47x, — 64.86x, 
(d) none 


14-29. (a) Large effects: B = —0.000750 
(b) with all effects in error except 
B the P-value = 0.007 


Section 14-6 


14-31. Significant effects: 
A = 15.625, C = 10.625, 


D = 8.875, AD = 8.875, 
CD = —3.125, ACD = 1.875 
14-37. (a) Effect JED is confounded 
with blocks 
(b) Marginal significant effects: 
J = 19.0, D = 14.75, 
JD = —18.0, 
ED = —2555, 
14-39. (a) Blocking important, 
SSpiocks large relative 
to SSéror 
(b) ABC, ABD, ACD, 
and BCD 
(c) Coefficient for AD = 15.14, 
t-statistic for AD = 1.525, 
dfBlocks = 1, MS for inter- 
actions = 1165.33 


Section 14-7 


14-41. (a) 2°7! 
(b) A = 10.8750, 
B = 33.6250, 
C = 10.6250, 
D = —0.6250, 
E = 0.3750, 


AB + CDE = 7.1250, 
14-43. (a) E = —ABCD 
(b) Resolution V 
(c) E = —0.4700, 
BE = —0.4050, 
DE = —0.3150 
14-45. (c) A = 1.435, B = —1.465, 
D = 4,545, AB = 1.15, 
AD = ~—1.23 
14-47. (b) Design Generator: D = BE, 
E=AC 
Defining Relation: 
I = ACE = BDE 
= ABCDE 
Aliases 
A = CE = BCDE = ABDE 
B = DE = ACDE = ABCE 
C = AE = ABDE = BCDE 
D = BE = ABCE = ACDE 
E=AC = BD = ABCD 
(c) A = —1.525, B = —5.175, 
C = 2.275, D = —0.675, 
E = 2.275 
14-49. (a) Alias Structure: 
I+ ABD + ACE + BCF 
+ DEF + ABEF + ACDF 


+ BCDE 
A+ BD+ CE 
B+AD+ CF 
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C+ AE + BF 
D+ AB+ EF 
E+AC+ DF 
F+BC+DE 


AF + BE+ CD 
(b) Alias Structure: J + ABCG 


ABDH + ABEF 
ACDF + ACEH 
ADEG + AFGH 
BCDE + BCFH 
BDFG + BEGH 
+ CDGH + CEFG 
+ DEFH 

A 

B 

Cc 

D 

E 

F 

G 

H 

AB + CG + DH + EF 

AC + BG + DF + EH 

AD + BH + CF + EG 

AE + BF + CH + DG 

AF + BE + CD + GH 


AG + BC+ DE + FH 
AH + BD + CE + FG 
14-51. (a) Generators are EF = BCD, 

F =ACD, G = ABC, and 
H =ABD,1I= BCDE 
ACDF = ABEF = ABCG 
ADEG = BDFG = CEFG 
ABHD = ACEH = BCFH 
DEFH = CDGH = 
BEGH = AFGH 

(b) Glassware = 1.4497, 
Reagent = —0.8624, Prep = 
0.6034, Tracer = 0.6519, 
Dissolution = —0.8052, 
Hood = 1.3864, Chemistry = 
0.0591, Ashing = —0.0129 


Section 14-8 
14-53. (b) $ = 82.024 — 1.115x; 
— 2,.408x, + 0.86 1x7 
— 1.5933 — 1.801x, x 
14-55. Path of steepest ascent passes 
through the point (0, 0) and has 
a slope —0.8/1.5 = —0.533 
14-57. (a) Central composite design, 
not rotatable 
(b) ¥ = 150.04 — 58.47x, 
+ 3.35x) — 6.53x7 
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+ 10.58x5 + 0.50x, x, (b) f(N) = 311.71, 

The linear terms are signifi- f(P) = 119.17, 

cant (p = 0.001), while f(PN) = 92.94 

both the square terms and (d) 6 = 1.22 

interaction terms are 14-77. (a) Generators are E = ABC, 

insignificant F = ABD, and G=ACD; 
14-59. (a) along the vector (1.2, —2.1, I = ABCE = ABDF 

1.6, —0.6) CDEF = ACDG = BDEG 

(b) (1.22, 2.13, 1.62, 0.61) = BCFG = AEFG 
(b) A = —74.9, B = 76.1, 

Supplemental Exercises C = —366.4, D = 236.9, 


14-61. (a) t(p) = 2.54, t(pc) = 5.02 


E= —213.4, F = 119.9, 
G = 101.9 
14-79. (a) Block 1: (1), be, ac, ab; 
Block 2: a, b, c, abc 
(b) A = —0.500, B = 13.000, 
C = 6.500, AB = —2.000, 


The main effect of pH and 
the interaction of pH and 
Catalyst Concentration 
(CC) are significant 


eee ie _ ees AC = —14.500, BC = —5.000 
_ : 14-81. (a) A: 4 levels, B: 3 levels 
TES) 2? (b) 1 (c) AB interaction not 
L, S, LS are significant ee 
14-65. (a) A = —2.74, B = —6.66, significant (d) dfError = 6, 
C = 3.49, AB = —8.71, SSz = 34670882, MS; = 
AC = 7.04, BC = 11.46, 29736583, fy = 1.36 
ABC = —6.49, not 
significant 
14-67. (a) V = —15.75, P = 10.75, CHAPTER 15 
G = —25.00, Section 15-3 
PG = —19.25, 15-1. (a) x chart: UCL = 242.78, 
(b) § = 102.75 — 7.87x, CL = 223, LCL = 203.22, 
+ 5.37x3 — 12.50x4 R chart: UCL = 72.51, 
_ 9.63x34 CL = 34.286, LCL = 0 
14-69. V = 3.25, F = —10.25, (b) fp = 2236 = 14.74 
P = —10.25, G = —23.75, 15-3. (a) Xchart: UCL = 245.11, 
PG = —11.75 CL = 223, LCL = 200.89 
14-71. Design Generators: D = AB S chart: UCL = 30.77, 
E=AC CL = 13.58, LCL = 0 
Alias Structure (b) f& = 223, 6 = 14.74 
I+ ABD + ACE + BCDE 15-5. (a) x chart: UCL = 4.930, 
A+ BD + CE + ABCDE CL = 4.668, LCL = 4.406, 
B+AD+ CDE + ABCE R chart: UCL = 0.961, 
C+ AE + BDE + ABCD CL = 0.454, LCL = 0 
D+ AB + BCE + ACDE (b) x chart: UCL = 4.910, 
E+ AC + BCD + ABDE CL = 4.668, LCL = 4.425, 
BC + DE + ABE + ACD S chart: UCL = 0.355, 
BE + CD + ABC + ADE CL = 0.17, LCL =0 
14-73. (a) 2” factorial with two 15-7. (a) x chart: UCL = 37.5789, 
replicates CL = 34.32, 
(b) Significant effects: LCL = 31.0611 
x; = 0.795, x. = —1.160 R chart: UCL = 11.9461, 
(c) ¥ = 0.1994 + 0.07688, CL = 5.65, LCL = 0 
14-75. (a) Ho: 7; = 1 =73 =0 (b) x chart: UCL = 37.4038, 
Hy: By = Bo = B3 = 0 CL = 34.0947, 
Ho: (TB) =... LCL = 30.7857 
= (7B)33 = R chart: UCL = 12.1297, 


CL = 5.73684, LCL = 0 
G = 2.4664 
15-9. (a) xchart: UCL = 17.4, 
CL = 15.09, LCL = 12.79 
Rchart: UCL = 5.792, 
CL = 2.25, LCL = 0 
(b) x chart: UCL = 17.96, 
CL = 15.78, LCL = 13.62 
Rchart: UCL = 5.453, 
CL = 2.118, LCL = 0 
xX chart: UCL = 17.42, 
CL = 15.09, LCL = 12.77 
S chart: UCL = 3.051, 
CL = 1.188, LCL = 0 
Revised 
x chart: UCL = 17.95, 
CL = 15.78, LCL = 13.62 
S chart: UCL = 2.848, 
CL = 1.109, LCL = 0 
15-11. (a) x chart: UCL = 0.0635, 
CL = 0.0629, 
LCL = 0.0624 
Rchart: UCL = 0.0020, 
CL = 0.0009, 
LCL = 0 
(b) x chart: UCL = 0.0634, 
CL = 0.0630, 
LCL = 0.0626 
Rchart: UCL = 0.0014, 
CL = 0.0007, LCL = 0 
(c) x chart: UCL = 0.0634, 
CL = 0.0630, 
LCL = 0.0626 
S chart: UCL = 0.00058, 
CL = 0.00027, LCL = 0 
Revised: 
x chart: UCL = 0.0633, 
CL = 0.0630, 
LCL = 0.0626 
S chart: UCL = 0.00058, 
CL = 0.00028, LCL = 0 
15-13. (a) 2.73 (b) no 
15-15. s = 2.956, r/d, = 1.251 


(c 


YV 


Section 15-4 
15-17. (a) Individual chart: 


UCL = 60.889, 
CL = 53.05, 
LCL = 45.211 


MR chart: UCL = 9.634, 
CL = 2.94737, LCL = 0 
The process appears to be 
in control. 

(b) fp = 53.05, 6 = 2.613 


15-19. (a) Individual chart: 
UCL = 10.5358, 
CL = 10.0272, 
LCL = 9.5186 
MR chart: 
UCL = 0.625123, 
CL = 0.19125, LCL = 0 
The process appears to be 
in control. 

(b) p& = 10.0272, 6 = 0.1696 

15-21. (a) Initial study: Individual 
chart: UCL = 130.5, 
CL = 100.78, LCL = 71.06 
MR chart: UCL = 36.51, 
CL = 11.18, LCL = 0 
Revised: Individual chart 
UCL = 127.08, 
CL = 99.48, LCL = 71.88 
MR chart: UCL = 33.91, 
CL = 10.38, LCL = 0 

(b) pp = 99.4792, ¢ = 9.20059 


15-23. (a) X chart: UCL = 116.43, 
CL = 82.0, LCL = 47.57, 
Rchart: UCL = 42.30, 
CL = 12.95, LCL = 0; 
in control 
(b) 82.0, 11.48 


Section 15-5 
15-25. (a) 1.3333 (b) 26 
15-27. (a) PC = PCRx, = 1.5 (b) 0 
15-29. Proportion nonconforming is 
0.00779 
PCR = 0.905 
PCRx = 0.837 
15-31. 0.0009 
PCR = 1.13 
PCRx = 1.06 
15-33. (a) PCR = 1.35 
PCRx = 1.217 
(b) 0.00013 
15-37. (a) Fraction defective = 0.002, 
PCR = 1.03, PCR, = 1.03 
(b) Fraction defective = 0.057, 
PCR = 1.03, PCR, = 0.526 
15-39. (a) 0.03 
(b) PCR = 1.11, PCR, = 0.778 


Section 15-6 

15-41. (a) not in control 
UCL = 0.0835, 
CL = 0.0645, 
LCL = 0.0455 


(b) Revised P-chart: 
UCL = 0.1252, 
CL = 0.0561, LCL = 0 
15-43. (a) P chart: UCL = 0.1986, 
CL = 0.1506, 
LCL = 0.1026 
(b) Revised P chart: 
UCL = 0.2062, 
CL = 0.1573, 
LCL = 0.1085 
15-45. (a) The limits need to be 
revised. UCL = 3.811, 
CL = 1.942, LCL = 0.072, 
sample 5 and 24 exceed 
limits 
(b) U chart: UCL = 3.463, 
CL = 1.709, LCL = 0 
15-47. (a) UCL = 0.3528, 
CL = 0.2598, LCL = 0.1667, 
not in control 
(b) UCL = 0.6694, CL = 
0.5195, LCL = 0.3696, not 
in control, points 4, 5, 10, 
12, 18, 20 exceed the con- 
trol limits 


Section 15-7 
15-49. (a) 4 (b) 0.0228. 


(c) 43.8596 

15-51. (a) 0.2177. (b) ARL = 4.6 
15-53. (a) 0.1515 (b) ARL = 6.6 
15-55. (a) 0.1020 (b) ARL = 9.8 


15-57. (a) 0.2877 (b) ARL = 3.48 
15-59. (a) 5.196 (b) 0.01 (c) 102.04 


Section 15-8 
15-61. (a) h=4,k = 05, 
UCL = 3.875, 
LCL = —3.875 
Yes, this process is in-control. 
(b) Observation 20 is out of 
control, CUSUM = 6.08 
15-63. (a) ¢ = 0.1736 
(b) h=4,k =05, 
UCL = 0.0678, 
LCL = —0.0678, out of 
control at the specified 
target level 
15-65. (a) ARL = 38.0 
(b) ARL = 10.4 
15-67. (a) 0.169548 
(b) The process appears to be 
in control. UCL = 10.17, 
CL = 10, LCL = 9.83 
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(c) Out of control at observa- 
tion 13, UCL = 10.29, 
CL = 10, LCL = 9.71 
15-69. (a) prefer A = 0.1 and L = 2.81 
(b) prefer A = 0.5 and 
L = 3.07 
(c) 9 
15-71. (a) UCL = 12, LCL = —12, not 
in control 
(b) shift from 70 to 80: 2.01 < 
ARL < 2.57 


Supplemental Exercises 


15-73. (a) x chart: UCL = 64.019, 
CL = 64, LCL = 63.982 
R chart: UCL = 0.046, 
CL = 0.018, LCL = 0 
(b) p = 64,6 = 0.011 
(c) PCR = 0.63 
(d) PCR, = 0.63 
(e) 0? = (0.0033)? = 0.000011 
(f) ARL = 12.9 
15-75. (a) The process appears to be 
in control. 
(b) Pchart: UCL = 0.1717, 
CL = 0.1063, 
LCL = 0.04093 
Individual chart: 
UCL = 60.327362, 
CL = 60.32641, 
LCL = 60.325458 
MR chart: 
UCL = 0.001169, 
CL = 0.000358, LCL = 0 
(b) Individual chart: 
UCL = 0.001362, 
CL = 00.00041, 
LCL = 0.000542 
MR chart: 
UCL = 0.001169, 
CL = 0.000358, LCL = 0 
(c) & = 60.3264 
& = 0.0003173 
PCR = 1.0505 
PCR, = 0.9455 
15-79. (a) Trial control limits 
S chart: UCL = 170.25, 
CL = 86.42, LCL = 2.59 
x chart: UCL = 670.00, 
CL = 558.77, 
LCL = 447.53 
Revised S chart: 
UCL = 158.93, 
CL = 80.68, LCL = 2.42 


15-77. (a 


YS 


748 


15-89. 


15-91. 
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x chart: UCL = 655.79, 
CL = 551.95, 
LCL = 448.10 
(b) PCRx = 0.8725 
(c) 6 = 36.9917 
(d) ARL = 18.6 
(a) ARL = 43.9 
(b) ARL = 6.30 
(c) ARL = 2.00 
(a) x chart: UCL = 140.168, 
CL = 139.49, 
LCL = 138.812 
R chart: UCL = 2.48437, 
CL = 1.175, LCL = 0 
(b) Revised: X chart: 
UCL = 140.417, 
CL = 139.709, 
LCL = 139.001 


Rchart: UCL = 2.596, 

CL = 1.227, LCL = 0 

& = 0.5276 

(c) PCR = 1.26 
PCR, = 1.08 

(d) o* = 0.081 

(e) ARL = 5.55 


15-93. P(X < USL) = 0.00135 


15-95, 


15-97. 


15-101. 


(a) P(U > 12.24 when 
d = 16) = 0.96995 

(b) P(U > 10.68 when 
A= 16) =1 

(a) ARL = 17.5 

(b) ARL = 3.63 


(a) ¢ = 3.0411, probability 


= 0.03 
(b) 31.95 


Mind-Expanding Exercises 

15-103. 0.125, 0.004 

15-105. (b) ARL = 1/p where 
p=1-O(k - Vind) 


+ B(-— k — Vn8) 
(c) ARL = 22.0 for k = 2 
(d) ARL = 4.47 
R(1-p 
15-107. n = a) 
15-111. (a) C chart 
CL=8 
UCL = 16.49 
LCL = 0 
(b) Yes 
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Glossary 


2" factorial experiment. A full factorial experiment with k factors 
and all factors tested at only two levels (settings) each. 


2*? factorial experiment. A fractional factorial experiment with 
k factors tested in a 2~? fraction with all factors tested at only two 
levels (settings) each. 


Acceptance region. In hypothesis testing, a region in the sample 
space of the test statistic such that if the test statistic falls within it, 
the null hypothesis cannot be rejected. This terminology is used be- 
cause rejection of Ho is always a strong conclusion and acceptance 
of Hy is generally a weak conclusion. 


Addition rule. A formula used to determine the probability of the 
union of two (or more) events from the probabilities of the events and 
their intersection(s). 


Additivity property of y. If two independent random variables X, 
and X, are distributed as chi-square with v, and v2 degrees of freedom, 
respectively, Y = X, + X, is a chi-square random variable with u = 
v,; + vy degrees of freedom. This generalizes to any number of inde- 
pendent chi-square random variables. 


Adjusted R?. A variation of the R? statistic that compensates for 
the number of parameters in a regression model. Essentially, the ad- 
justment is a penalty for increasing the number of parameters in the 
model. 


Alias. In a fractional factorial experiment when certain factor effects 
cannot be estimated uniquely, they are said to be aliased. 


All possible (subsets) regressions. A method of variable selection in 
regression that examines all possible subsets of the candidate regressor 
variables. Efficient computer algorithms have been developed for 
implementing all possible regressions. 


Alternative hypothesis. In statistical hypothesis testing, this is a 
hypothesis other than the one that is being tested. The alternative 
hypothesis contains feasible conditions, whereas the null hypothesis 
specifies conditions that are under test. 


Analysis of variance (ANOVA). A method of decomposing the total 
variability in a set of observations, as measured by the sum of the 
squares of these observations from their average, into component 
sums of squares that are associated with specific defined sources of 
variation. 


Analytic study. A study in which a sample from a population is 
used to make inference to a future population. Stability needs to be 
assumed. See Enumerative study. 


Arithmetic mean. The arithmetic mean of a set of numbers x), x9, ..., 
x, is their sum divided by the number of observations, or 
(1/n) g }-\x;. The arithmetic mean is usually denoted by x, and is 
often called the average. 


Assignable cause. The portion of the variability in a set of observa- 
tions that can be traced to specific causes, such as operators, materials, 
or equipment. Also called a special cause. 


Asymptotic relative efficiency (ARE). Used to compare hypothesis 
tests. The ARE of one test relative to another is the limiting ratio of the 
sample sizes necessary to obtain identical error probabilities for the 
two procedures. 


Attribute. A qualitative characteristic of an item or unit, usually 
arising in quality control. For example, classifying production units as 
defective or nondefective results in attributes data. 


Attribute control chart. Any control chart for a discrete random 
variable. See Variables control chart. 


Average. See Arithmetic mean. 


Average run length, or ARL. The average number of samples taken 
in a process monitoring or inspection scheme until the scheme 
signals that the process is operating at a level different from the 
level in which it began. 


Axioms of probability. A set of rules that probabilities defined on a 
sample space must follow. See Probability. 


Backward elimination. A method of variable selection in regression 
that begins with all of the candidate regressor variables in the model 
and eliminates the insignificant regressors one at a time until only 
significant regressors remain. 


Bayes estimator. An estimator for a parameter obtained from a 
Bayesian method that uses a prior distribution for the parameter along 
with the conditional distribution of the data given the parameter to 
obtain the posterior distribution of the parameter. The estimator is 
obtained from the posterior distribution. 


Bayes’ theorem. An equation for a conditional probability such as 
P(A | B) in terms of the reverse conditional probability P(B | A). 


Bernoulli trials. Sequences of independent trials with only two out- 
comes, generally called “success” and “failure,” in which the probability 
of success remains constant. 


Bias. An effect that systematically distorts a statistical result or 
estimate, preventing it from representing the true quantity of interest. 


Biased estimator. See Unbiased estimator. 
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Bimodal distribution. A distribution with two modes. 


Binomial random variable. A discrete random variable that equals 
the number of successes in a fixed number of Bernoulli trials. 


Bivariate distribution. The joint probability distribution of two 
random variables. 


Bivariate normal distribution. The joint distribution of two normal 
random variables. 


Block. In experimental design, a group of experimental units or 
material that is relatively homogeneous. The purpose of dividing 
experimental units into blocks is to produce an experimental de- 
sign wherein variability within blocks is smaller than variability 
between blocks. This allows the factors of interest to be compared 
in a environment that has less variability than in an unblocked 
experiment. 


Box plot (or box and whisker plot). A graphical display of data in 
which the box contains the middle 50% of the data (the interquartile 
range) with the median dividing it, and the whiskers extend to the 
smallest and largest values (or some defined lower and upper limits). 


C chart. An attribute control chart that plots the total number of 
defects per unit in a subgroup. Similar to a defects-per-unit or U chart. 


Categorical data. Data consisting of counts or observations that can 
be classified into categories. The categories may be descriptive. 


Causal variable. When y = f(x) and y is considered to be caused by 
xX, x is sometimes called a causal variable. 


Cause-and-effect diagram. A chart used to organize the various 
potential causes of a problem. Also called a fishbone diagram. 


Center line. A horizontal line on a control chart at the value that esti- 
mates the mean of the statistic plotted on the chart. See Control chart. 


Central composite design (CCD). A second-order response surface 
design in k variables consisting of a two-level factorial, 2k axial runs, 
and one or more center points. The two-level factorial portion of a 
CCD can be a fractional factorial design when k is large. The CCD is 
the most widely used design for fitting a second-order model. 


Central limit theorem. The simplest form of the central limit theo- 
rem states that the sum of n independently distributed random vari- 
ables will tend to be normally distributed as n becomes large. It is a 
necessary and sufficient condition that none of the variances of the 
individual random variables are large in comparison to their sum. 
There are more general forms of the central theorem that allow infi- 
nite variances and correlated random variables, and there is a multi- 
variate version of the theorem. 


Central tendency. The tendency of data to cluster around some 
value. Central tendency is usually expressed by a measure of location 
such as the mean, median, or mode. 


Chance cause. The portion of the variability in a set of observations 
that is due to only random forces and which cannot be traced to 
specific sources, such as operators, materials, or equipment. Also 
called a common cause. 


Chi-square (or chi-squared) random variable. A continuous 
random variable that results from the sum of squares of independent 
standard normal random variables. It is a special case of a gamma 
random variable. 


Chi-square test. Any test of significance based on the chi-square 
distribution. The most common chi-square tests are (1) testing 
hypotheses about the variance or standard deviation of a normal 


distribution and (2) testing goodness of fit of a theoretical 
distribution to sample data. 


Coefficient of determination. See R’. 


Combination. A subset selected without replacement from a set used 
to determine the number of outcomes in events and sample spaces. 


Comparative experiment. An experiment in which the treatments 
(experimental conditions) that are to be studied are included in the 
experiment. The data from the experiment are used to evaluate the 
treatments. 


Completely randomized design (or experiment). A type of experi- 
mental design in which the treatments or design factors are assigned to 
the experimental units in a random manner. In designed experiments, 
a completely randomized design results from running all of the treat- 
ment combinations in random order. 


Components of variance. The individual components of the total 
variance that are attributable to specific sources. This usually refers to 
the individual variance components arising from a random or mixed 
model analysis of variance. 


Conditional mean. The mean of the conditional probability distribu- 
tion of a random variable. 


Conditional probability. The probability of an event given that the 
random experiment produces an outcome in another event. 


Conditional probability density function. The probability density 
function of the conditional probability distribution of a continuous 
random variable. 


Conditional probability distribution. The distribution of a random 
variable given that the random experiment produces an outcome in 
an event. The given event might specify values for one or more other 
random variables. 


Conditional probability mass function. The probability mass func- 
tion of the conditional probability distribution of a discrete random 
variable. 


Conditional variance. The variance of the conditional probability 
distribution of a random variable. 


Confidence coefficient. The probability 1 — a associated with a 
confidence interval expressing the probability that the stated interval 
will contain the true parameter value. 


Confidence interval. If it is possible to write a probability statement 
of the form 


P(LS=0=5U)=1-a 


where L and U are functions of only the sample data and 6 is a param- 
eter, then the interval between L and U is called a confidence interval 
(or a 100(1 — «)% confidence interval). The interpretation is that a state- 
ment that the parameter 8 lies in this interval will be true 100(1 — a)% 
of the times that such a statement is made. 


Confidence level. Another term for the confidence coefficient. 


Confounding. When a factorial experiment is run in blocks and the 
blocks are too small to contain a complete replicate of the experiment, 
one can run a fraction of the replicate in each block, but this results in 
losing information on some effects. These effects are linked with or 
confounded with the blocks. In general, when two factors are varied 
such that their individual effects cannot be determined separately, 
their effects are said to be confounded. 


Consistent estimator. An estimator that converges in probability to 
the true value of the estimated parameter as the sample size increases. 


Contingency table. A tabular arrangement expressing the assign- 
ment of members of a data set according to two or more categories or 
classification criteria. 


Continuity correction. A correction factor used to improve the 
approximation to binomial probabilities from a normal distribution. 


Continuous distribution. A probability distribution for a continuous 
random variable. 


Continuous random variable. A random variable with an interval 
(either finite or infinite) of real numbers for its range. 


Continuous uniform random variable. A continuous random 
variable with range of a finite interval and a constant probability 
density function. 


Contour plot. A two-dimensional graphic used for a bivariate prob- 
ability density function that displays curves for which the probability 
density function is constant. 


Contrast. A linear function of treatment means with coefficients that 
total zero. A contrast is a summary of treatment means that is of interest 
in an experiment. 


Control chart. A graphical display used to monitor a process. It 
usually consists of a horizontal center line corresponding to the in- 
control value of the parameter that is being monitored and lower and 
upper control limits. The control limits are determined by statistical 
criteria and are not arbitrary, nor are they related to specification 
limits. If sample points fall within the control limits, the process is 
said to be in-control, or free from assignable causes. Points beyond 
the control limits indicate an out-of-control process; that is, assignable 
causes are likely present. This signals the need to find and remove the 
assignable causes. 


Control limits. See Control chart. 


Convolution. A method to derive the probability density function 
of the sum of two independent random variables from an integral (or 
sum) of probability density (or mass) functions. 


Cook’s distance. In regression, Cook’s distance is a measure of the 
influence of each individual observation on the estimates of the 
regression model parameters. It expresses the distance that the vector 
of model parameter estimates with the ith observation removed lies 
from the vector of model parameter estimates based on all observa- 
tions. Large values of Cook’s distance indicate that the observation is 
influential. 


Correction factor. A term used for the quantity (1/n)(g /_,x;)° that is 
subtracted from g ;_)x7 to give the corrected sum of squares defined as 


(1/n) g j=:(%; — x). The correction factor can also be written as nx. 


Correlation. In the most general usage, a measure of the interde- 
pendence among data. The concept may include more than two vari- 
ables. The term is most commonly used in a narrow sense to express 
the relationship between quantitative variables or ranks. 


Correlation coefficient. A dimensionless measure of the linear 
association between two variables, usually lying in the interval from 
—1 to +1, with zero indicating the absence of correlation (but not 
necessarily the independence of the two variables). 


Correlation matrix. A square matrix that contains the correlations 
among a set of random variables, say, X,, X>,..., X;,. The main diagonal 
elements of the matrix are unity and the off-diagonal elements r; are 
the correlations between X; and Xj. 


GLOSSARY 753 


Counting techniques. Formulas used to determine the number of 
elements in sample spaces and events. 


Covariance. A measure of association between two random variables 
obtained as the expected value of the product of the two random vari- 
ables around their means; that is, Cov(X, Y) = E[(X — px)(Y — py). 


Covariance matrix. A square matrix that contains the variances and 
covariances among a set of random variables, say, X,, X,..., X,. The 
main diagonal elements of the matrix are the variances of the random 
variables and the off-diagonal elements are the covariances between 
X; and X;. Also called the variance-covariance matrix. When the 
random variables are standardized to have unit variances, the covari- 
ance matrix becomes the correlation matrix. 


Critical region. In hypothesis testing, this is the portion of the 
sample space of a test statistic that will lead to rejection of the null 
hypothesis. 


Critical value(s). The value of a statistic corresponding to a stated 
significance level as determined from the sampling distribution. For 
example, if P(Z = zpos) = P(Z = 1.96) = 0.05, then zoos = 1.96 is 
the critical value of z at the 0.05 level of significance. 


Crossed factors. Another name for factors that are arranged in a 
factorial experiment. 


Cumulative distribution function. For a random variable X, the 
function of X defined as P(X = x) that is used to specify the probability 
distribution. 


Cumulative normal distribution function. The cumulative distribu- 
tion of the standard normal distribution, often denoted as ®(x) and 
tabulated in Appendix Table II. 


Cumulative sum control chart (CUSUM). A control chart in which 
the point plotted at time f is the sum of the measured deviations from 
target for all statistics up to time ¢. 


Curvilinear regression. An expression sometimes used for nonlinear 
regression models or polynomial regression models. 


Decision interval. A parameter in a tabular CUSUM algorithm that is 
determined from a trade-off between false alarms and the detection of as- 
signable causes. 


Defect. Used in statistical quality control, a defect is a particular 
type of nonconformance to specifications or requirements. 
Sometimes defects are classified into types, such as appearance 
defects and functional defects. 


Defect concentration diagram. A quality tool that graphically 
shows the location of defects on a part or in a process. 


Defects-per-unit control chart. See U chart. 


Defining relation. A subset of effects in a fractional factorial design 
that define the aliases in the design. 


Degrees of freedom. The number of independent comparisons that 
can be made among the elements of a sample. The term is analogous 
to the number of degrees of freedom for an object in a dynamic 
system, which is the number of independent coordinates required to 
determine the motion of the object. 


Deming. W. Edwards Deming (1900-1993) was a leader in the use of 
statistical quality control. 


Deming’s 14 points. A management philosophy promoted by 
W. Edwards Deming that emphasizes the importance of change and 
quality. 
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Density function. Another name for a probability density function. 


Dependent variable. The response variable in regression or a 
designed experiment. 


Design matrix. A matrix that provides the tests that are to be 
conducted in an experiment. 


Designed experiment. An experiment in which the tests are planned 
in advance and the plans usually incorporate statistical models. See 
Experiment. 


Discrete distribution. A probability distribution for a discrete 
random variable. 


Discrete random variable. A random variable with a finite 
(or countably infinite) range. 


Discrete uniform random variable. A discrete random variable 
with a finite range and constant probability mass function. 


Dispersion. The amount of variability exhibited by data. 


Distribution free method(s). Any method of inference (hypothesis 
testing or confidence interval construction) that does not depend 
on the form of the underlying distribution of the observations. 
Sometimes called nonparametric method(s). 


Distribution function. Another name for a cumulative distribution 
function. 


Efficiency. A concept in parameter estimation that uses the variances 
of different estimators; essentially, an estimator is more efficient than 
another estimator if it has smaller variance. When estimators are 
biased, the concept requires modification. 


Empirical model. A model to relate a response to one or more 
regressors or factors that is developed from data obtained from the 
system. 


Enumerative study. A study in which a sample from a population is 
used to make inference to the population. See Analytic study. 


Erlang random variable. A continuous random variable that is the 
sum of a fixed number of independent, exponential random variables. 


6-error (or $-risk). In hypothesis testing, an error incurred by 
failing to reject a null hypothesis when it is actually false (also called 
a type II error). 


a-error (or o-risk). In hypothesis testing, an error incurred by 
rejecting a null hypothesis when it is actually true (also called a type I 
error). 


Error mean square. The error sum of squares divided by its number 
of degrees of freedom. 


Error of estimation. The difference between an estimated value and 
the true value. 


Error propagation. An analysis of how the variance of the random 
variable that represents that output of a system depends on the 
variances of the inputs. A formula exists when the output is a linear 
function of the inputs and the formula is simplified if the inputs are 
assumed to be independent. 


Error sum of squares. In analysis of variance, this is the portion of 
total variability that is due to the random component in the data. It 
is usually based on replication of observations at certain treatment 
combinations in the experiment. It is sometimes called the residual 
sum of squares, although this is really a better term to use only 
when the sum of squares is based on the remnants of a model-fitting 
process and not on replication. 


Error variance. The variance of an error term or component in a model. 


Estimate (or point estimate). The numerical value of a point 
estimator. 


Estimator (or point estimator). A procedure for producing an 
estimate of a parameter of interest. An estimator is usually a function 
of only sample data values, and when these data values are available, 
it results in an estimate of the parameter of interest. 


Event. A subset of a sample space. 


Exhaustive. A property of a collection of events that indicates that 
their union equals the sample space. 


Expected value. The expected value of a random variable X is its 
long-term average or mean value. In the continuous case, the expected 
value of Xis E(X) = f%, xf(x) dx where f(x) is the density function of 
the random variable X. 


Experiment. A series of tests in which changes are made to the 
system under study. 


Exponential random variable. A continuous random variable that is 
the time between events in a Poisson process. 


Extra sum of squares method. A method used in regression analysis 
to conduct a hypothesis test for the additional contribution of one or 
more variables to a model. 


Factorial experiment. A type of experimental design in which every 
level of one factor is tested in combination with every level of another 
factor. In general, in a factorial experiment, all possible combinations 
of factor levels are tested. 


False alarm. A signal from a control chart when no assignable causes 
are present. 


F distribution. The distribution of the random variable defined as the 
ratio of two independent chi-square random variables, each divided by 
its number of degrees of freedom. 


Finite population correction factor. A term in the formula for the 
variance of a hypergeometric random variable. 


First-order model. A model that contains only first-order terms. For 
example, the first-order response surface model in two variables is 
y = Bo + Bix, + Box. + €. A first-order model is also called a main 
effects model. 


Fisher’s least significant difference (LSD) method. A series of 
pair-wise hypothesis tests of treatment means in an experiment to 
determine which means differ. 


Fixed factor (or fixed effect). In analysis of variance, a factor 
or effect is considered fixed if all the levels of interest for that fac- 
tor are included in the experiment. Conclusions are then valid about 
this set of levels only, although when the factor is quantitative, it is 
customary to fit a model to the data for interpolating between these 
levels. 


Forward selection. A method of variable selection in regression, 
where variables are inserted one at a time into the model until no other 
variables that contribute significantly to the model can be found. 


Fraction defective control chart. See P chart. 


Fraction defective. In statistical quality control, that portion of a 
number of units or the output of a process that is defective. 


Fractional factorial experiment. A type of factorial experiment in 
which not all possible treatment combinations are run. This is usually 
done to reduce the size of an experiment with several factors. 


Frequency distribution. An arrangement of the frequencies of 
observations in a sample or population according to the values that the 
observations take on. 


F-test. Any test of significance involving the F distribution. The most 
common F-tests are (1) testing hypotheses about the variances or stan- 
dard deviations of two independent normal distributions, (2) testing 
hypotheses about treatment means or variance components in the 
analysis of variance, and (3) testing significance of regression or tests 
on subsets of parameters in a regression model. 


Gamma function. A function used in the probability density function 
of a gamma random variable that can be considered to extend factorials. 


Gamma random variable. A random variable that generalizes an 
Erlang random variable to noninteger values of the parameter r. 


Gaussian distribution. Another name for the normal distribution, 
based on the strong connection of Karl F Gauss to the normal distri- 
bution; often used in physics and electrical engineering applications. 


Generating function. A function that is used to determine properties 
of the probability distribution of a random variable. See Moment 
generating function. 


Generator. Effects in a fractional factorial experiment that are used 
to construct the experimental tests used in the experiment. The gener- 
ators also define the aliases. 


Geometric mean. The geometric mean of a set of n positive data 
values is the nth root of the product of the data values; that is, 
& = (wjax)'”. 

Geometric random variable. A discrete random variable that is the 
number of Bernoulli trials until a success occurs. 


Goodness of fit. In general, the agreement of a set of observed values 
and a set of theoretical values that depend on some hypothesis. The 
term is often used in fitting a theoretical distribution to a set of 
observations. 


Harmonic mean. The harmonic mean of a set of data values 
is the reciprocal of the arithmetic mean of the reciprocals of the data 


_ = Low L\ 
values: hatin = (7 12) ‘ 


Hat matrix. In multiple regression, the matrix H = X(X'X)"'X’. 
This a projection matrix that maps the vector of observed response 
values into a vector of fitted values by ¥ = X(X'X)'X’y = Hy. 


Hidden extrapolation. An extrapolation is a prediction in a regres- 
sion analysis that is made at point (x,, x), ..., x,) that is remote from 
the data used to generate the model. Hidden extrapolation occurs 
when it is not obvious that the point is remote. This can occur when 
multicollinearity is present in the data used to construct the model. 


Histogram. A univariate data display that uses rectangles propor- 
tional in area to class frequencies to visually exhibit features of data 
such as location, variability, and shape. 


Homogeneity test. In a two-way (r by c) contingency table, this tests 
if the proportions in the c categories are the same for all r populations. 


Hypergeometric random variable. A discrete random variable that is 
the number of success obtained from a sample drawn without replace- 
ment from a finite populations. 


Hypothesis (as in statistical hypothesis). A statement about the pa- 
rameters of a probability distribution or a model, or a statement about 
the form of a probability distribution. 
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Hypothesis testing. Any procedure used to test a statistical hypothesis. 


Independence. A property of a probability model and two (or more) 
events that allows the probability of the intersection to be calculated as 
the product of the probabilities. 


Independence test. In a two-way (r by c) contingency table, this tests 
if the row and column categories are independent. 


Independent random variables. Random variables for which P(X € A, 
YE B) = P(XE A)P(YE B) for any sets A and B in the range of X and 
Y, respectively. There are several equivalent descriptions of indepen- 
dent random variables. 


Independent variable. The predictor or regressor variables in a 
regression model. 


Inference. Conclusion from a statistical analysis. It usually refers to 
the conclusion from a hypothesis test or an interval estimate. 


Indicator variable(s). Variables that are assigned numerical val- 
ues to identify the levels of a qualitative or categorical response. 
For example, a response with two categorical levels (yes and no) 
could be represented with an indicator variable taking on the val- 
ues 0 and 1. 


Individuals control chart. A Shewhart control chart in which each 
plotted point is an individual measurement, rather than a summary 
statistic. See Control chart, Shewhart control chart. 


Influential observation. An observation in a regression analysis that 
has a large effect on estimated parameters in the model. Influence is 
measured by the change in parameters when the influential observation 
is included and excluded in the analysis. 


Interaction. In factorial experiments, two factors are said to interact 
if the effect of one variable is different at different levels of the other 
variables. In general, when variables operate independently of each 
other, they do not exhibit interaction. 


Intercept. The constant term in a regression model. 


Interquartile range. The difference between the third and first quar- 
tiles in a sample of data. The interquartile range is less sensitive to 
extreme data values than the usual sample range. 


Interval estimation. The estimation of a parameter by a range of 
values between lower and upper limits, in contrast to point estimation, 
where the parameter is estimated by a single numerical value. A 
confidence interval is a typical interval estimation procedure. 


Intrinsically linear model. In regression analysis, a nonlinear func- 
tion that can be expressed as a linear function after a suitable transfor- 
mation is called intrinsically linear. 


Jacobian. A matrix of partial derivatives that is used to determine the 
distribution of transformed random variables. 


Joint probability density function. A function used to calculate 
probabilities for two or more continuous random variables. 


Joint probability distribution. The probability distribution for two 
or more random variables in a random experiment. See Joint probability 
mass function and Joint probability density function. 


Joint probability mass function. A function used to calculate prob- 
abilities for two or more discrete random variables. 


Kurtosis. A measure of the degree to which a unimodal distribution is 
peaked. 
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Lack of memory property. A property of a Poisson process. The 
probability of a count in an interval depends only on the length of the 
interval (and not on the starting point of the interval). A similar property 
holds for a series of Bernoulli trials. The probability of a success in a 
specified number of trials depends only on the number of trials (and 
not on the starting trial). 


Least significance difference test (or Fisher’s LSD test). An ap- 
plication of the t-test to compare pairs of means following rejection 
of the null hypothesis in an analysis of variance. The error rate is 
difficult to calculate exactly because the comparisons are not all 
independent. 


Least squares (method of). A method of parameter estimation in 
which the parameters of a system are estimated by minimizing the 
sum of the squares of the differences between the observed values and 
the fitted or predicted values from the system. 


Least squares estimator. Any estimator obtained by the method of 
least squares. 


Level of significance. If Z is the test statistic for a hypothesis, and the 
distribution of Z when the hypothesis is true are known, then we can 
find the probabilities P(Z = z,) and P(Z = zy). Rejection of the hy- 
pothesis is usually expressed in terms of the observed value of Z 
falling outside the interval from z, to zy. The probabilities P(Z = z,) 
and P(Z = z,) are usually chosen to have small values, such as 0.01, 
0.025, 0.05, or 0.10, and are called levels of significance. The actual 
levels chosen are somewhat arbitrary and are often expressed in 
percentages, such as a 5% level of significance. 


Levels of a factor. The settings (or conditions) used for a factor in an 
experiment. 


Likelihood function. Suppose that the random variables X,, X, 

, X, have a joint distribution given by f(x;, x2, ..., X%,3 8), 89, ..., 
8,,) where the @s are unknown parameters. This joint distribution, 
considered as a function of the 0s for fixed x’s, is called the like- 
lihood function. 


Likelihood principle. This principle states that the information 
about a model given by a set of data is completely contained in the 
likelihood. 


Likelihood ratio. Let x,, x5, ..., x, be a random sample from the 
population f(x; 6). The likelihood function for this sample is 
L = w}-, f(x; 8). We wish to test the hypothesis Hy: 8 € w, where w 
is a subset of the possible values Q for @. Let the maximum value of L 
with respect to @ over the entire set of values that the parameter can 
take on be denoted by LQ), and let the maximum value of L with 6 re- 
stricted to the set of values given by w be L(@). The null hypothesis is 
tested by using the likelihood ratio A = L(@)/L(Q), or a simple 
function of it. Large values of the likelihood ratio are consistent 
with the null hypothesis. 


Likelihood ratio test. A test of a null hypothesis versus an alterna- 
tive hypothesis using a test statistic derived from a likelihood ratio. 


Linear function of random variables. A random variable that is 
defined as a linear function of several random variables. 


Linear model. A model in which the observations are expressed as 
a linear function of the unknown parameters. For example, y = By + 
Bix + e andy = By + Bix + Box? + € are linear models. 


Location parameter. A parameter that defines a central value in a 
sample or a probability distribution. The mean and the median are 
location parameters. 


Logistic regression. A regression model that is used to model a cate- 
gorical response. For a binary (0, 1) response, the model assumes that 
the logarithm of the ratio of probabilities (for zero and one) is linearly 
related to the regressor variables. 


Lognormal random variable. A continuous random variable with 
probability distribution equal to that of exp(W) for a normal random 
variable W. 


Main effect. An estimate of the effect of a factor (or variable) that 
independently expresses the change in response due to a change in 
that factor, regardless of other factors that may be present in the 
system. 


Marginal probability density function. The probability density 
function of a continuous random variable obtained from the joint 
probability distribution of two or more random variables. 


Marginal probability distribution. The probability distribution of a 
random variable obtained from the joint probability distribution of two or 
more random variables. 


Marginal probability mass function. The probability mass function 
of a discrete random variable obtained from the joint probability 
distribution of two or more random variables. 


Maximum likelihood estimation. A method of parameter estimation 
that maximizes the likelihood function of a sample. 


Mean. The mean usually refers either to the expected value of a 
random variable or to the arithmetic average of a set of data. 


Mean square. In general, a mean square is determined by dividing 
a sum of squares by the number of degrees of freedom associated 
with the sum of squares. 


Mean square(d) error. The expected squared deviation of an estimator 
from the true value of the parameter it estimates. The mean square 
error can be decomposed into the variance of the estimator 
plus the square of the bias; that is, MSE(®) = E(O — 6 = 
V(@) + [E(@) — 67. 

Mechanistic model. A model developed from theoretical knowledge 
or experience in contrast to a model developed from data. See 
Empirical model. 


Median. The median of a set of data is that value that divides the data 
into two equal halves. When the number of observations is even, say 
2n, it is customary to define the median as the average of the nth and 
(n + 1)st rank-ordered values. The median can also be defined for a 
random variable. For example, in the case of a continuous random vari- 
able X, the median M can be defined as [™. f(x)dx = fi f(x)dx = 
1/2. 


Method of steepest ascent. A technique that allows an experi- 
menter to move efficiently toward a set of optimal operating condi- 
tions by following the gradient direction. The method of steepest 
ascent is usually employed in conjunction with fitting a first-order 
response surface and deciding that the current region of operation is 
inappropriate. 

Mixed model. In an analysis of variance context, a mixed model 
contains both random and fixed factors. 


Mode. The mode of a sample is that observed value that occurs 
most frequently. In a probability distribution f(x) with continuous 
first derivative, the mode is a value of x for which df(x)/dx = 0 and 
@f(x)/dx’ <0. There may be more than one mode of either a 
sample or a distribution. 


Moment (or population moment). The expected value of a function 
of a random variable such as E(X — c)' for constants c and r. When c = 
O, it is said that the moment is about the origin. See Moment generating 
function. 


Moment estimator. A method of estimating parameters by equating 
sample moments to population moments. Since the population 
moments will be functions of the unknown parameters, this results in 
equations that may be solved for estimates of the parameters. 


Moment generating function. A function that is used to determine 
properties (such as moments) of the probability distribution of a 
random variable. It is the expected value of exp(tX). See Generating 
function and Moment. 


Moving range. The absolute value of the difference between succes- 
sive observations in time-ordered data. Used to estimate chance variation 
in an individual control chart. 


Multicollinearity. A condition occurring in multiple regression 
where some of the predictor or regressor variables are nearly linearly 
dependent. This condition can lead to instability in the estimates of the 
regression model parameters. 


Multinomial distribution. The joint probability distribution of the 
random variables that count the number of results in each of k classes 
in a random experiment with a series of independent trials with 
constant probability of each class on each trial. It generalizes a 
binomial distribution. 


Multiplication rule. For probability, a formula used to determine 
the probability of the intersection of two (or more) events. For 
counting techniques, a formula used to determine the number of 
ways to complete an operation from the number of ways to 
complete successive steps. 


Mutually exclusive events. A collection of events whose intersec- 
tions are empty. 


Natural tolerance limits. A set of symmetric limits that are three 
times the process standard deviation from the process mean. 


Negative binomial random variable. A discrete random variable 
that is the number of trials until a specified number of successes occur 
in Bernoulli trials. 


Nonlinear regression model. A regression model that is nonlinear 
in the parameters. It is sometimes applied to regression models that 
are nonlinear in the regressors or predictors, but this is an incorrect 
usage. 


Nonparametric — statistical method(s). See Distribution free 
method(s). 


Normal approximation. A method to approximate probabilities for 
binomial and Poisson random variables. 


Normal equations. The set of simultaneous linear equations arrived 
at in parameter estimation using the method of least squares. 


Normal probability plot. A specially constructed plot for a 
variable x (usually on the abscissa) in which y (usually on the 
ordinate) is scaled so that the graph of the normal cumulative 
distribution is a straight line. 


Normal random variable. A continuous random variable that is the 
most important one in statistics because it results from the central 
limit theorem. See Central limit theorem. 


NP chart. An attribute control chart that plots the total of defective 
units in a subgroup. Similar to a fraction-defective chart or P chart. 
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Nuisance factor. A factor that probably influences the response 
variable, but which is of no interest in the current study. When the 
levels of the nuisance factor can be controlled, blocking is the design 
technique that is customarily used to remove its effect. 


Null distribution. In a hypothesis test, the distribution of the test 
statistic when the null hypothesized is assumed to be true. 


Null hypothesis. This term generally relates to a particular hypothesis 
that is under test, as distinct from the alternative hypothesis (which 
defines other conditions that are feasible but not being tested). The 
null hypothesis determines the probability of type I error for the test 
procedure. 


Observational study. A system is observed and data might be 
collected, but changes are not made to the system. See Experiment. 


Odds ratio. The odds equals the ratio of two probabilities. In logistic 
regression, the logarithm of the odds is modeled as a linear function of 
the regressors. Given values for the regressors at a point, the odds can 
be calculated. The odds ratio is the odds at one point divided by the 
odds at another. 


One-way model. In an analysis of variance context, this involves a 
single variable or factor with a different levels. 


Operating characteristic curves (OC curves). A plot of the proba- 
bility of type II error versus some measure of the extent to which the 
null hypothesis is false. Typically, one OC curve is used to represent 
each sample size of interest. 


Optimization experiment. A experiment conducted to improve (or 
optimize) a system or process. It is assumed that the important factors 
are known. 


Orthogonal. There are several related meanings, including the math- 
ematical sense of perpendicular, two variables being said to be 
orthogonal if they are statistically independent, or in experimental 
design where a design is orthogonal if it admits statistically independent 
estimates of effects. 


Orthogonal design. See Orthogonal. 
Outcome. An element of a sample space. 


Outlier(s). One or more observations in a sample that are so far from 
the main body of data that they give rise to the question that they may be 
from another population. 


Overcontrol. Unnecessary adjustments made to processes that 
increase the deviations from target. 


Overfitting. Adding more parameters to a model than is necessary. 


P chart. An attribute control chart that plots the proportion of defec- 
tive units in a subgroup. Also called a fraction-defective control chart. 
Similar to an NP chart. 


Parameter estimation. The process of estimating the parameters 
of a population or probability distribution. Parameter estimation, 
along with hypothesis testing, is one of the two major techniques of 
statistical inference. 


Parameter. An unknown quantity that may vary over a set of values. 
Parameters occur in probability distributions and in statistical models, 
such as regression models. 


Pareto chart. A bar chart used to rank the causes of a problem. 


PCR. A process capability ratio with numerator equal to the difference 
between the product specification limits and denominator equal to six 
times the process standard deviation. Said to measure the potential 
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capability of the process because the process mean is not considered. 
See Process capability, Process capability ratio, Process capability 
study, and PCR,. Sometimes denoted as C, in other references. 


PCR,. A process capability ratio with numerator equal to the differ- 
ence between the product target and the nearest specification limit and 
denominator equal to three times the process standard deviation. Said 
to measure the actual capability of the process because the process 
mean is considered. See process capability, process capability ratio, 
process capability study, and PCR. Sometimes denoted as C,,, in other 
references. 


Percentage point. A particular value of a random variable deter- 
mined from a probability (expressed as a percentage). For example, 
the upper 5 percentage point of the standard normal random variable 
iS Zy95 = 1.645. 


Percentile. The set of values that divide the sample into 100 equal parts. 


Permutation. An ordered sequence of the elements in a set used to 
determine the number of outcomes in events and sample spaces. 


Point estimator. See Estimator. 


Poisson process. A random experiment with events that occur in an 
interval and satisfy the following assumptions. The interval can be 
partitioned into subintervals such that the probability of more than one 
event in a subinterval is zero, the probability of an event in a subinterval 
is proportional to the length of the subinterval, and the event in each 
subinterval is independent of other subintervals. 


Poisson random variable. A discrete random variable that is the 
number of events that occur in a Poisson process. 


Pooled ¢-test. A hypothesis to compare the means of two populations 
with the variances assumed to be equal. 


Pooling. When several sets of data can be thought of as having been 
generated from the same model, it is possible to combine them, usually 
for purposes of estimating one or more parameters. Combining the 
samples for this purpose is usually called pooling and it is commonly 
used to estimate a variance. 


Population standard deviation. See Standard deviation. 
Population variance. See Variance. 


Population. Any finite or infinite collection of individual units or 
objects. 


Posterior distribution. The probability distribution for a parameter 
in a Bayesian analysis calculated from the prior distribution and the 
conditional distribution of the data given the parameter. 


Power. The power of a statistical test is the probability that the test 
rejects the null hypothesis when the null hypothesis is indeed false. 
Thus, the power is equal to one minus the probability of type II error. 


Prediction. The process of determining the value of one or more 
statistical quantities at some future point in time. In a regression 
model, predicting the response y for some specified set of regressors 
or predictor variables also leads to a predicted value, although there 
may be no temporal element to the problem. 


Prediction interval. The interval between a set of upper and lower 
limits associated with a predicted value designed to show on a proba- 
bility basis the range of error associated with the prediction. 


Predictor variable(s). The independent or regressor variable(s) in a 
regression model. 


PRESS statistic. In regression analysis, the predicted residual sum 
of squares. Delete each point and estimate the parameters of the model 
from the data that remain. Estimate the deleted point from this model. 
Restore the point and then delete the next point. Each point is 
estimated once and the sum of squares of these errors is calculated. 


Prior distribution. The initial probability distribution assumed for a 
parameter in a Bayesian analysis. 


Probability. A numerical measure between 0 and | assigned to 
events in a sample space. Higher numbers indicate the event is more 
likely to occur. See Axioms of probability. 


Probability density function. A function used to calculate probabil- 
ities and to specify the probability distribution of a continuous random 
variable. 


Probability distribution. For a sample space, a description of the set 
of possible outcomes along with a method to determine probabilities. 
For a random variable, a probability distribution is a description of the 
range along with a method to determine probabilities. 


Probability mass function. A function that provides probabilities for 
the values in the range of a discrete random variable. 


Probability plot. A scatter plot used to judge if data can reasonably 
be assumed to follow a particular probability distribution. A normal 
probability plot is often used to evaluate the normality assumption of 
data or residuals. 


Process capability. The capability of a process to produce product 
within specification limits. See Process capability ratio, Process 
capability study, PCR, and PCR,. 


Process capability ratio. A ratio that relates the width of the product 
specification limits to measures of process performance. Used to quan- 
tify the capability of the process to produce product within specifica- 
tions. See Process capability, Process capability study, PCR, and PCR,. 


Process capability study. A study that collects data to estimate 
process capability. See Process capability, Process capability ratio, 
PCR, and PCR,. 


P-Value. The exact significance level of a statistical test; that is, the 
probability of obtaining a value of the test statistic that is at least as 
extreme as that observed when the null hypothesis is true. 


Qualitative (data). Data derived from nonnumeric attributes, such as 
sex, ethnic origin or nationality, or other classification variables. 


Quality control. Systems and procedures used by an organization to 
assure that the outputs from processes satisfy customers. 


Quantiles. The set of n — 1 values of a variable that partition it into 
a number n of equal proportions. For example, n — 1 = 3 values 
partition data into four quantiles, with the central value usually called 
the median and the lower and upper values usually called the lower 
and upper quartiles, respectively. 


Quantitative (data). Data in the form of numerical measurements or 
counts. 


Quartiles. The three values of a variable that partition it into four 
equal parts. The central value is usually called the median and the 
lower and upper values are usually called the lower and upper quar- 
tiles, respectively. See Quantiles. 


R’. A quantity used in regression models to measure the proportion of 
total variability in the response accounted for by the model. 
Computationally, R? = SSpegression/SStoals and large values of R? (near 


unity) are considered good. However, it is possible to have large 
values of R? and find that the model is unsatisfactory. R? is also called 
the coefficient of determination (or the coefficient of multiple deter- 
mination in multiple regression). 


R chart. A control chart that plots the range of the measurements in 
a subgroup that is used to monitor the variance of the process. 


Random. Nondeterministic, occurring purely by chance, or independent 
of the occurrence of other events. 


Random effects model. In an analysis of variance context, this refers 
to a model that involves only random factors. 


Random error. An error (usually a term in a statistical model) that 
behaves as if it were drawn at random from a particular probability 
distribution. 


Random experiment. An experiment that can result in different 
outcomes, even though it is repeated in the same manner each time. 


Random factor. In analysis of variance, a factor whose levels are 
chosen at random from some population of factor levels. 


Random order. A sequence or order for a set of objects that is carried 
out in such a way that every possible ordering is equally likely. In 
experimental design, the runs of the experiment are typically arranged 
and carried out in random order. 


Random sample. A sample is said to be random if it is selected in 
such a way so that every possible sample has the same probability of 
being selected. 


Random variable. A function that assigns a real number to each 
outcome in the sample space of a random experiment. 


Randomization. Randomly assign treatments to experimental units 
or conditions in an experiment. This is done to reduce the opportunity 
for a treatment to be favored or disfavored (biased) by test conditions. 


Randomized complete block design. A type of experimental design 
in which treatment or factor levels are assigned to blocks in a random 
manner. 


Range. The largest minus the smallest of a set of data values. The 
range is a simple measure of variability and is widely used in quality 
control. 


Range (control) chart. A control chart used to monitor the variability 
(dispersion) in a process. See Control chart. 


Rank. In the context of data, the rank of a single observation is its 
ordinal number when all data values are ordered according to some 
criterion, such as their magnitude. 


Rational subgroup. A sample of data selected in a manner to include 
chance sources of variation and to exclude assignable sources of variation 
to the extent possible. 


Reference distribution. The distribution of a test statistic when the 
null hypothesis is true. Sometimes a reference distribution is called 
the null distribution of the test statistic. 


Reference value. A parameter set in a tabular CUSUM algorithm 
that is determined from the magnitude of the process shift that should 
be detected. 


Regression. The statistical methods used to investigate the relation- 
ship between a dependent or response variable y and one or more 
independent variables x. The independent variables are usually called 
regressor variables or predictor variables. 


Regression coefficient(s). The parameter(s) in a regression model. 
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Regression diagnostics. Techniques for examining a fitted regression 
model to investigate the adequacy of the fit and to determine if any of the 
underlying assumptions have been violated. 


Regression line (or curve). A graphical display of a regression 
model, usually with the response y on the ordinate and the regressor x 
on the abscissa. 


Regression sum of squares. The portion of the total sum of squares 
attributable to the model that has been fit to the data. 


Regressor variable. The independent or predictor variable in a 
regression model. 


Rejection region. In hypothesis testing, this is the region in the sample 
space of the test statistic that leads to rejection of the null hypothesis 
when the test statistic falls in this region. 


Relative frequency. The relative frequency of an event is the pro- 
portion of times the event occurred in a series of trials of a random 
experiment. 


Reliability. The probability that a specified mission will be 
completed. It usually refers to the probability that a lifetime of a 
continuous random variable exceeds a specified time limit. 


Replicates. One of the independent repetitions of one or more 
treatment combinations in an experiment. 


Replication. The independent execution of an experiment more than 
once. 


Reproductive property of the normal distribution. A linear 
combination of independent, normal random variables is a normal 
random variable. 


Residual. Generally this is the difference between the observed and 
the predicted value of some variable. For example, in regression a 
residual is the difference between the observed value of the response 
and the corresponding predicted value obtained from the regression 
model. 


Residual analysis (and plots). Any technique that uses the residuals, 
usually to investigate the adequacy of the model that was used to 
generate the residuals. 


Residual sum of squares. See Error sum of squares. 


Resolution. A measure of severity of aliasing in a fractional factorial 
design. We commonly consider resolution III, IV, and V designs. 


Response (variable). The dependent variable in a regression model or 
the observed output variable in a designed experiment. 


Response surface. When a response y depends on a function of k 
quantitative variables x), x2, ..., x;,, the values of the response may be 
viewed as a surface in k + 1 dimensions. This surface is called a 
response surface. Response surface methodology is a subset of exper- 
imental design concerned with approximating this surface with a 
model and using the resulting model to optimize the system or 
process. 


Response surface designs. Experimental designs that have been 
developed to work well in fitting response surfaces. These are usually 
designs for fitting a first- or second-order model. The central composite 
design is a widely used second-order response surface design. 


Ridge regression. A method for fitting a regression model that is 
intended to overcome the problems associated with using standard (or 
ordinary) least squares when there is a problem with multicollinearity in 
the data. 
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Rotatable design. In a rotatable design, the variance of the predicted 
response is the same at all points that are the same distance from the 
center of the design. 


Run rules. A set of rules applied to the points plotted on a Shewhart 
control chart that are used to make the chart more sensitized to 
assignable causes. See Control chart, Shewhart control chart. 


Runs test. A nonparametric test to compare two distributions or 
check for independence of measurements. 


S chart. A control chart that plots the standard deviation of the mea- 
surements in a subgroup that is used to monitor the variance of the 
process. 


Sample. Any subset of the elements of a population. 


Sample mean. The arithmetic average or mean of the observations in 
a sample. If the observations are x,, X>,..., X,, then the sample mean is 
(1/n) g j=: x;. The sample mean is usually denoted by x. 


Sample moment. The quantity (1/n) g ;_, x* is called the kth sample 
moment. 


Sample range. See Range. 
Sample size. The number of observations in a sample. 


Sample space. The set of all possible outcomes of a random 
experiment. 


Sample standard deviation. The positive square root of the sample 
variance. The sample standard deviation is the most widely used 
measure of variability of sample data. 


Sample variance. A measure of variability of sample data, defined as 
x =[1/(n — 1)]g j=1(x; — %), where X is the sample mean. 
Sampling distribution. The probability distribution of a statistic. For 


example, the sampling distribution of the sample mean X is the 
normal distribution. 


Scatter diagram. A diagram displaying observations on two vari- 
ables, x and y. Each observation is represented by a point showing its 
x-y coordinates. The scatter diagram can be very effective in revealing 
the joint variability of x and y or the nature of the relationship between 
them. 


Screening experiment. An experiment designed and conducted for 
the purpose of screening out or isolating a promising set of factors for 
future experimentation. Many screening experiments are fractional 
factorials, such as two-level fractional factorial designs. 


Second-order model. A model that contains second-order terms. For 
example, the second-order response surface model in two variables is 
y = Bo + Bin, + Bor + Brtite + Bix + Bor} + €. The second 
order terms in this model are B12x1%9, By,x7, and Byx3. 


Shewhart control chart. A specific type of control chart developed by 
Walter A. Shewhart. Typically, each plotted point is a summary statistic 
calculated from the data in a rational subgroup. See Control chart. 


Sign test. A statistical test based on the signs of certain functions of 
the observations and not their magnitudes. 


Signed-rank test. A statistical test based on the differences within a 
set of paired observations. Each difference has a sign and a rank, and 
the test uses the sum of the differences with regard to sign. 


Significance. In hypothesis testing, an effect is said to be significant 
if the value of the test statistic lies in the critical region. 


Significance level. See Level of significance. 


Six-sigma process. Originally used to describe a process with the 
mean at least six standard deviations from the nearest specification 
limits. It has now been used to describe any process with a defect rate 
of 3.4 parts per million. 


Skewness. A term for asymmetry usually employed with respect to a 
histogram of data or a probability distribution. 


Specification limits. Numbers that define the region of measurement 
for acceptable product. Usually there is an upper and lower limit, but 
one-sided limits can also be used. 


Standard deviation. The positive square root of the variance. The 
standard deviation is the most widely used measure of variability. 


Standard error. The standard deviation of the estimator of a param- 
eter. The standard error is also the standard deviation of the sampling 
distribution of the estimator of a parameter. 


Standard normal random variable. A normal random variable with 
mean zero and variance one that has its cumulative distribution function 
tabulated in Appendix Table II. 


Standardize. The transformation of a normal random variable that 
subtracts its mean and divides by its standard deviation to generate a 
standard normal random variable. 


Standardized residual. In regression, the standardized residual is 
computed by dividing the ordinary residual by the square root of the 
residual mean square. This produces scaled residuals that have, 
approximately, a unit variance. 


Statistic. A summary value calculated from a sample of observations. 
Usually, a statistic is an estimator of some population parameter. 


Statistical inference. See Inference. 


Statistical Process Control (SPC). A set of problem-solving tools 
based on data that are used to improve a process. 


Statistical quality control. Statistical and engineering methods used 
to measure, monitor, control, and improve quality. 


Statistics. The science of collecting, analyzing, interpreting, and 
drawing conclusions from data. 


Steepest ascent (or descent). A strategy for a series of tests to opti- 
mize a response used along with response surface models. 


Stem-and-leaf diagram. A method of displaying data in which the 
stem corresponds to a range of data values and the leaf represents 
the next digit. It is an alternative to the histogram but displays the 
individual observations rather than sorting them into bins. 


Stepwise regression. A method of selecting variables for inclusion in 
a regression model. It operates by introducing the candidate variables 
one at a time (as in forward selection) and then attempting to remove 
variables following each forward step. 


Studentized range. The range of a sample divided by the sample 
standard deviation. 


Studentized residual. In regression, the studentized residual is 
calculated by dividing the ordinary residual by its exact standard devi- 
ation, producing a set of scaled residuals that have, exactly, unit 
standard deviation. 


Sufficient statistic. An estimator is said to be a sufficient statistic for 
an unknown parameter if the distribution of the sample given the 
statistic does not depend on the unknown parameter. This means that 
the distribution of the estimator contains all of the useful information 
about the unknown parameter. 


Tabular CUSUM. A numerical algorithm used to detect assignable 
causes on a cumulative sum control chart. See V mask. 


Tampering. Another name for overcontrol. 


t Distribution. The distribution of the random variable defined as the 
ratio of two independent random variables. The numerator is a 
standard normal random variable and the denominator is the square 
root of a chi-square random variable divided by its number of degrees 
of freedom. 


Test statistic. A function of a sample of observations that provides the 
basis for testing a statistical hypothesis. 


Time series. A set of ordered observations taken at points in time. 


Tolerance interval. An interval that contains a specified proportion 
of a population with a stated level of confidence. 


Tolerance limits. A set of limits between which some stated propor- 
tion of the values of a population must fall with a specified level of 
confidence. 


Total probability rule. Given a collection of mutually exclusive 
events whose union is the sample space, the probability of an event 
can be written as the sum of the probabilities of the intersections of the 
event with the members of this collection. 


Treatment. In experimental design, a treatment is a specific level of 
a factor of interest. Thus, if the factor is temperature, the treatments 
are the specific temperature levels used in the experiment. 


Treatment effect. The mean change to the response due to the 
presence of the treatment. 


Treatment sum of squares. In analysis of variance, this is the sum of 
squares that accounts for the variability in the response variable due to 
the different treatments that have been applied. 


t-test. Any test of significance based on the f¢ distribution. The most 
common f-tests are (1) testing hypotheses about the mean of a normal 
distribution with unknown variance, (2) testing hypotheses about the 
means of two normal distributions, and (3) testing hypotheses about 
individual regression coefficients. 


Two-level factorial experiment. A full or fractional factorial experi- 
ment with all factors tested at only two levels (settings) each. See 2" 
factorial experiment. 


Type I error. In hypothesis testing, an error incurred by rejecting a 
null hypothesis when it is actually true (also called an a-error). 


Type Il error. In hypothesis testing, an error incurred by failing to 
reject a null hypothesis when it is actually false (also called a B-error). 


U chart. An attribute control chart that plots the average number of 
defects per unit in a subgroup. Also called a defects-per-unit control 
chart. Similar to a C chart. 


Unbiased estimator. An estimator that has its expected value equal 
to the parameter that is being estimated is said to be unbiased. 


Uniform random yariable. Refers to either a discrete or continuous 
uniform random variable. 


Uniqueness property of moment generating function. Refers to the 
fact that random variables with the same moment generating function 
have the same distribution. 
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Universe. Another name for population. 


V mask. A geometrical figure used to detect assignable causes on a 
cumulative sum control chart. With appropriate values for parameters, 
identical conclusions can be made from a V mask and a tabular 
CUSUM. 


Variable selection. The problem of selecting a subset of variables for 
a model from a candidate list that contains all or most of the useful 
information about the response in the data. 


Variables control chart. Any control chart for a continuous random 
variable. See Attribute control chart. 


Variance. A measure of variability defined as the expected value of 
the square of the random variable around its mean. 


Variance component. In analysis of variance models involving 
random effects, one of the objectives is to determine how much 
variability can be associated with each of the potential sources of 
variability defined by the experimenters. It is customary to define a 
variance associated with each of these sources. These variances in 
some sense sum to the total variance of the response, and are usually 
called variance components. 


Variance inflation factors. Quantities used in multiple regression to 
assess the extent of multicollinearity (or near linear dependence) in 
the regressors. The variance inflation factor for the ith regressor VIF; 
can be defined as VIF; = [1/(1 — R3)], where R? is the coefficient of 
determination obtained when x; is regressed on the other regressor 
variables. Thus, when x; is nearly linearly dependent on a subset of the 
other regressors, R3 will be close to unity and the value of the corre- 
sponding variance inflation factor will be large. Values of the variance 
inflation factors that exceed 10 are usually taken as a signal that 
multicollinearity is present. 


Warning limits. Horizontal lines added to a control chart (in addition 
to the control limits) that are used to make the chart more sensitive to 
assignable causes. 


Weibull random variable. A continuous random variable that is 
often used to model the time until failure of a physical system. The 
parameters of the distribution are flexible enough that the probability 
density function can assume many different shapes. 


Western Electric rules. A specific set of run rules that were devel- 
oped at Western Electric Corporation. See Run rules. 


Wilcoxon rank-sum test. A nonparametric test for the equality of 
means in two populations. This is sometimes called the Mann- 
Whitney test. 


Wilcoxon signed-rank test. A distribution-free test of the equality of 
the location parameters of two otherwise identical distributions. It is 
an alternative to the two-sample t-test for nonnormal populations. 


With replacement. A method to select samples in which items are 
replaced between successive selections. 


Without replacement. A method to select samples in which items 
are not replaced between successive selections. 


X chart. A control chart that plots the average of the measurements 
in a subgroup that is used to monitor the process mean. 


2? factorial design, 571 

2} factorial design, 577 

2* designs-mask on a cumulative sum control 
chart, 678 

2* factorial designs, 571 

2"! design, 602 

2*» fractional factorial designs, 608 


A 


Acceptance region, 286, also see critical 
region 

Actual capability of a process, 664 

Addition of center points to a 2" design, 588 

Addition rules of probability, 37 

Adjusted R?, 472 

Aliases in a fractional factorial design, 603 

All possible regression, 495 

Alternate fraction, 604 

Alternative hypothesis, 285, 292 

Analysis of a second-order response surface 
model, 622 

Analysis of variance (ANOVA), 417, 470, 
520, 523, 534, 539, 540, 559 

Analytic study, 12 

ANOVA sum of squares identity for a 
randomized complete block design, 539 

ANOVA sum of squares identity for a 
single-factor experiment, 518 

ANOVA sum of squares identity for a 
two-factor factorial experiment, 560 

Approximate sampling distribution of a 
difference in two sample means, 229 

Assignable causes, 640 

Attributes control charts, 643, 668 

Average run length, 674 

Axioms of probability, 34 
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Backward variable elimination in 
regression, 500 

Bayes’ theorem, 55, 56 

Bayesian estimation of parameters, 244 


Bayesian networks, 56 
Bernoulli trial, 80, 86, 440 


762 


Index 


Beta distribution, 146, 147, 704 

Beta random variable, 146 

Bias of an estimator, 231 

Binomial distribution, 79, 81, 127, 704 
Binomial expansion, 81 

Binomial probability table, 705 
Binomial random variable, 81 
Bivariate normal distribution, 177 
Bivariate probability distributions, 154 
Blocking, 538, 595 

Bounds on a P-value, 311 

Box plots, 208 
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Calculating normal probabilities, 123 
Calculation of interaction effects in a 2" 
factorial, 572, 578, 580 
Categorical regressor variables, 492 
Cause-and-effect relationships, 6, 352, 405 
Center line on a control chart, 11 
Center points and checking for 
curvature, 588 
Central composite design, 622, 625 
Central limit theorem, 118, 227 
Chance causes, 640 
Chi-square distribution, 266 
Chi-square distribution percentage 
points, 710 
Chi-square goodness of fit test, 330 
Combinations, 26, 27 
Comparative experiments, 7, 285, 351 
Complement of an event, 23 
Completely randomized design, 517 
Completely randomized experiment, 
315, 515 
Components of variance, 518, 534 
Conditional distribution of bivariate normal 
random variables, 179 
Conditional probability, 41, 42 
Conditional probability distributions, 
158, 159 
Confidence coefficient, 254 
Confidence interval, 252, 253, 254, 295, 358, 
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Confidence interval on a correlation 
coefficient, 433 

Confidence interval on the difference in 
means of two normal distributions, 
variances known, 358 

Confidence interval on the difference in 
means of two normal distributions, 
variances unknown, 368, 369 

Confidence interval on the difference in two 
proportions, 392 

Confidence interval on the mean response in 
regression, 421, 480 

Confidence interval on the ratio of the vari- 
ances of two normal distributions, 387 

Confidence intervals on a proportion, 
271, 272 

Confidence intervals on means in ANOVA, 
521, 522, 523 

Confidence intervals on regression 
coefficients, 421, 479 

Confidence intervals on the mean of a 
normal distribution, variance known, 
253, 254, 257 

Confidence intervals on the mean of a 
normal distribution, variance unknown, 
261, 263 

Confidence intervals on the variance and 
standard deviation of a normal 
distribution, 266, 268 

Confounding, 595 

Connection between hypothesis testing and 
confidence intervals, 295 

Contingency tables, 333 

Continuity correction, 128 

Continuous random variable, 58, 107, 108, 
110, 144 

Continuous sample space, 20 

Continuous uniform distribution, 116, 704 

Continuous uniform random variable, 116 

Contrasts and effect estimates in a 2" 
factorial, 573 

Contrasts in a 2* factorial, 573 

Control chart, 11, 639 

Control chart constants, 728 


Control chart for defects per unit, 670 

Control chart model, 642 

Control chart performance, 673 

Control charts for individual 
measurements, 658 

Cook’s distance, 487 

Correlation, 170, 173 

Correlation and independent random 
variables, 174 

Correlation coefficient, 432 

Counting techniques, 24, 25, 26 

Covariance, 170, 171 

Covariance matrix, 461 

C, statistic in regression, 495 

Critical region, 286 

Critical values, 286 

Cumulative distribution function, 71, 72, 
111, 112 

Cumulative frequency plot, 206 

Cumulative sum control chart, 676 


D 


Data collection, 5 

Data versus information, 6 

Decision interval, 679 

Defect concentration diagram, 689 

Defining contrast, 596 

Defining relation for a fractional factorial 
design, 602, 609 

Deming’s 14 points, 690 

Descriptive statistics, 191 

Design generator, 602 

Design matrix, 577 

Design of a control chart, 644 

Design resolution, 608 

Designed experiment, 5, 6, 296, 405, 513, 
514, 552 

Diagonal elements of the hat matrix, 486 

Digidot plot, 211 

Discrete random variables, 58, 66, 67 

Discrete sample space, 20, 31 

Discrete uniform distribution, 77, 704 

Distribution of a subset of random 
variables, 165 

Dot diagram, 4 


E 

Effect of sample size on type II error 
probability, 290 

Empirical model, 12, 13, 402 

Engineering method, 2 

Enumerative study, 12 

Equally likely outcomes, 32 

Erlang distribution, 138, 704 

Estimation of parameters, 195 

Events, 18, 22, 23 

Expected value, 74, 114 

Expected value of a function of a continuous 
random variable, 114 


Expected value of a function of a discreet 
random variable, 76 

Expected value of a function of two random 
variables, 171 

Exponential distribution, 132, 704 

Exponential random variable, 132 

Exponentially weighted moving average 
(EWMA) control chart, 682 

Extra sum of squares method, 474 

Extrapolation in regression, 482 


F 
Factor levels, 7 
Factorial experiment, 7, 552, 555, 557, 558, 
568, 571 
Factorial experiments and interaction, 557 
F-distribution, 382 
F-distribution percentage points, 712 
Finite population correction factor, 95 
First quartile, 201 
First-order response surface model, 619 
Fisher least significance difference method 
of comparing means following 
ANOVA, 524 
Fixed-effects model, 517, 559 
Fixed significance level testing, 294, 
312, 324 
Formulating one-sided hypotheses, 293 
Forward variable selection in regression, 500 
Fraction defective control chart, 668 
Fractional factorial experiment, 8, 9, 602 
Frequency distributions, 203 
Fundamental theorem of calculus, 112 


G 

Gamma distribution, 138, 139, 704 

Gamma function, 139 

Gamma random variable, 139 

Gaussian distribution, 118, also see normal 
distribution 

General factorial experiments, 568 

General functions of random variables, 
185, 186 

General regression significance test, 474 

Generalized interaction, 599 

Geometric distribution, 86, 704 

Geometric random variable, 86 

Goodness-of-fit tests, 330 

Graphical comparison of means following 
ANOVA, 525 


H 

Hat matrix in regression, 486 

Histogram, 109, 203, 204 

Hypergeometric distribution, 92, 93, 704 

Hypergeometric random variable, 93 

Hypothesis, 284 

Hypothesis testing, 7, 223, 284, 286, 295, 
296, 299, 310, 319, 330, 333, 337, 342, 
358, 362, 373, 384 
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Hypothesis testing on a population 
proportion, 323 

Hypothesis testing on the difference in means 
of two normal distributions, variances 
unknown, 361, 362, 365 

Hypothesis testing on the difference in two 
proportions, 389 

Hypothesis testing on the mean of a normal 
distribution, variance known, 299 

Hypothesis testing on the mean of a normal 
distribution, variance unknown, 310 

Hypothesis testing on the ratio of the vari- 
ances of two normal distributions, 384 

Hypothesis testing on the variance and 
standard deviation of a normal 
distribution, 319 


I 

Independence, 50, 161 

Independent events, 51 

Independent random variables, 161, 162, 166 
Indicator variables in regression, 492 
Influential observations in regression, 487 
Interaction, 8, 452, 555 

Interpreting a confidence interval, 255 
Interquartile range, 201 

Intersection of events, 23 


J 

Jacobian, 186 

Joint probability density function, 155 

Joint probability distributions, 152, 153, 155, 
163, 176 

Joint probability mass function, 154 


L 

Lack of memory property of the exponential 
random variable, 134, 135 

Lack of memory property of the geometric 
random variable, 88 

Large sample confidence interval for a 
parameter, 269 

Large sample hypothesis testing on the mean 
of a normal distribution, 307 

Large sample test for difference in two 
means, 355 

Large-sample confidence interval for the 
mean, 258 

Least squares estimators, 407, 408, 454, 457 

Least squares normal equations for multiple 
linear regression, 454, 456 

Least squares normal equations for simple 
linear regression, 407 

Likelihood function, 239 

Linear combinations of random 
variables, 181 

Logistic regression, 440 

Logit response function, 441 

Lognormal distribution, 144, 704 
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Lognormal random variable, 144 
Lower confidence limit, 254 
Lower control limit, 11, 645 


M 

Main effect, 555, 572, 578, 579 

Marginal distributions of bivariate normal 
random variables, 178 

Marginal probability distribution, 156, 
157, 164 

Maximum likelihood estimator, 239, 240 
Mean and variance of a continuous random 
variable, 114 

Mean and variance of a discrete random 
variable, 74 

Mean and variance of an average, 183 
Mean of a linear combination of random 
variables, 182 

Mean squared error of an estimator, 234 
Mean squares, 519 

Mechanistic model, 12, 13 

Median, 199, 337 

Method of least squares, 406, 452 
Method of maximum likelihood, 239 
Method of moments, 237 

Method of steepest ascent, 620 

Minimum variance unbiased estimator, 233 
Model adequacy checking, 426, 484, 526, 
527, 544, 564 

Moment estimators, 238 

Moments, 237 

Moving range control chart, 658 
Multicollinearity in regression, 502 
Multinomial probability distribution, 176 
Multiple comparisons in ANOVA, 524, 542 
Multiple linear regression model, 450 
Multiplication rule for counting 
techniques, 25 

Multiplication rule for probability, 47 
Mutually exclusive events, 23, 39 


N 
Negative binomial distribution, 86, 88, 89 
Noncentral t distribution, 315 

Nonparametric statistical tests, 337, 342, 
344, 373 

Normal approximation for the sign test, 340 
Normal approximation to Poisson 
distribution, 127, 130 

Normal approximation to the binomial 
distribution, 127, 128, 270 

Normal distribution, 118, 704 

Normal probability plot, 215, 216 

Normal probability plot of effects, 606, 611 
Normal probability plotting of effects, 585 
Normal probability plotting of residuals, 426, 
485, 526, 544, 565, 576 

Normal random variable, 119 

Nuisance variables in an experiment, 538 
Null hypothesis, 285 


O 

Observational study, 5, 6, 352 

One observation per cell in a factorial, 566 

One-half fraction, 9 

One-sided and two-sided hypotheses, 292, 
300, 311, 320 

One-sided confidence bounds, 257, 264, 
268, 272 

Operating characteristic curves, 305, 315, 
322, 527, 717, 718, 719, 720.721, 722, 
723, 724, 725 

Optimal estimator, 235 

Optimization experiment, 553, 619 

Ordered stem-and-leaf diagram, 199 

Orthogonal design, 581 

Outlier, 208 

Outliers, 486 

Overcontrol, 10 

Overfitting, 472 


P 
Paired t-test, 376 
Parameter estimation, 223 
Pareto chart, 207, 688 
Partial F-test, 475 
Partial regression coefficients, 451 
Patterns on control charts, 647 
P-chart, 668 
Percentile, 201 
Permutations of similar objects, 26 
Permutations, 25 
Point estimate, 224 
Point estimation, 224, 231, 237 
Point estimator, 225 
Poisson distribution, 97, 98, 127, 
130, 704 
Poisson process, 98 
Poisson random variable, 98 
Polynomial regression models, 451, 490 
Pooled estimate of variance, 362 
Pooled t-test, 362 
Population standard deviation, 195 
Population variance, 195 
Population, 4 
Posterior distribution, 244 
Potential capability of a process, 664 
Power of a statistical test, 292 
Prediction interval for a future observation, 
274, 275 
Prediction interval in regression, 423, 481 
PRESS statistic in regression, 496 
Principal block, 596 
Principal fraction, 604 
Prior distribution, 244 
Probability, 15, 17, 31, 33 
Probability as degree of belief, 32 
Probability density function, 108, 109 
Probability distribution, 68, 107 
Probability mass function, 69 
Probability model, 16 


Probability plots, 214, 330 

Process capability determination, 662 

Process capability ratios, 663, 665 

Projection of 2 designs, 584 

Projection of a fractional factorial 
design, 607 

Properties of the maximum likelihood 
estimator, 241 

P-value, 294, 300, 311, 321, 324, 386 


Q 


Quality control and improvement, 638 
Quartiles, 200 


R 

R chart, 651 

R?, 428, 472 

Raleigh distribution, 142 

Random effects model in ANOVA, 534 

Random experiments, 18, 19 

Random factors in an experiment, 533 

Random sample, 43, 44, 226 

Random variable, 4, 57, 58 

Randomization, 6, 526, 558 

Randomized complete block design 
(RCBD), 538 

Randomized complete block design 
ANOVA, 540 

Rational subgroups, 646 

Reference distribution, 301, 354 

Reference value, 679 

Regression coefficients in a 2* factorial, 575 

Regression model, 14, 401, 402, 404, 449, 
450, 574, 583 

Regressor variable, 405, 451 

Relative efficiency of an estimator, 235 

Relative frequency distribution, 204 

Relative frequency, 32 

Replication, 515 

Reproductive property of the normal 
distribution, 183 

Residual, 407, 457, 526, 544, 564, 574 

Residual plots, 426, 484, 485, 526, 544, 564, 
576, 577, 583 

Resolution of designs, 608 

Resolution III designs, 608 

Resolution IV designs, 608 

Response surface, 619 

Response variable, 405 

Retrospective study, 5 

Rotatable second-order design, 625 

Runs rules for control charts, 649 


N) 

Sample, 4 

Sample correlation coefficient, 432 

Sample mean, 192 

Sample range, 195 

Sample sizes for confidence intervals, 256, 
272, 359 


Sample sizes in ANOVA, 527 

Sample sizes in hypothesis tests, 303, 304, 
305. 314, 322, 326, 356, 357, 367, 
387, 391 

Sample space, 18, 19, 20 

Sample standard deviation, 193 

Sample variance, 193 

Sampling distribution, 224, 225, 226 

Sampling with replacement, 27 

Sampling without replacement, 27 

Saturated fractional factorial design, 615 

Scatter diagram, 403 

Scatter, see variability 

Science of data, see statistics 

Scientific method, 2 

Screening experiment, 553 

Second-order response surface model, 
619, 622 

Sensitivity of a statistical test, 292 

Shewhart control charts, 642 

Sign test, 337, 340, 341, 344 

Sign test critical values, 726 

Simple linear regression model, 404, 405 

Single replicate of a 2" design, 585 

Single-sample hypothesis testing, 7, 310, 
319, 323 

Six-sigma process, 666 

Sources of variability, 3 

Sparsity of effects principle, 585 

Standard deviation, 74, 114 

Standard error of a point estimator, 233 

Standard error of a regression coefficient in a 
2* factorial, 575 

Standard error of regression coefficients, 
415, 461 

Standard normal distribution, 120, 144, 301 

Standard normal distribution cumulative 
probability table, 708 

Standard normal random variable, 120, 144 

Standardized residuals, 484 

Standardizing a normal random variable, 122 

Statistic, 224, 226 

Statistical inference, 4, 223, 225 

Statistical process control (SPC), 12 

Statistical process control, 640 

Statistical quality control, 638, 639 

Statistical thinking, 4 


Statistical versus practical significance, 
296, 297 

Statistics, 3 

Stem-and-leaf diagrams, 197 

Stepwise regression, 499, 500 

Strong versus weak conclusions in 
hypothesis testing, 291, 292 

Studentized residuals, 430, 486 

Summary table and guidelines for 
one-sample inference procedures, 274 

Summary table and guidelines for 
two-sample inference procedures, 395 

Sums of squares in a 2* factorial, 573 


T 

Tabular cumulative sum control chart, 679 

t-distribution, 262, 310, 315 

t-distribution percentage points, 711 

Test for significance of regression, 
416, 417, 470 

Test for zero correlation, 433 

Test of homogeneity in a contingency 
table, 335 

Test of independence in a contingency 
table, 333 

Test statistic, 100, 310, 319, 324 

Tests on individual regression coefficients in 
multiple regression, 472 

Third quartile, 201 

Three-sigma control limits, 673 

Tier chart, 662 

Ties in the sign test, 340 

Time series data, 9, 210 

Time series plot, 9, 210 

Time weighted control charts, 676 

Tolerance chart, 662 

Tolerance interval for a normal 
distribution, 276 

Total probability, 47, 48 

Transformation of variables, 437 

Treatments, 351, 515 

Tree diagram, 21 

t-test statistic for a regression coefficient in a 
2* factorial, 575 

t-tests in multiple regression, 473 

t-tests in simple linear regression, 415 

t-tests, 310, 362, 365, 376, 415, 473, 575 
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Two-factor factorial experiment, 558 

Two-factor interaction, 556 

Two-sample hypothesis testing, 7 

Type I error, 287, 288 

Type II error, 287, 289, 290, 292, 
303, 367, 459, 460 


U 

U-chart, 670 

Unbalanced single-factor design 
ANOVA, 523 

Unbiased estimator, 231, 409, 414 

Union of events, 23 

Upper confidence limit, 254 

Upper control limit, 11, 645 


Vv 

Variability, 3 

Variable selection in regression, 
472, 494, 499 

Variables control charts, 643 

Variance component estimation, 535 

Variance inflation factors, 502 

Variance of a linear combination of random 
variables, 182 

Variance of a point estimator, 232 

Venn diagram, 23 

Verifying assumptions, 214 


WwW 
Warning limits on control charts, 649 
Weibull distribution, 141, 704 
Weibull random variable, 141 
Western Electric rules on control charts, 648 
Wilcoxon rank-sum test, 373 
Wilcoxon rank-sum test critical values, 727 
Wilcoxon signed-rank test, 342, 344 
Wilcoxon signed-rank test critical 

values, 736 
Word in a defining relation, 609 


x 


X and R charts, 649 
X and S chart, 651, 652 


Z 
Z-tests, 299, 323, 358 


Applications in Examples and Exercises, continued 


Bolts in bearing cap and plate 
Brake contact resistance 
Casing for a gear housing 
Cast aluminum parts 

Circular tubes yield strength 
Cold start ignition time 
Connector pull-off force 


Copper plate warping 
Cycles to failure 


Deflection temperature for plastic pipe 
Dot diameter 

Drag coefficient 

Electromechanical product 

F-117A mission duration 

Fatigue crack growth 

Flatness distortion 

Fretting wear 


Gap width of a magnetic recording head 
Glass bottle thickness 


Height of leaf springs 
Hole diameter 


Jet-turbine or rocket thrust 


Machined dimensions 
Mechanical assembly 
Missile miss distance 
Molded parts 
Nonconforming coil springs 
Nozzle velocity 
Particleboard deflection 
Precision measuring instruments 
Robotic insertion tool 
Shaft and washer assemblies 
Shear strengths 

of Rubber 

of Spot weld 


Sheet metal operation 

Space shuttle flight control system 
Spindle saw processes 

Surface roughness 


Suspension helmets impact test 
Suspension rod, piston rings, PVC pipe, 
and other diameters 


Temperature of joint for O-rings 
Tire life 


Tool life 


Wear in auto parts 
Wire bond strength 
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Exercises 2-177, 3-170, 5-77 

Exercises 13-45 

Example 2-9 

Exercise 2-27 

Exercise 6-10 

Exercises 6-53, 6-64, 6-98 

Exercises 6-67, 7-28, 9-4, 
9-131 

Exercises 14-6, 14-9 

Exercises 6-23, 6-27, 6-39, 
6-43, 6-79 

Exercise 10-18 

Exercises 4-70, 4-174 

Exercises 6-18, 6-56, 6-105 

Exercise 2-44 

Exercise 6-13 

Exercise 14-8 

Exercise 14-62 

Exercises 11-9,11-47, 
11-57, 11-61, 11-63 

Exercise 4-23 

Exercises 8-40, 8-70, 8-72, 
8-82 

Exercise 14-73 

Examples 4-2, 4-4, 4-8, 
4-31, 8-39 

Exercises 8-10, 9-74, 15-21, 

15-63 

Exercises 12-74,12-96, 
12-97, 12-101 

Examples 5-12, 5-13 

Example 3-27 

Example 14-7 

Example 2-1 

Exercises 6-91, 9-134 

Exercise 13-37 

Exercises 11-14,11-34 

Exercise 9-132 

Exercise 2-184 

Exercise 15-41 

Exercise 10-37 

Exercise 12-72 

Exercises 6-31, 6-47, 6-61, 
9-136, 11-69 

Exercise 2-40 

Exercises 3-22, 3-57 

Exercise 10-25 

Examples 14-2, 14-4, 
10-14, 12-13 
Exercises 2-78, 2-109, 
2-158, 5-19, 13-42, 14-22, 
14-67, 14-69 

Exercise 8-58 


Example 4-16 
Exercises 6-7, 6-73, 7-3, 8-13, 
8-34, 8-68, 8-78, 10-15, 
10-57, 15-7, 15-33, 16-77 

Exercises 6-19, 6-57, 6-77 

Exercises 8-27, 8-61, 8-73, 
9-65, 9-81, 10-39 

Exercises 14-12, 14-30, 14-56 

Exercises 9-41, 10-23 

Examples 11-8, 12-1, 12-2, 
12-3, 12-4, 12-5, 12-6, 12-7, 
12-8, 12-9, 12-10, 12-11 
Exercises 12-8, 12-28, 12-46, 
12-82, 15-9, 15-15, 15-35, 
12-76, 16-7, 16-11, 16-29 


MEDICAL 


ACL reconstruction surgery 
Antirheumatoid medication 
Artificial hearts 

Bacteria exposure 

Basal metabolic rate (BMR) 
Cholesterol level 

Completing a blood sample test 
Diagnostic kit revenue 
Diagnostic 

Heart failure 

Hemoglobin level 

Knee injuries 

Lung cancer 

Meniscal tear 

Noise exposure and hypertension 
Pain medication 

Patient satisfaction 

Plasma antioxidants from chocolate 
Radiation dose in X-ray 


Recovering from an illness 

Salk polio vaccine trials 

Sick-leave time 

Skin desquamation (peeling) 

Success rates in kidney stone removals 

Surgical versus medical patients 

Syringe length 

Therapeutic drugs 

Tissue assay by liver manganese 

Treatment of renal calculi by operative 
surgery 

Weight and systolic blood pressure 


PHYSICS 


Alignment accuracy of optical chips 
Atomic clock 

Current draw in magnetic core 
Density measurement error 


Density of the earth 
Geiger counts 


Laser diode samples 

Laser failure time 

Number of stars 

Optical correlator modulation 


Oxygen in a superconducting compound 
Porosity of ultrafine powder 

Shortened latencies in dry atmosphere 
Silver particles in photographic emulsion 
Solar intensity 


Supercavitation for undersea vehicles 
Thermal conductivity 

Transducer calibration 

Tube conductivity 

Velocity of a particle in a gas 
Velocity of light 

Voids in a ferrite slab 

Wavelength of radiation 


Exercises 4-59, 5-61 
Exercise 5-76 
Exercise 9-47 
Exercise 4-159 
Exercise 8-100 
Exercises 4-60, 10-41 
Exercise 4-154 
Exercise 2-200 
Example 2-37 
Exercises 2-112, 3-90, 3-107 
Exercises 8-86, 8-89 
Exercises 2-76, 3-19 
Exercise 8-56 
Exercises 8-56, 10-68 
Exercise 11-58 
Exercises 3-162, 10-84 
Exercises 12-5, 12-106 
Exercises 13-2, 13-25 
Exercises 12-11, 12-31, 
12-44, 12-61, 12-79 
Exercise 3-161 
Exercise 10-79 
Exercise 4-158 
Exercise 2-88 
Exercise 2-115 
Exercise 9-104 
Exercises 15-10, 15-16 
Exercise 9-133 
Exercise 9-144 


Exercises 9-89, 9-99 
Exercises 11-72, 11-89 


Exercise 14-64 

Exercise 11-94 

Exercise 11-73 

Exercises 12-13, 12-49, 
12-63, 12-81 

Exercise 6-102 

Example 4-23 

Exercise 4-93 

Example 2-15 

Exercises 4-115, 4-160 

Exercise 3-133 

Exercises 12-15, 12-33, 
12-51, 12-65, 12-83 

Exercises 13-10, 13-22 

Exercises 11-12, 11-50 

Exercise 8-99 

Exercise 4-164 

Exercises 6-12, 6-69, 
6-76, 8-34 

Exercise 9-45 

Example 7-5 

Exercise 3-150 

Exercises 13-7, 13-21 

Exercise 5-72 

Exercise 6-101 

Exercises 2-12, 6-30, 6-46 

Exercise 3-69 


SEMICONDUCTOR MANUFACTURING 
Examples 2-17, 2-19, 2-20, 2-27, 2-28, 2-33, 3-2, 3-3, 3-5, 3-21, 4-26, 


10-11, 10-13, 14-3 


Exercises 2-41, 2-92, 2-95, 2-168, 3-21, 3-84, 4-61, 4-80, 5-83, 6-36, 
6-50, 6-62, 7-29, 7-44, 9-88, 9-124, 12-9, 12-29, 12-43, 12-69, 14-17, 


14-25, 14-41, 14-58, 14-65, 15-18, 15-84 


SPORTS 


Baseball coefficient of restitution 
Electromyostimulation 

Football helmets 

Golf 


Hockey player performance 
Liveliness of baseballs 
Major League Baseball 
National Hockey League 


Quarterback ratings 


TEXTILE MANUFACTURING 
Examples 13-4, 13-5 


Breaking strength of yarn 
Thread elongation of a drapery yarn 


Exercises 9-62, 9-139 
Exercise 8-48 
Exercise 9-91 
Examples 9-6, 9-7 
Exercises 6-9, 6-33, 
6-49, 6-95, 6-99, 9-69, 
9-138, 10-29, 10-30, 
10-61, 10-62 
Exercise 9-77 
Exercises 8-92, 8-93 
Exercise 12-104 
Exercises 12-18, 12-38, 
12-56, 12-70, 12-86 
Exercises 11-3, 11-25, 
11-41, 11-53, 11-75, 
12-17, 12-35, 12-53, 
12-59, 12-87 


Exercises 6-24, 6-28, 6-40, 
6-44, 13-29, 14-10, 14-18, 
14-32, 14-63 

Exercises 8-10,10-78, 10-80 

Exercises 9-5, 9-6, 9-7, 9-8, 
9-9 


INDUSTRIAL ENGINEERING AND ECONOMICS 


Airlines 
Overbooking 
Arrival and waiting times 


Passenger airline miles flown in UK 
Automobile features 


Bayesian network 
Breakdowns by shift 
Buying tickets by telephone 
Calculator owners 

College graduates in Tempe 
Credit card numbers 
Customer design evaluation 
Customer sampling 


Cytogenic revenue 
Diamond prices 
Disabled population 
Engineering education 


Fraud detection in phone cards 

Impact of quality on schedules and costs 
Inspection of shipments from suppliers 
Inspection 

Installation technician service 

License numbers 

Lottery 

Machine schedules 

Monthly champagne sales in France 
Multiple choice exam 

Optical inspection 

Orders for computer systems 

Parallel parking 


Pension plan preference 
Presidential elections 


Price of an electronic storage device 


Exercises 3-93, 4-180 
Exercises 4-22, 15-15, 
15-25, 7-12 
Exercise 6-70 
Examples 2-4, 2-5 
Exercise 2-14 
Example 2-38 
Exercise 9-103 
Exercise 2-179 
Exercise 7-33 
Exercise 9-25 
Exercises 2-62, 2-135 
Exercise 2-147 
Example 3-29 
Exercises 2-8, 2-34, 
3-173, 9-86 
Exercise 3-25 
Exercise 11-95 
Exercise 4-81 
Exercises 5-9, 8-105, 8-92, 
9-105, 9-108, 11-71 
Exercise 2-144 
Exercise 3-94 
Exercise 3-182 
Exercise 9-87 
Exercise 3-172 
Exercise 2-63 
Exercise 3-124 
Examples 2-11, 2-36 
Exercise 6-91 
Exercise 3-88 
Exercise 3-20 
Exercises 2-16, 2-35 
Example 10-11 
Exercise 10-38 
Example 9-14 
Exercises 2-110, 2-146, 
8-55, 10-69 
Exercise 3-23 


Prices of houses 


Printer orders 
Product and bar codes 


Repeatability in component assembly 
Revenue potential 


Risk analysis 

Shipment of customers’ orders 
Soldiers killed by horse kicks 
Survey favoring toll roads 
Time between arrivals 


Time to 
Fill an electronic form 
Locate a part 
Make pottery 
Prepare a micro-array slide 
Recharge a battery 
Unemployment data 
Unlisted phone numbers 


DEFECTIVES, FLAWS, ERRORS 


Automobile front lights 
Bearings 

Calculators 

Computers and fans 
Connector 

Cotton rolls 

Electronic components 


Contamination 
Integrated circuits 
Lenses 


Machining stages 
Optical or magnetic disks 


Optical alignment 

Orange juice containers 

Ovens 

Oxygen containers 

Pareto chart for automobile doors 
Printed circuit boards 

Printing 

Response surface design for yield 
Surface flaws in parts, steel, and panels 


Textbook 
Water meter 
Wires 


Yield 


LIFE AND FAILURE TIMES 


Assembly and components 


Batteries 


Exercises 11-4, 11-26, 
11-42, 11-54 
Exercise 5-94 
Examples 2-12, 3-13 
Exercise 3-67 
Exercise 10-55 
Example 3-10 
Exercise 5-93 
Exercise 5-95 
Exercise 2-174 
Exercise 3-135 
Exercise 9-26 
Exercises 4-45, 4-99, 
4-104, 4-119, 4-162, 5-20, 
5-21 


Exercise 4-42 
Exercise 5-87 
Exercise 5-58 
Example 4-24 
Exercise 4-58 
Exercise 6-85 
Exercise 10-81 


Exercise 2-81 

Example 10-16 

Exercise 8-96 

Exercises 2-111, 3-109, 4-97 

Exercise 2-107 

Exercise 2-108 

Exercises 4-8, 6-111, 8-101, 
9-107, 11-68 

Exercise 3-140 

Exercises 7-59, 8-53 

Exercises 9-90, 10-71, 
15-87 

Example 2-26 

Example 3-33 
Exercises 3-137, 3-171 

Exercise 3-103 

Exercise 2-125 

Exercise 5-45 

Exercise 2-161 

Exercise 6-52 

Example 9-12 

Exercises 4-186, 5-4 

Example 14-12 

Examples 2-23, 2-30 
Exercises 3-7, 3-139, 
3-176, 3-184, 4-106, 9-97, 
15-76 

Exercise 3-160 

Exercise 4-82 

Examples 3-31, 3-32 
Exercise 15-44 

Examples 6-5, 10-5, 10-7, 
14-6 

Exercises 5-26, 6-25, 6-41, 
8-11, 13-30, 14-20, 14-42, 
14-47, 14-53, 14-72 


Example 5-14 

Exercises 4-103, 4-83, 5-82 
Example 6-7 

Exercises 9-43, 9-93, 14-3 


767 


Censored components 
CPU 


Door latch 

Electronic amplifier 

Light bulb 

Machine 

Magnetic resonance imaging machine 
Packaged magnetic disk 
Recirculating pump 

Semiconductor laser 

Voltage regulators 

Yarn 


LENGTH OF 


Computer cable 

Door casing 

Hinge 

Injection-molded plastic case 
Keyway depth 

Metal rod 

Panels 

Plate glass parts 

Punched part 


768 


Exercise 7-76 
Example 4-23 
Exercises 4-129, 4-167 
Exercise 8-8 
Exercise 4-184 
Exercises 8-14, 8-16, 8-17 
Exercise 7-50 
Exercise 4-149 
Exercise 4-148 
Exercise 4-163 
Exercises 4-69, 4-143, 4-176 
Exercise 4-96 
Exercise 6-100 


Exercise 4-34 
Exercise 5-60 
Exercise 4-8 
Exercise 4-157 
Exercise 16-91 
Exercises 4-11, 4-21 
Exercise 5-85 
Exercise 3-68 
Exercise 5-56 


THICKNESS OF 


Aluminum cylinders 
Coatings 

Flange 

Halves 

Ink layers 


Laminated covering 

Layers and error propagation 

Parts 

Photoconductor and photo resist film 
Plastic connector 

Wood paneling 


WEIGHT OF 


Adobe bricks 

Candy 

Chemical herbicide 
Components 

Measured by a digital scale 
Paper 

Running shoes 

Sample and measurement error 


Exercise 4-57 
Exercises 3-66, 4-35 
Exercise 4-41 
Exercise 
Example 5-18 

Exercises 5-49, 5-59 
Exercise 4-173 
Examples 5-31, 5-91, 5-92 
Exercises 5-66, 15-11, 15-57 
Exercises 4-43, 10-22, 10-64 
Examples 2-2, 2-7 
Exercises 3-10, 3-42 


Exercises 5-27, 5-84 
Exercises 5-86, 7-49 
Exercises 4-9, 4-36, 4-40 
Exercises 5-65, 5-88 
Exercise 2-21 

Exercises 8-51, 9-78 
Exercise 4-71 

Exercise 4-72 
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Index of Applications in 


Examples and Exercises 


BIOLOGICAL 


Amino acid composition of soybean meal 


Anaerobic respiration 
Blood 
Cholesterol level 
Glucose level 
Hypertension 


Body mass index (BMI) 

Body temperature 

Cellular replication 

Circumference of orange trees 

Deceased beetles under autolysis and 
putrefaction 

Diet and weight loss 


Disease in plants 
Dugongs (sea cows) length 
Fatty acid in margarine 


Gene expression 


Gene occurrence 
Gene sequences 


Grain quality 
Height of plants 
Height or weight of people 


Insect fragments in chocolate bars 
1Q for monozygotic twins 

Leaf transmutation 

Leg strength 

Light-dependent photosynthesis 
Nisin recovery 


Pesticides and grape infestation 
Potato spoilage 
Protein 

in Livestock feed 

in Milk 


from Peanut milk 
Protopectin content in tomatoes 
Rat muscle 
Rat tumors 


Exercise 8-52 
Exercise 2-144 


Exercise 15-10 
Exercises 13-25, 14-37 
Exercises 4-143, 8-31, 
11-8, 11-30, 11-46 
Exercise 11-35 
Exercise 9-59 
Exercises 2-193, 3-100 
Exercise 10-46 


Exercise 2-92 

Exercises 10-43, 10-77, 
15-35 

Exercise 14-76 

Exercise 11-15 

Exercises 8-36, 8-66, 
8-76, 9-147, 9-113 

Exercises 6-65, 13-50, 
15-42 

Exercises 2-195, 3-11 

Exercises 2-25, 2-192, 
3-13, 3-147 

Exercise 8-21 

Exercises 4-170, 4-171 

Exercises 4-44, 4-66, 5-64, 
6-30, 6-37, 6-46, 6-63, 
6-73, 9-68 

Exercises 3-134, 4-101 

Exercise 10-45 

Exercises 2-88, 3-123 

Exercises 8-30, 9-64 

Exercise 2-24 

Exercises 12-14, 12-32, 
12-50, 12-64, 12-84, 14-83 

Exercise 10-94 

Exercise 13-14 


Exercise 14-75 

Exercises 13-13, 13-25, 
13-33 

Exercise 9-143 

Exercises 13-40, 15-40 

Exercise 6-15 

Exercise 8-50 


Rat weight 

Rejuvenated mitochondria 
Root vole population 
Sodium content of cornflakes 
Soil 


Splitting cell 

St John’s Wort 

Stork sightings 

Sugar content 

Synapses in the granule cell layer 
Tar content in tobacco 

Taste evaluation 


Tissues from an ivy plant 
Visual accommodation 

Weight of swine or guinea pigs 
Wheat grain drying 


CHEMICAL 


Acid-base titration 


Alloys 


Contamination 
Cooking oil 
Etching 


Infrared focal plane arrays 
Melting point of a binder 
Metallic material transition 
Moisture content in raw material 
Mole fraction solubility 

Mole ratio of sebacic acid 

Pitch carbon analysis 


Plasma etching 
Polymers 
Propellant 


Bond shear strength 


Burning rate 


Exercise 8-57 

Exercises 2-96, 3-88 

Exercise 14-16 

Exercise 9-61 

Exercises 3-24, 12-1, 12-2, 
12-23, 12-24, 12-41, 12-42 

Exercise 4-155 

Example 10-14 

Exercises 4-100, 11-96 

Exercises 8-46, 9-83, 9-114 

Exercise 9-145 

Exercise 8-95 

Exercises 14-13, 14-31, 
14-34, 14-50, 14-54 

Exercise 2-130 

Exercises 6-11, 6-16, 6-75 

Exercises 9-142, 13-48 

Exercises 13-47, 15-41 


Exercises 2-60, 2-132, 3-12, 
5-48 
Examples 6-4, 8-5 
Exercises 10-21, 10-44, 
10-59, 13-38, 15-17 
Exercise 2-128, 4-113 
Exercise 2-79 
Exercises 10-19, 10-65, 
10-34 
Exercise 9-146 
Exercise 9-42 
Examples 8-1, 8-2 
Exercise 3-6 
Exercises 12-75, 12-91 
Exercise 11-91 
Exercises 12-10, 12-36, 
12-50, 12-60, 12-68 
Examples 14-5, 14-8 
Exercise 7-32 
Exercises 7-15, 10-8, 
13-12, 13-24 


Examples 15-1, 15-2, 15-4 
Exercises 11-11, 11-31, 
11-49, 15-32 

Examples 9-1, 9-2, 9-3, 
9-4,9-5 Exercise 10-6 


Purity 
Thermal barrier coatings 


CHEMICAL ENGINEERING 


Aluminum smelting 
Automobile basecoat 
Blow molding 
Catalyst usage 
Concentration 


Conversion 
Cooling system in a nuclear submarine 
Copper content of a plating bath 


Dispensed syrup in soda machine 
Dry ash value of paper pulp 
Fill volume and capability 


Filtration rate 
Fish preparation 
Flow metering devices 


Foam expanding agents 


Green liquor 
Hardwood concentration 


Impurity level in chemical product 
Injection molding 


Laboratory analysis of chemical process 
samples 

Maximum heat of a hot tub 

Na,S concentration 


NbOCI, 
Oxygen purity 


pH 
and Catalyst concentration 
of Plating bath 
of a Solution 
of a Water sample 
Product color 
Product solution strength in 
recirculation unit 
Pulp brightness 
Reaction Time 


Redox reaction experiments 
Shampoo foam height 


Stack loss of ammonia 


Temperature 
Firing 
Furnace 
of Hall cell solution 
Vapor deposition 
Vapor phase oxidation of naphthalene 
Viscosity 


Exercise 15-42 
Exercise 10-75 


Exercise 10-92 

Exercises 14-56, 14-68 

Exercise 16-59 

Exercise 10-17 

Examples 16-2, 16-6 
Exercises 5-46, 6-68, 
6-84, 10-9, 10-54, 15-64 

Exercise 12-3 

Exercise 9-130 

Exercises 15-8, 15-34, 
15-58 

Exercises 8-29, 8-63, 8-75 

Exercise 14-57 

Examples 5-35, 8-6, 9-8, 9-9 
Exercises 2-180, 3-146, 
3-151, 4-62, 4-63, 5-62, 
9-100, 10-4, 10-85, 10-90, 
14-43, 15-38 

Exercise 14-44 

Exercise 13-46 

Examples 15-3, 15-5 
Exercises 9-126, 9-127 

Exercises 10-16, 10-56, 
10-88 

Exercise 12-100 

Example 13-2 

Exercise 14-11 

Exercises 15-3, 15-15 

Example 14-9 

Exercises 2-15, 2-137, 
10-70 


Exercise 2-43 

Exercise 10-33 

Exercises 11-7, 11-29, 
11-41, 11-62 

Exercise 6-36 

Examples 11-1, 11-2, 11-3, 
11-4, 11-5, 11-6, 11-7 


Exercise 14-61 
Exercises 15-1, 15-13 
Exercise 6-17 
Exercise 2-11 
Exercise 14-45 


Exercise 14-38 

Exercise 13-31 

Example 4-5 
Exercises 2-13, 2-33, 4-56 

Exercise 2-65 

Exercises 8-91, 9-15, 
9-16, 9-17, 9-18, 9-19, 
9-128 

Exercises 12-16, 12-34, 
12-52, 12-66, 12-85 


Exercise 13-15 

Exercises 6-55, 6-109 

Exercise 11-92 

Exercises 13-28, 13-32 

Exercise 6-54 

Exercises 6-66, 6-88, 6-90, 
6-96, 12-73, 12-103, 14-64, 
15-20, 15-36, 15-86 


Water temperature from power plant 
cooling tower 

Water vapor pressure 

Wine 


CIVIL ENGINEERING 


Cement and Concrete 
Hydration 
Mixture heat 


Mortar briquettes 
Strength 

Tensile strength 
Compressive strength 


Intrinsic permeability 


Highway pavement cracks 
Pavement deflection 


Retained strength of asphalt 
Speed limits 
Traffic 


Wearing seat belts 


Exercise 9-40 

Exercise 11-78 

Examples 12-14, 12-15 
Exercises 6-35, 6-51 


Example 10-8 
Exercises 9-10, 9-11, 9-12, 
9-13, 9-14 
Exercise 15-79 
Exercises 4-57, 15-24 
Exercise 15-25 
Exercises 13-3, 13-9, 
3-19, 14-14, 14-24, 
4-48, 7-7, 7-8, 8-13, 
8-18, 8-37, 8-69, 8-80, 
8-87, 8-90, 15-5 
Exercises 11-1, 11-23, 
1-39, 11-52 
xercise 3-138, 4-102 
xercises 11-2, 11-16, 
1-24, 11-40 
Exercises 13-11,13-23 
Exercises 8-59, 10-60 
Exercises 3-87, 3-149, 
3-153, 9-190 
Exercises 10-82, 10-83 


esmes) 


COMMUNICATIONS, COMPUTERS, AND NETWORKS 


Cell phone signal bars 

Cellular neural network speed 
Code for a wireless garage door 
Computer clock cycles 
Computer networks 


Corporate Web site errors 
Digital channel 


Electronic messages 

Email routes 
Encryption-decryption system 
Errors in a communications channel 


Passwords 


Programming design languages 

Response time in computer operation 
system 

Software development cost 

Telecommunication prefixes 

Telecommunications 


Transaction processing performance and 
OLTP benchmark 


Viruses 
Web browsing 


Examples 5-1, 5-3 

Exercise 8-39 

Exercise 2-34 

Exercise 3-8 

Example 4-21 Exercises 
2-10, 2-64, 2-164, 3-148, 
3-175, 4-65, 4-94 

Exercise 4-84 

Examples 2-3, 3-4, 3-6, 
3-9, 3-12, 3-16, 3-24, 
4-15, 5-7, 5-9, 5-10 

Exercises 3-158, 4-98, 4-115 

Exercise 2-184 

Exercise 2-181 

Examples 3-22, 4-17, 4-20 
Exercises 2-2, 2-4, 2-46, 
3-40, 4-116, 5-5, 5-12, 
6-94, 9-135 

Exercises 2-81, 2-97, 
2-194, 3-91, 3-108 

Exercise 10-40 


Exercise 8-82 

Exercise 13-49 

Exercise 2-45 

Examples 3-1, 3-14 
Exercises 2-17, 3-2, 3-85, 
3-105, 3-132, 3-155, 4-95, 
4-105, 4-111, 4-117, 4-160, 
5-78, 9-98, 15-9 


Exercises 2-68, 2-175, 
5-10, 5-34, 10-7 

Exercise 3-75 

Examples 3-25, 5-12, 5-13 
Exercises 2-32, 2-191, 
3-159, 4-87, 4-140, 5-6 


ELECTRONICS 


Automobile engine controller 
Bipolar transistor current 
Calculator circuit response 
Circuits 


Conductivity 
Current 


Drain and leakage current 
Electromagnetic energy absorption 
Error recovery procedures 

Inverter transient point 


Magnetic tape 
Nickel charge 
Parallel circuits 
Power consumption 


Power supply 


Printed circuit cards 


Redundant disk array 
Resistors 


Solder connections 

Strands of copper wire 

Surface charge 

Surface mount technology (SMT) 
Transistor life 

Voltage measurement errors 


ENERGY 


Consumption in Asia 
Enrichment percentage of reactor fuel rods 
Fuel octane ratings 


Gasoline cost by month 
Gasoline mileage 


Heating rate index 

Petroleum imports 

Released from cells 

Renewable energy consumption 
Steam usage 


Wind power 
ENVIRONMENTAL 


Arsenic 


Asbestos 

Biochemical oxygen demand (BOD) 
Calcium concentration in lake water 
Carbon dioxide in the atmosphere 
Chloride in surface streams 


Cloud seeding 
Earthquakes 


Examples 9-10, 9-11 
Exercise 14-7 
Exercises 13-6, 13-18 
Examples 2-35, 7-3 
Exercises 2-135, 2-136, 
2-170, 2-177, 2-190 
Exercise 12-105 
Examples 4-1, 4-5, 4-8, 4-9, 
4-12, 16-3 
Exercises 10-31, 15-30 
Exercises 13-41, 11-85 
Exercise 10-26 
Exercises 2-18, 2-166 
Exercises 12-98, 12-99, 
12-102 
Exercises 2-189, 3-125 
Exercises 2-61, 3-48 
Example 2-34 
Exercises 6-89, 11-79, 12-6, 
12-26, 12-44, 12-58, 12-80 
Example 9-13 
Exercises 2-3, 9-20, 9-21, 
9-22, 9-23, 9-24, 9-28 
Example 2-10 
Exercises 2-42, 3-122 
Exercise 2-127 
Example 7-1 
Exercise 6-86 
Exercises 3-1, 15-43, 15-45 
Exercise 2-77 
Exercise 14-15 
Example 16-5 
Exercise 7-51 
Exercise 4-48N 


Exercises 6-29, 6-45, 6-59 

Exercises 8-41, 8-71, 8-88 

Exercises 6-22, 6-26, 6-38, 
6-42, 6-58, 6-78, 10-7 

Exercise 15-98 

Exercises 10-89, 11-6, 
11-17, 11-28, 11-44, 
11-56, 12-27, 12-55, 
12-57, 12-77, 12-89, 15-37 

Exercise 14-46 

Exercise 6-72 

Exercise 2-168 

Exercise 15-78 

Exercises 11-5, 11-27, 
11-43, 11-55 

Exercises 4-132, 11-9 


Example 10-6 
Exercises 12-12, 12-30, 
12-48, 12-62, 12-76, 
12-88, 13-39 

Exercises 4-85, 4-169 

Exercises 11-13, 11-33, 11-51 

Exercise 8-9 

Exercise 3-58 

Exercises 11-10, 11-32, 
11-48, 11-59 

Exercise 9-60 

Exercises 6-63, 9-102, 
11-15, 15-46 


Emissions and fluoride emissions 
Global temperature 
Hydrophobic organic substances 
Mercury contamination 

Ocean wave height 

Organic pollution 

Oxygen concentration 

Ozone levels 

Radon release 

Rainfall in Australia 

Suspended solids in lake water 


Temperature in Phoenix, AZ 
Temperature of sewage discharge 
Voters and air pollution 

Waste water treatment tank 
Water demand and quality 
Watershed yield 


MATERIALS 


Baked density of carbon anodes 
Ceramic substrate 

Coating temperature 

Coating weight and surface roughness 
Compressive strength 

Flow rate on silicon wafers 

Insulation ability 

Insulation fluid breakdown time 

Izod impact test 


Luminescent ink 


Paint drying time 


Particle size 
Photoresist thickness 
Plastic breaking strength 
Polycarbonate plastic 
Rockwell hardness 
Temperature of concrete 
Tensile strength of 

Aluminum 

Fiber 

Steel 

Paper 


Titanium content 


Tube brightness in TV sets 


MECHANICAL 


Aircraft manufacturing 


Artillery shells 
Beam delamination 
Bearings 


Diameter 


Wear 


Exercises 2-28, 15-34 

Exercises 6-83, 11-74 

Exercise 10-93 

Example 8-4 

Exercise 4-181 

Example 3-18 

Exercises 8-94, 9-63, 9-140 

Exercises 2-9, 11-90 

Exercises 13-8, 13-20 

Exercises 8-33, 8-65, 8-77 

Exercises 6-32, 6-48, 6-60, 
6-80, 9-70 

Exercise 8-49 

Exercises 6-92, 6-97 

Exercises 9-27, 9-94 

Exercise 2-37 

Exercises 4-68, 9-137 

Exercise 11-70 


Exercise 14-4 
Example 16-4 
Exercises 10-24, 10-60 
Exercise 2-90 
Exercises 7-56, 11-60 
Exercises 13-2, 13-16, 15-28 
Exercise 14-5 
Exercises 6-8, 6-74 
Exercises 8-28, 8-62, 8-74, 
9-66, 9-80 
Exercise 5-28 
Examples 10-1, 10-2, 10-3 
Exercises 14-2, 14-19, 
15-8, 15-16 
Exercises 4-33, 16-17 
Exercise 5-63 
Exercises 10-5, 10-20, 10-55 
Example 2-8 
Exercises 2-66, 2-76 
Exercises 10-91, 9-115, 15-17 
Exercise 9-58 


Example 10-4 

Exercises 7-3, 7-4, 13-3, 
13-17 

Example 10-9 

Exercise 9-44 

Example 13-1 

Exercises 4-154, 11-86 

Exercises 8-47, 9-79, 15-2, 
15-12 

Exercises 7-12, 8-35, 8-67, 
8-79, 9-148, 9-67, 14-1 


Examples 6-6, 12-12, 14-1, 
15-6, 16-1 Exercises 6-8, 
8-97, 10-42, 15-31, 15-13, 
15-74 

Exercise 9-106 

Exercises 8-32, 8-64 

Examples 8-7, 8-8 
Exercise 9-95 

Exercises 4-181, 9-42, 
15-6, 15-14 

Example 4-25 
Exercises 5-22, 4-127, 
12-19, 12-39, 12-45, 12-67 
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